Current issue: Vol. 6 2025


Nuttawut Thuayhanruksa and Pree Thiengburanathun

Published in Data Science and Engineering (DSE) Record 2025 Vol. 6 No. 1 pp. 1-30

This paper explores the application of various natural language processing (NLP) models for sentiment analysis on financial news articles sourced from Thai financial news websites, focusing on Thai-language data. The study evaluates machine learning and deep learning models, including Lo-gistic Regression, Bidirectional Long Short-Term Memory (Bi-LSTM), Con-volutional Neural Networks (CNN), WangChanBERTa, OpenAI’s GPT-3.5 and OpenThaiGPT. The models' performance is assessed using accuracy, precision, recall, and F1-score. The findings reveal that the Fine-tuned WangChanBERTa model achieved the highest accuracy of 0.84 on the test-ing set, demonstrating its superior ability in classifying sentiment in Thai financial news. BI-LSTM and CNN models also performed well, with test-ing accuracies of 0.781 and 0.791 In contrast, OpenAI’s GPT-3.5 and Open-ThaiGPT, which lacked fine-tuning and optimized prompts due to computa-tional constraints, exhibited practical limitations in resource-constrained settings.

Kitichart Nukaew and Arinya Pongwat

Published in Data Science and Engineering (DSE) Record 2025 Vol. 6 No. 1 pp. 31-55

Massive Open Online Courses (MOOCs) have seen continuous growth in popularity and rapid expansion. In the instructional design process, receiv-ing feedback from learners is crucial, as it helps tailor the content to better meet learners' needs. The application of NLP models in analyzing learners' feedback is an effective approach for extracting insights from a large volume of comments related to the courses. These models can categorize feedback into three distinct categories: course, instructors, and assessments. Addi-tionally, the models can predict the sentiment of the feedback, determining whether it is positive or negative. In developing these models, semi-supervised learning techniques have been employed to address the chal-lenge of limited data availability. Experimental results indicate that, for feedback categorization, a GRU model combined with tri-training with dis-agreement yields the highest prediction accuracy. Conversely, for sentiment analysis, a GRU model combined with tri-training produces the best out-comes.