<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
<title>Master Thesis</title>
<link href="https://onlineresource.ucsy.edu.mm/handle/123456789/2230" rel="alternate"/>
<subtitle/>
<id>https://onlineresource.ucsy.edu.mm/handle/123456789/2230</id>
<updated>2026-06-08T09:19:20Z</updated>
<dc:date>2026-06-08T09:19:20Z</dc:date>
<entry>
<title>OPINION MINING SYSTEM OF CUSTOMER REVIEWS  BY USING FEATURE EXTRACTION</title>
<link href="https://onlineresource.ucsy.edu.mm/handle/123456789/2796" rel="alternate"/>
<author>
<name>LWIN, NANDAR MOH MOH</name>
</author>
<id>https://onlineresource.ucsy.edu.mm/handle/123456789/2796</id>
<updated>2023-08-09T15:25:24Z</updated>
<published>2023-07-01T00:00:00Z</published>
<summary type="text">OPINION MINING SYSTEM OF CUSTOMER REVIEWS  BY USING FEATURE EXTRACTION
LWIN, NANDAR MOH MOH
Due to the dramatic improvement of ecommerce, web sources which are&#13;
important for both potential customers and service providers rapidly emerge in &#13;
prediction and decision purposes. Opinion mining techniques become popular to &#13;
automatically process customer reviews by extracting features and user opinions &#13;
expressed over them. To overcome the task of manual scanning through the large &#13;
amount of one-by-one review, people have interested to automatically process the &#13;
various reviews and to provide the information which is useful for customers and &#13;
service providers. By applying dependency relations, it can properly identify the &#13;
semantic relationships between features and opinions of each review. It can find the &#13;
numeric score of all the features using SentiWordNet. This system is intended to &#13;
collect customer reviews from tourism field and then extract the related features and &#13;
opinions to rate the services. Finally, it can rank each agency according to the final &#13;
result of each review sentence. In this thesis, Standard Parser is used to generate the &#13;
features, opinions and the dependency relations for each trip review at the &#13;
preprocessing. The two methods of features extraction such as frequency-based &#13;
feature extraction and dependency grammar-based feature extraction are used to &#13;
extract the most relevant trip features. Moreover, SentiWordNet 3.0 is used to get the &#13;
positive score and negative score for each trip feature and then the system calculates &#13;
the total weight of the trip review by using these numeric scores. The objective of the &#13;
system is to rank the travel agencies according to the final weight of each travel &#13;
agency that is collected by adding the total weight of the trip reviews for that agency. &#13;
Therefore, the system implements efficiency and effectiveness in opinion mining to &#13;
express the reviewer’s opinion and feeling for next customers’ trip plans by using &#13;
features extraction. In this system, Tourism Reviews are applied as the case studies to &#13;
identify what elements of an agency affect sales most and what are the features the &#13;
customer like or dislike so that trip managers and agency owners can target on those &#13;
areas. The system is developed using Java language and MySQL to build the &#13;
database.
</summary>
<dc:date>2023-07-01T00:00:00Z</dc:date>
</entry>
<entry>
<title>THE CAR INSURANCE CLAIM PREDICTION  SYSTEM BY USING MACHINE LEARNING  ALGORITHMS ON APACHE SPARK PLATFORM</title>
<link href="https://onlineresource.ucsy.edu.mm/handle/123456789/2795" rel="alternate"/>
<author>
<name>Ko, Thein Than</name>
</author>
<id>https://onlineresource.ucsy.edu.mm/handle/123456789/2795</id>
<updated>2023-06-04T15:10:22Z</updated>
<published>2023-05-01T00:00:00Z</published>
<summary type="text">THE CAR INSURANCE CLAIM PREDICTION  SYSTEM BY USING MACHINE LEARNING  ALGORITHMS ON APACHE SPARK PLATFORM
Ko, Thein Than
Car insurance companies face a major challenge in dealing with insurance claims, &#13;
which are prone to fraud and increasing in volume. This makes it difficult for insurers &#13;
to classify claims during the review process. To address this issue, the aim of this study &#13;
is to develop four Car Insurance Claim Prediction Classifiers with Random Forest and &#13;
Logistic regression based on the car insurance claim dataset respectively and supports &#13;
for comparison which method and attributes are more suitable for car insurance &#13;
companies. Firstly, this proposed system creates a feature selection model using &#13;
Variance Threshold Selector method to select the important attributes impact on the &#13;
accuracy of car insurance claim prediction classifiers. The data set is split into training &#13;
with 80% and testing sets with 20% randomly and the two classifiers with all attributes, &#13;
the training dataset is used to create the LR classifier and RF classifier. For two &#13;
classifiers with the feature selection method, the system creates the new training dataset &#13;
and new testing dataset by removing low variance value of attributes using Variance &#13;
Threshold Selector method. After that, two LR classifier and RF classifier are been &#13;
created by using new datasets. The system has analyzed the different attributes: 30, 32, &#13;
34, 36, 38, 40 and 42 to choose the number of attributes and important attributes and &#13;
tested 10 times for each attribute number because of splitting training and testing &#13;
datasets randomly. Finally, the system compares the evaluation results with metrics: &#13;
accuracy and f score. RF classifiers with and without the feature selection method are &#13;
suitable for the proposed system than LR classifiers. Among different attribute &#13;
numbers, the classifiers based on 38 attributes and 40 attributes are the best classifiers &#13;
and classifier based on 42 attributes are the second best classifier.
</summary>
<dc:date>2023-05-01T00:00:00Z</dc:date>
</entry>
<entry>
<title>CLASSIFICATION OF BANK MARKETING DATA  USING SUPPORT VECTOR MACHINE</title>
<link href="https://onlineresource.ucsy.edu.mm/handle/123456789/2794" rel="alternate"/>
<author>
<name>Khin, Ei Ei</name>
</author>
<id>https://onlineresource.ucsy.edu.mm/handle/123456789/2794</id>
<updated>2023-06-04T14:35:22Z</updated>
<published>2023-05-01T00:00:00Z</published>
<summary type="text">CLASSIFICATION OF BANK MARKETING DATA  USING SUPPORT VECTOR MACHINE
Khin, Ei Ei
Nowadays, banking system plays an important role of financial &#13;
sectors all over the world. The more accurate predictive modeling system &#13;
is required for their services or products in the banking industry. Bank &#13;
workers can make those predictive models with manually, but this process &#13;
takes long time and lots of man-hours. For these reasons, machine learning &#13;
techniques are useful to predict the outcomes with huge amounts of data. &#13;
Classification is an important technique to analyze and to predict the data. &#13;
This system will implement the classification of bank marketing data using &#13;
support vector machine (SVM) to predict the probability of the customers’ &#13;
subscription to the term deposit whether subscribe or not. Support Vector &#13;
Machine (SVM) is a supervised learning model used for classification and &#13;
prediction of data. The purpose of this system is to predict the customers' &#13;
response to the term 'deposit' using bank marketing data. The precision, &#13;
recall, and F-Measure confusion matrix is used to gauge the system's &#13;
correctness. In the first experiment when the training data is used, the &#13;
accuracy without feature engineering is 86%, the accuracy with feature &#13;
engineering is 83% and the accuracy with feature engineering of &#13;
Correlation Matrix and Principal Component Analysis gets 96%. In the &#13;
second experiment which is used the testing data, the accuracy without &#13;
feature engineering gets 85%, the accuracy with feature engineering before &#13;
using PCA is 83% and the accuracy after using PCA is 95%. The system &#13;
shows the best results in both training data and testing data after using the &#13;
Principal Component Analysis.
</summary>
<dc:date>2023-05-01T00:00:00Z</dc:date>
</entry>
<entry>
<title>MYANMAR ENTITY IDENTIFICATION FOR NATURAL LANGUAGE UNDERSTANDING USING BIDIRECTIONAL LONG SHORT TERM MEMORY (BiLSTM)</title>
<link href="https://onlineresource.ucsy.edu.mm/handle/123456789/2790" rel="alternate"/>
<author>
<name>PHWAY, SAUNG THAZIN</name>
</author>
<id>https://onlineresource.ucsy.edu.mm/handle/123456789/2790</id>
<updated>2023-02-17T06:20:22Z</updated>
<published>2023-01-01T00:00:00Z</published>
<summary type="text">MYANMAR ENTITY IDENTIFICATION FOR NATURAL LANGUAGE UNDERSTANDING USING BIDIRECTIONAL LONG SHORT TERM MEMORY (BiLSTM)
PHWAY, SAUNG THAZIN
Entity identification is an exacting function which has commonly&#13;
appropriate broad chunk of awareness in the course of feature engineering&#13;
and word list to attain great achievement.. Entity Identification (EI) is&#13;
indispensable of perceptive article character from basic input and resolve the&#13;
division the morphemes characterizes. This paper presents every Entities&#13;
Recognition (ER) for Myanmar language using Bidirectional Long Short&#13;
Term Memory (BiLSTM), eliminating the need for most feature construction.&#13;
Entity contains people, location, grouping, date_time_month, numerical&#13;
values, etc. Myanmar expression is still ambitious to analyze Name Entity&#13;
(NE) as well as familiar conversation so it bags of geographical instruction&#13;
towards noticeable items, never barrier explanation among words and none&#13;
capitalization comparable other languages. Myanmar Natural Language&#13;
Processing (NLP) is told to be closed growing along with has directly been&#13;
excruciating to be matured. Considering that logic, Entity Identification (EI)&#13;
entitled collection for Burma ER analysis is annually explained and built as&#13;
composing that monograph. The elucidate EI bulk is crucial for Myanmar&#13;
ER research’s improvement . For planned entity classification research, those&#13;
entity titled compilation is tested all the while entire the aimed evidence for&#13;
Burma ER and it will also be determined. By using BiLSTM based network&#13;
architecture, the best accuracy is achieved with 83.62%. Accordingly, here&#13;
task dispose of the aspect engineering development and does not demand to&#13;
acquire not only expression but also territory ability.
</summary>
<dc:date>2023-01-01T00:00:00Z</dc:date>
</entry>
</feed>
