Journal of Science and Technology
URI vĩnh viễn cho bộ sưu tập này
Duyệt qua
Đang duyệt Journal of Science and Technology theo Chủ đề
Đang hiển thị 1 - 1 trong tổng số 1
Kết quả mỗi trang
Tùy chọn sắp xếp
- Tài liệuK-Medoids algorithm used for english sentiment classification in a distributed system(Trường Đại học Nguyễn Tất Thành, 2018-01-30) Vo, Ngoc Phu; Vo, Thi Ngoc TranIn this research, we have proposed a new model for Big Data sentiment classification in the parallel network environment – a Cloudera system with Hadoop Map (M) and Hadoop Reduce (R). Our new model has used a K-Medoids Algorithm (PAM) with multi-dimensional vector and 2,000,000 English documents of our English training data set for English document-level sentiment classification. Our new model can classify sentiment of millions of English documents based on many English documents in the parallel network environment. However, we tested our new model on our testing data set (including 1,000,000 English reviews, 500,000 positive and 500,000 negative) and achieved 85.98% accuracy.