| Field | Value |
|---|---|
| Graduate Student | 林政文 Lin, Cheng-Wen |
| Thesis Title | 基於類別語言模板之文章向量於文本分類研究 (Domain Knowledge Linguistic Pattern-based Document Representation for Text Classification) |
| Advisor | 許聞廉 Hsu, Wen-Lian |
| Committee Members | 張詠淳 Chang, Yung-Chun; 戴敏育 Day, Min-Yuh |
| Degree | Master |
| Institute | 電機資訊學院 - 資訊系統與應用研究所 (Institute of Information Systems and Applications) |
| Publication Year | 2019 |
| Academic Year of Graduation | 107 |
| Language | Chinese |
| Pages | 53 |
| Keywords (Chinese) | Linguistic Patterns, Document Vector Representation, Text Classification, Neural Networks, Textual Inference |
| Keywords (English) | Linguistic Pattern, Document Vector Representation, Text Classification, Deep Neural Network, Interpretable Inference |
In today's era, in which deep learning is the dominant paradigm, most natural language processing tasks have seen improved performance thanks to deep learning. Textual inference and comprehension, however, remain complex tasks: the attributions one can extract from a neural-network-based text classifier to explain its decisions often do not match the way humans think and offer poor interpretability. This thesis therefore builds on Linguistic Patterns, which are closer to the human reasoning process. For the text classification task, we use linguistic patterns as the rationale behind each document-level inference and combine them with current deep learning methods, so that the resulting text classification system achieves both high accuracy and human-intelligible inference. The study proceeds in three stages: class-specific linguistic pattern generation, linguistic-pattern-based document representation, and a neural-network-based text classification model. On a reader-emotion news corpus, the proposed method achieves 7% higher accuracy than the baseline; on a news-topic corpus, its F1-score shows a remarkable 20% improvement over the baseline.
Nowadays, the majority of Natural Language Processing (NLP) tasks have witnessed performance improvements due to advances in deep learning. However, logical inference and language understanding remain difficult tasks in NLP. Unlike the human thinking process, the outcomes produced by neural-network-based text classifiers are usually difficult to interpret directly, and are sometimes even unreasonable. We therefore present a method based on Linguistic Patterns, which are closer to the human thinking process and are easily readable. In this thesis, we combine linguistic-pattern-based and deep-learning-based methods to achieve both high performance and interpretable inference results. Our method comprises three major steps: Linguistic Pattern Generation for Domain Knowledge, Domain Knowledge Linguistic Pattern-based Document Representation, and a Text Classification Model based on Deep Neural Networks. Results show that our approach improves upon state-of-the-art methods on reader-emotion classification and news topic classification: we observe a 7% absolute increase in accuracy for emotion classification and a 20% absolute improvement in F1-score for the topic classifier.
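The three-stage pipeline described in the abstract can be sketched in miniature. Everything below is hypothetical: the pattern sets, the pattern format (plain regular expressions standing in for the thesis's learned class-specific templates), and the argmax scorer (a toy stand-in for the stage-three deep neural network) are illustrative assumptions, not the thesis's actual implementation.

```python
import re

# Hypothetical output of stage 1 (class-specific linguistic pattern
# generation): a few regex templates per class, purely for illustration.
CLASS_PATTERNS = {
    "sports": [r"championship", r"score(d|s)?\b", r"team"],
    "politics": [r"election", r"parliament", r"policy"],
}

def pattern_vector(document: str) -> list:
    """Stage 2 sketch: represent a document as per-pattern match counts,
    concatenated over classes in sorted label order."""
    vec = []
    for label in sorted(CLASS_PATTERNS):
        for pat in CLASS_PATTERNS[label]:
            vec.append(len(re.findall(pat, document, flags=re.IGNORECASE)))
    return vec

def explainable_predict(document: str) -> str:
    """Toy stand-in for stage 3: pick the class whose patterns fire most.
    The matched patterns themselves serve as the human-readable rationale."""
    scores = {
        label: sum(len(re.findall(p, document, flags=re.IGNORECASE))
                   for p in pats)
        for label, pats in CLASS_PATTERNS.items()
    }
    return max(scores, key=scores.get)

doc = "The team scored twice and won the championship."
print(pattern_vector(doc))       # → [0, 0, 0, 1, 1, 1]
print(explainable_predict(doc))  # → sports
```

In the thesis the pattern-count vector would feed a neural classifier rather than an argmax, but the interpretability argument is the same: each vector dimension corresponds to a named pattern, so a prediction can be traced back to the templates that matched.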