Supervised by: Ministry of Culture of PRC

Sponsored by:National Library of China
  Library Society of China

ISSN 1001-8867    CN 11-2746/G2

Teaching Natural Language Processing through Big Data Text Summarization with Problem-Based Learning

Abstract: Natural language processing (NLP) coversa large number of topics and tasks related to data andinformation management, leading to a complex andchallenging teaching process. Meanwhile, problem-basedlearning is a teaching technique specifically designed tomotivate students to learn efficiently, work collaboratively,and communicate effectively. With this aim, we developeda problem-based learning course for both undergraduateand graduate students to teach NLP. We providedstudent teams with big data sets, basic guidelines, cloudcomputing resources, and other aids to help differentteams in summarizing two types of big collections:Web pages related to events, and electronic theses anddissertations (ETDs). Student teams then deployeddifferent libraries, tools, methods, and algorithms to solvethe task of big data text summarization. Summarization isan ideal problem to address learning NLP since it involvesall levels of linguistics, as well as many of the tools andtechniques used by NLP practitioners. The evaluationresults showed that all teams generated coherent andreadable summaries. Many summaries were of high qualityand accurately described their corresponding eventsor ETD chapters, and the teams produced them alongwith NLP pipelines in a single semester. Further, bothundergraduate and graduate students gave statisticallysignificant positive feedback, relative to other coursesin the Department of Computer Science. Accordingly,we encourage educators in the data and informationmanagement field to use our approach or similar methodsin their teaching and hope that other researchers will alsouse our data sets and synergistic solutions to approachthe new and challenging tasks we addressed.

Keywords: information system education, computerscience education, problem-based learning, naturallanguage processing, NLP, big data text analytics,machine learning, deep learning.


垦利县| 都匀市| 寻甸| 黑龙江省| 济阳县| 古蔺县| 汝阳县| 滕州市| 微博| 武川县| 安阳县| 榆树市| 闵行区| 县级市| 军事| 新宾| 哈密市| 高安市| 利津县| 离岛区| 维西| 宜城市| 天台县| 宁河县| 仲巴县| 临颍县| 濉溪县| 常宁市| 红安县| 安庆市| 平塘县| 汉沽区| 平舆县| 宜兰市| 澄迈县| 永新县| 庆阳市| 广汉市| 武宣县| 高平市| 涡阳县|