Text processing (Computer science)

Subject

Subject Source: Library of Congress Subject Headings

Found in 2 Collections and/or Records:

Preliminary investigations into the word categorization system of BERT, 2023

Item — Call number MU Thesis Chi

Identifier: b7931714

Abstract Bidirectional Encoder Representations from Transformers (BERT), introduced by Google, is a powerful natural language processing model as it is able to understand the meaning of words in a sentence in context. WordNet, developed at Princeton University, is a lexical database that shows semantic relationships between words. This thesis looks to investigate BERT’s word categorization system by looking at groups of example sentences given from related WordNet synsets. Because BERT allows a...

Dates: 2023

Found in: Monmouth University Library Archives / Monmouth University thesis collection : Master of Science in Computer Science program

Text mining with enhanced named entity recognition, 2017

Item — Call number MU Thesis Mal

Identifier: b7716610

Abstract The goal of this research is to evaluate the usefulness of enhanced named entity recognition for text mining. Text mining is a subpart of data mining. It is the application of data mining techniques to texts in natural language such as English. The data for this project consists news [sic] articles and article titles extracted from the Web. Enhanced name-entity recognition is used to add additional information to the text. Named entities in the text are...

Dates: 2017