Text processing (Computer science)

 Subject
Subject Source: Library Of Congress Subject Headings

Found in 2 Collections and/or Records:

Preliminary investigations into the word categorization system of BERT, 2023

 Item — Call number MU Thesis Chi
Identifier: b7931714
Abstract: Bidirectional Encoder Representations from Transformers (BERT), introduced by Google, is a powerful natural language processing model as it is able to understand the meaning of words in a sentence in context. WordNet, developed at Princeton University, is a lexical database that shows semantic relationships between words. This thesis looks to investigate BERT’s word categorization system by looking at groups of example sentences given from related WordNet synsets. Because BERT allows a...
Dates: 2023

Text mining with enhanced named entity recognition, 2017

 Item — Call number MU Thesis Mal
Identifier: b7716610
Abstract: The goal of this research is to evaluate the usefulness of enhanced named entity recognition for text mining. Text mining is a subpart of data mining. It is the application of data mining techniques to texts in natural language such as English. The data for this project consists news [sic] articles and article titles extracted from the Web. Enhanced name-entity recognition is used to add additional information to the text. Named entities in the text are...
Dates: 2017