Subject extraction and keyword extraction from text

Zerui Song

Subject extraction and keyword extraction from text.

Supervisors: Luciano Lavagno, Gianpiero Cabodi. Politecnico di Torino, Master's degree programme in Mechatronic Engineering (Ingegneria Meccatronica), 2021

License: Creative Commons Attribution Non-commercial No Derivatives.

Abstract:

LDA (Latent Dirichlet Allocation) is an unsupervised generative probabilistic topic model. Its inputs are a document collection and the number of topics; its output is the set of topics, expressed as probability distributions. It is often used for topic modeling, text classification, opinion mining, and related fields. It rests on one assumption: each document is treated as a bag of words, in which the words are independent and interchangeable, with no grammatical structure or order. The basic idea is that each document is composed of multiple topics, and each topic is described by multiple corresponding words.
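
As an illustration of the input and output described above, the following is a minimal sketch of LDA fitted with collapsed Gibbs sampling on a toy corpus. It is not taken from the thesis: the function name `lda_gibbs`, the hyperparameters `alpha` and `beta`, the iteration count, and the sample documents are all assumptions made for this example, and Gibbs sampling is only one of several possible inference methods.

```python
import random
from collections import Counter


def lda_gibbs(docs, num_topics, iters=200, alpha=0.1, beta=0.01, seed=0):
    """Toy collapsed Gibbs sampler for LDA (hypothetical helper).

    docs: list of tokenized documents (bags of words).
    Returns (theta, phi): per-document topic distributions and
    per-topic word distributions.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    vocab_size = len(vocab)

    # Count tables: document-topic, topic-word, and tokens per topic.
    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [Counter() for _ in range(num_topics)]
    topic_total = [0] * num_topics

    # Assign a random initial topic to every token.
    assign = []
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(num_topics)
            zs.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assign.append(zs)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assign[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                # Resample its topic from the conditional distribution.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assign[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1

    # Smoothed estimates of the two families of distributions.
    theta = [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in row]
        for row, doc in zip(doc_topic, docs)
    ]
    phi = [
        {w: (topic_word[k][w] + beta) / (topic_total[k] + vocab_size * beta)
         for w in vocab}
        for k in range(num_topics)
    ]
    return theta, phi


# Input: a document collection and the number of topics (toy data).
docs = [
    "motor torque control motor sensor".split(),
    "sensor control torque motor".split(),
    "topic word document topic model".split(),
    "document model word topic".split(),
]
theta, phi = lda_gibbs(docs, num_topics=2)

# Output: each topic as a probability distribution over words.
for k, dist in enumerate(phi):
    top = sorted(dist, key=dist.get, reverse=True)[:3]
    print(f"topic {k}: {top}")
```

Word order inside each document never enters the computation, which reflects the bag-of-words assumption: only the counts of words per document and per topic are used.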

Supervisors: Luciano Lavagno, Gianpiero Cabodi
Academic year: 2021/22
Publication type: Electronic
Number of pages: 63
Subjects:
Degree programme: Master's degree programme in Mechatronic Engineering (Ingegneria Meccatronica)
Degree class: New system > Master's degree > LM-25 - INGEGNERIA DELL'AUTOMAZIONE (Automation Engineering)
Collaborating companies: Not specified
URI: http://webthesis.biblio.polito.it/id/eprint/21280