Quotation Extraction consists of identifying quotations from a text and associating them to their authors. In this work, we present a Quotation Extraction system for Portuguese. Quotation Extraction has been previously approached using different techniques and for several languages. Our proposal differs from previous work since we use Machine Learning to automatically build specialized rules instead of human-derived rules. Machine Learning models usually present stronger generalization power compared to human-derived models. In addition, we are able to easily adapt our model to other languages, needing only a list of verbs of speech for a given language. The previously proposed systems would probably need a rule set adaptation to correctly classify the quotations, which would be time consuming. We tackle the Quotation Extraction task using one model for the Entropy Guided Transformation Learning algorithm and another one for the Structured Perceptron algorithm. In order to train and evaluate the system, we have build the GloboQuotes corpus, with news extracted from the globo.com portal.
Die Inhaltsangabe kann sich auf eine andere Ausgabe dieses Titels beziehen.
Graduated in 2008 from the Universidade Federal de Juiz de Fora (UFJF) in Computer Science. Has a Masters in Informatics from the Pontifícia Universidade Católica do Rio de Janeiro (PUC-Rio). Nowadays is PhD student in Informatics at PUC-Rio. His research focuses on Natural Language Processing, Information Extraction and Machine Learning.
„Über diesen Titel“ kann sich auf eine andere Ausgabe dieses Titels beziehen.
EUR 11,52 für den Versand von Vereinigtes Königreich nach USA
Versandziele, Kosten & DauerAnbieter: Revaluation Books, Exeter, Vereinigtes Königreich
Paperback. Zustand: Brand New. 64 pages. 8.66x5.91x0.15 inches. In Stock. Artikel-Nr. 3659288152
Anzahl: 1 verfügbar
Anbieter: moluna, Greven, Deutschland
Zustand: New. Artikel-Nr. 385766374
Anzahl: Mehr als 20 verfügbar
Anbieter: buchversandmimpf2000, Emtmannsberg, BAYE, Deutschland
Taschenbuch. Zustand: Neu. Neuware Books on Demand GmbH, Überseering 33, 22297 Hamburg 64 pp. Englisch. Artikel-Nr. 9783659288159
Anzahl: 2 verfügbar