nlp

UNIMIB@ NEEL-IT: Named Entity Recognition and Linking of Italian Tweets

This paper describes the framework proposed by the UNIMIB Team for the task of Named Entity Recognition and Linking of Italian tweets (NEEL-IT). The proposed pipeline, which represents an entry level system, is composed of three main steps: (1) Named …

Adapting Named Entity Types to New Ontologies in a Microblogging Environment

Given the potential rise in the amount of user-generated content on social network, research efforts towards Information Extraction have significantly increased, giving leeway to the emergence of numerous *Named Entity Recognition* (NER) systems. …

A Multi-View Sentiment Corpus

Sentiment Analysis is a broad task that involves the analysis of various aspect of the natural language text. However, most of the approaches in the state of the art usually investigate independently each aspect, i.e. Subjectivity Classification, …

Towards adaptation of named entity classification

Numerous state-of-the-art **Named Entity Recognition** (NER) systems use different classification schemas/ontologies. Comparisons and integration among NER systems, thus, becomes complex. In this paper, we propose a transfer-learning approach where …

TWINE: A real-time system for TWeet analysis via INformation Extraction

In the recent years, the amount of user generated contents shared on the Web has significantly increased, especially in social media environment, e.g. Twitter, Facebook, Google+. This large quantity of data has generated the need of reactive and …

Deep learning and ensemble methods for Domain Adaptation

Real world applications of machine learning in natural language processing can span many different domains and usually require a huge effort for the annotation of domain specific training data. For this reason, domain adaptation techniques have …

Unsupervised Irony Detection: A Probabilistic Model with Word Embeddings

The automatic detection of figurative language, such as irony and sarcasm, is one of the most challenging tasks of Natural Language Processing (NLP). This is because machine learning methods can be easily misled by the presence of words that have a …