Search

Arianna Muti, Sara Gemelli, Emanuele Moscato, Emilie Francis, Amanda Cercas Curry, Flor Miriam Plaza-Del-Arco, Debora Nozza

Blue-haired, misandriche, rabiata: Tracing the Connotation of ‘Feminist(s)’ Across Time, Languages and Domains

Understanding how words shift in meaning is crucial for analyzing societal attitudes.In this study, we investigate the contextual …

Biased Tales: Cultural and Topic Bias in Generating Children’s Stories?

Stories play a pivotal role in human communication, shaping beliefs and morals, particularly in children. As parents increasingly rely …

Donya Rooein, Vilém Zouhar, Debora Nozza, Dirk Hovy

Hamidreza Saffari, Mohammadamin Shafiei, Donya Rooein, Debora Nozza

Measuring Gender Bias in Language Models in Farsi?

As Natural Language Processing models become increasingly embedded in everyday life, ensuring that these systems can measure and …

Hamidreza Saffari, Mohammadamin Shafiei, Donya Rooein, Francesco Pierri, Debora Nozza

Can I Introduce My Boyfriend to My Grandmother? Evaluating Large Language Models Capabilities on Iranian Social Norm Classification

Creating globally inclusive AI systems demands datasets reflecting diverse social norms. Iran, with its unique cultural blend, offers …

Emanuele Moscato, Arianna Muti, Debora Nozza

MilaNLP@Multilingual Counterspeech Generation: Evaluating Translation and Background Knowledge Filtering

We describe our participation in the Multilingual Counterspeech Generation shared task, which aims to generate a counternarrative to …

Fabio Pernisi, Giuseppe Attanasio, Debora Nozza

MONICA: Monitoring Coverage and Attitudes of Italian Measures in Response to COVID-19

Modern social media have long been observed as a mirror for public discourse and opinions. Especially in the face of exceptional …

Pieter Delobelle, Giuseppe Attanasio, Debora Nozza, Su Lin Blodgett, Zeerak Talat

Metrics for What, Metrics for Whom: Assessing Actionability of Bias Evaluation Metrics in NLP

This paper introduces the concept of actionability in the context of bias measures in natural language processing (NLP). We define …

Flor Miriam Plaza-Del-Arco, Debora Nozza, Marco Guerini, Jeffrey Sorensen, Marcos Zampieri

Countering Hateful and Offensive Speech Online - Open Challenges

In today’s digital age, hate speech and offensive speech online pose a significant challenge to maintaining respectful and inclusive …

Proceedings of the 8th Workshop on Online Abuse and Harms (WOAH 2024)

Digital technologies have brought many benefits for society, transforming how people connect, communicate and interact with each other. …

Yi-Ling Chung, Zeerak Talat, Debora Nozza, Flor Miriam Plaza-del-Arco, Paul Röttger, Aida Mostafazadeh Davani, Agostina Calabrese

Proceedings of the 5th Workshop on Gender Bias in Natural Language Processing (GeBNLP)

This volume contains the proceedings of the Fifth Workshop on Gender Bias in Natural Language Processing held in conjunction with the …

Agnieszka Falenska, Christine Basta, Marta Costa-jussà, Seraphina Goldfarb-Tarrant, Debora Nozza

Overview of the Shared Task on Machine Translation Gender Bias Evaluation with Multilingual Holistic Bias

We describe the details of the Shared Task of the 5th ACL Workshop on Gender Bias in Natural Language Processing (GeBNLP 2024). The …

Marta Costa-jussà, Pierre Andrews, Christine Basta, Juan Ciro, Agnieszka Falenska, Seraphina Goldfarb-Tarrant, Rafael Mosquera, Debora Nozza, Eduardo Sánchez

Mattia Setzu, Marta Marchiori Manerba, Pasquale Minervini, Debora Nozza

FairBelief - Assessing Harmful Beliefs in Language Models

Language Models (LMs) have been shown to inherit undesired biases that might hurt minorities and underrepresented groups if such …

Giuseppe Attanasio, Flor Miriam Plaza Del Arco, Debora Nozza, Anne Lauscher

A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation

Recent instruction fine-tuned models can solve multiple NLP tasks when prompted to do so, with machine translation (MT) being a …

Flor Miriam Plaza-del-Arco, Debora Nozza, Dirk Hovy

Wisdom of Instruction-Tuned Language Model Crowds. Exploring Model Label Variation

Large Language Models (LLMs) exhibit remarkable text classification capabilities, excelling in zero- and few-shot learning (ZSL and …

What about ''em''? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns

As 3rd-person pronoun usage shifts to include novel forms, e.g., neopronouns, we need more research on identity-inclusive NLP. …

Anne Lauscher, Debora Nozza, Ehm Miltersen, Archie Crowley, Dirk Hovy

The State of Profanity Obfuscation in Natural Language Processing Scientific Publications

Work on hate speech has made considering rude and harmful examples in scientific publications inevitable. This situation raises various …

Debora Nozza, Dirk Hovy

Flor Miriam Plaza-del-Arco, Debora Nozza, Dirk Hovy

Respectful or Toxic? Using Zero-Shot Learning with Language Models to Detect Hate Speech

Hate speech detection faces two significant challenges: 1) the limited availability of labeled data and 2) the high variability of hate …

Amanda Cercas Curry, Giuseppe Attanasio, Debora Nozza, Dirk Hovy

MilaNLP at SemEval-2023 Task 10: Ensembling Domain-Adapted and Regularized Pretrained Language Models for Robust Sexism Detection

We present the system proposed by the MilaNLP team for the Explainable Detection of Online Sexism (EDOS) shared task. We propose an …

PDF Code Project

A Multi-dimensional study on Bias in Vision-Language models

In recent years, joint Vision-Language (VL) models have increased in popularity and capability. Very few studies have attempted to …

Gabriele Ruggeri, Debora Nozza

Davide Locatelli, Greta Damo, Debora Nozza

A Cross-Lingual Study of Homotransphobia on Twitter

We present a cross-lingual study of homotransphobia on Twitter, examining the prevalence and forms of homotransphobic content in tweets …

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

Machine learning models are now able to convert user-written text descriptions into naturalistic images. These models are available to …

Federico Bianchi, Pratyusha Kalluri, Esin Durmus, Faisal Ladhak, Myra Cheng, Debora Nozza, Tatsunori Hashimoto, Dan Jurafsky, James Zou, Aylin Caliskan

Giuseppe Attanasio, Eliana Pastor, Chiara Di Bonaventura, Debora Nozza

ferret: a Framework for Benchmarking Explainers on Transformers

As Transformers are increasingly relied upon to solve complex NLP problems, there is an increased need for their decisions to be …

Samia Touileb, Debora Nozza

Measuring Harmful Representations in Scandinavian Language Models

Scandinavian countries are perceived as role-models when it comes to gender equality. With the advent of pre-trained language models …

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

Machine learning models are now able to convert user-written text descriptions into naturalistic images. These models are available to …

Federico Bianchi, Pratyusha Kalluri, Esin Durmus, Faisal Ladhak, Myra Cheng, Debora Nozza, Tatsunori Hashimoto, Dan Jurafsky, James Zou, Aylin Caliskan

Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages

Hate speech is a global phenomenon, but most hate speech datasets so far focus on English-language content. This hinders the …

Paul Röttger, Debora Nozza, Federico Bianchi, Dirk Hovy

The State of Profanity Obfuscation in Natural Language Processing Scientific Publications

Work on hate speech has made the consideration of rude and harmful examples in scientific publications inevitable. This raises various …

Debora Nozza, Dirk Hovy

Giuseppe Attanasio, Debora Nozza, Federico Bianchi, Dirk Hovy

Is It Worth the (Environmental) Cost? Limited Evidence for the Benefits of Diachronic Continuous Training

Language is constantly changing and evolving, leaving language models to quickly become outdated, both factually and linguistically. …

Giuseppe Attanasio, Eliana Pastor, Chiara Di Bonaventura, Debora Nozza

ferret: a Framework for Benchmarking Explainers on Transformers

Many interpretability tools allow practitioners and researchers to explain Natural Language Processing systems. However, each tool …

Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models

Hate speech detection models are typically evaluated on held-out test sets. However, this risks painting an incomplete and potentially …

Paul Röttger, Haitham Seelawi, Debora Nozza, Zeerak Talat, Bertie Vidgen

Debora Nozza, Federico Bianchi, Giuseppe Attanasio

HATE-ITA: Hate Speech Detection in Italian Social Media Text

Online hate speech is a dangerous phenomenon that can (and should) be promptly counteracted properly. While Natural Language Processing …

PDF Code Project Poster Slides

MilaNLP at SemEval-2022 Task 5: Using Perceiver IO for Detecting Misogynous Memes with Text and Image Modalities

In this paper, we describe the system proposed by the MilaNLP team for the Multimedia Automatic Misogyny Identification (MAMI) …

Giuseppe Attanasio, Debora Nozza, Federico Bianchi

Federico Bianchi, Debora Nozza, Dirk Hovy

XLM-EMO: Multilingual Emotion Prediction in Social Media Text

Detecting emotion in text allows social and computational scientists to study how people behave and react to online events. However, …

Debora Nozza, Pikakshi Manchanda, Elisabetta Fersini, Matteo Palmonari, Enza Messina

Exposing the limits of Zero-shot Cross-lingual Hate Speech Detection

Reducing and counter-acting hate speech on Social Media is a significant concern. Most of the proposed automatic methods are conducted …

Debora Nozza

Project Poster Slides

LearningToAdapt with word embeddings: Domain adaptation of Named Entity Recognition systems

The task of Named Entity Recognition (NER) is aimed at identifying named entities in a given text and classifying them into pre-defined …

Debora Nozza, Federico Bianchi, Dirk Hovy

HONEST: Measuring Hurtful Sentence Completion in Language Models

Language models have revolutionized the field of NLP. However, language models capture and proliferate hurtful stereotypes, especially …

Tommaso Fornaciari, Federico Bianchi, Debora Nozza, Dirk Hovy

MilaNLP @ WASSA: Does BERT Feel Sad When You Cry?

The paper describes the MilaNLP team’s submission (Bocconi University, Milan) in the WASSA 2021 Shared Task on Empathy Detection and …

Federico Bianchi, Debora Nozza, Dirk Hovy

FEEL-IT: Emotion and Sentiment Classification for the Italian Language

Sentiment analysis is a common task to understand people’s reactions online. Still, we often need more nuanced information: is …

Elisabetta Fersini, Debora Nozza, Paolo Rosso

Cross-lingual Contextualized Topic Models with Zero-shot Learning

We introduce a novel topic modeling method that can make use of contextulized embeddings (e.g., BERT) to do zero-shot cross-lingual …

Federico Bianchi, Silvia Terragni, Dirk Hovy, Debora Nozza, Elisabetta Fersini

PDF Code Slides Blog Post

AMI @ EVALITA2020: Automatic Misogyny Identification

Automatic Misogyny Identification (AMI) is a shared task proposed at the Evalita 2020 evaluation campaign. The AMI challenge, based on …

PDF Code Dataset Project Video

Which Matters Most? Comparing the Impact of Concept and Document Relationships in Topic Models

Topic models have been widely used to discover hidden topics in a collection of documents. In this paper, we propose to investigate the …

Silvia Terragni, Debora Nozza, Elisabetta Fersini, Enza Messina

Elisabetta Fersini, Debora Nozza, Giulia Boifava

Profiling Italian Misogynist: An Empirical Study

Hate speech may take different forms in online social environments. In this paper, we address the problem of automatic detection of …

Debora Nozza, Federico Bianchi, Dirk Hovy

What the [MASK]? Making Sense of Language-Specific BERT Models

Recently, Natural Language Processing (NLP) has witnessed an impressive progress in many areas, due to the advent of novel, pretrained …

PDF Code Project Source Document

CAGE: Constrained deep Attributed Graph Embedding

In this paper we deal with complex attributed graphs which can exhibit rich connectivity patterns and whose nodes are often associated …

Debora Nozza, Claudia Volpetti, Elisabetta Fersini

Source Document

Unintended Bias in Misogyny Detection

During the last years, the phenomenon of hate against women increased exponentially especially in online environments such as …

PDF Dataset Project

Word Embeddings for Unsupervised Named Entity Linking

The huge amount of textual user-generated content on the Web has incredibly grown in the last decade, creating new relevant …

Debora Nozza, Cezar Sas, Elisabetta Fersini, Enza Messina

Elisabetta Fersini, Debora Nozza, Paolo Rosso

SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter

The paper describes the organization of the SemEval 2019 Task 5 about the detection of hate speech against immigrants and women in …

Valerio Basile, Cristina Bosco, Elisabetta Fersini, Debora Nozza, Viviana Patti, Francisco Rangel, Paolo Rosso, Manuela Sanguinetti

PDF Code Dataset Project Source Document

Overview of the Evalita 2018 Task on Automatic Misogyny Identification (AMI)

Automatic Misogyny Identification (AMI) is a new shared task proposed for the first time at the Evalita 2018 evaluation campaign. The …

PDF Dataset Project

Mapping Natural Language Labels to Structured Web Resources

Mapping natural language terms to a Web knowledge base enriches information systems without additional context, with new relations and …

Valerio Basile, Elena Cabrio, Fabien Gandon, Debora Nozza

Source Document

UNIMIB@ NEEL-IT: Named Entity Recognition and Linking of Italian Tweets

This paper describes the framework proposed by the UNIMIB Team for the task of Named Entity Recognition and Linking of Italian tweets …

Flavio Massimiliano Cecchini, Elisabetta Fersini, Pikakshi Manchanda, Enza Messina, Debora Nozza, Matteo Palmonari, Cezar Sas

Federico Bianchi, Matteo Palmonari, Debora Nozza

Towards encoding time in text-based entity embeddings

Knowledge Graphs (KG) are widely used abstractions to represent entity-centric knowledge. Approaches to embed entities, entity types …

Slides Source Document

Adapting Named Entity Types to New Ontologies in a Microblogging Environment

Given the potential rise in the amount of user-generated content on social network, research efforts towards Information Extraction …

Elisabetta Fersini, Pikakshi Manchanda, Enza Messina, Debora Nozza, Matteo Palmonari

PDF DOI

TWINE: A real-time system for TWeet analysis via INformation Extraction

In the recent years, the amount of user generated contents shared on the Web has significantly increased, especially in social media …

Debora Nozza, Fausto Ristagno, Matteo Palmonari, Elisabetta Fersini, Pikakshi Manchanda, Enza Messina

Pikakshi Manchanda, Elisabetta Fersini, Matteo Palmonari, Debora Nozza, Enza Messina

Towards adaptation of named entity classification

Numerous state-of-the-art Named Entity Recognition (NER) systems use different classification schemas/ontologies. Comparisons and …

PDF DOI

A Multi-View Sentiment Corpus

Sentiment Analysis is a broad task that involves the analysis of various aspect of the natural language text. However, most of the …

PDF Dataset Source Document

Unsupervised Irony Detection: A Probabilistic Model with Word Embeddings

The automatic detection of figurative language, such as irony and sarcasm, is one of the most challenging tasks of Natural Language …

Deep learning and ensemble methods for Domain Adaptation

Real world applications of machine learning in natural language processing can span many different domains and usually require a huge …

Debora Nozza, Daniele Maccagnola, Vincent Guigue, Enza Messina, Patrick Gallinari

A latent representation model for sentiment analysis in heterogeneous social networks

The growing availability of social media platforms, in particular microblogs such as Twitter, opened new way to people for expressing …