<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Unstructured Knowledge Integration | Language Technologies Lab</title><link>http://nlp.unibo.it/proposals_uki/</link><atom:link href="http://nlp.unibo.it/proposals_uki/index.xml" rel="self" type="application/rss+xml"/><description>Unstructured Knowledge Integration</description><generator>Hugo Blox Builder (https://hugoblox.com)</generator><language>en-us</language><lastBuildDate>Mon, 02 Mar 2026 00:00:00 +0000</lastBuildDate><image><url>http://nlp.unibo.it/media/icon_hu_7613a4a452ac7087.png</url><title>Unstructured Knowledge Integration</title><link>http://nlp.unibo.it/proposals_uki/</link></image><item><title>Multi-cultural Abusive and Hate Speech Detection</title><link>http://nlp.unibo.it/proposals_uki/hatespeech/</link><pubDate>Mon, 02 Mar 2026 00:00:00 +0000</pubDate><guid>http://nlp.unibo.it/proposals_uki/hatespeech/</guid><description>&lt;p>&lt;strong>Description:&lt;/strong>&lt;br>
What is attributable as abusive or hate speech depends on the given socio-cultural context.
The same text might be reputed offensive by a certain culture, allowed by another, and, in the most extreme case, legally prosecutable by a third one.
Our aim is to evaluate how machine learning model are affected by different definitions of abusive and hate speech to promote awareness in developing accurate abusive speech detection systems.&lt;/p>
&lt;p>&lt;strong>Contact:&lt;/strong> &lt;a href="mailto:federico.ruggeri6@unibo.it">Federico Ruggeri&lt;/a>, &lt;a href="mailto:k.korre@athenarc.gr">Katerina Korre&lt;/a>, &lt;a href="mailto:arianna.muti@unibocconi.it">Arianna Muti&lt;/a>&lt;/p>
&lt;p>&lt;strong>References:&lt;/strong>&lt;/p>
&lt;p>&lt;strong>Untangling Hate Speech Definitions: A Semantic Componential Analysis Across Cultures and Domains.&lt;/strong>&lt;br>
Katerina Korre, Arianna Muti, Federico Ruggeri, and Alberto Barrón-Cedeño. 2025.&lt;br>
In Findings of the Association for Computational Linguistics: NAACL 2025, pages 3184–3198, Albuquerque, New Mexico. Association for Computational Linguistics.&lt;br>
&lt;a href="https://doi.org/10.18653/v1/2025.findings-naacl.175" target="_blank" rel="noopener">DOI&lt;/a>
| &lt;a href="https://aclanthology.org/2025.findings-naacl.175.pdf" target="_blank" rel="noopener">PDF&lt;/a>&lt;/p></description></item><item><title>Text Classification with Guidelines Only</title><link>http://nlp.unibo.it/proposals_uki/clf_guidelines/</link><pubDate>Mon, 02 Mar 2026 00:00:00 +0000</pubDate><guid>http://nlp.unibo.it/proposals_uki/clf_guidelines/</guid><description>&lt;p>&lt;strong>Description:&lt;/strong>&lt;br>
The standard approach for training a machine learning model on a task is to provide an annotated dataset $(\mathcal{X}, \mathcal{Y})$.
The dataset is built by providing unlabeled data $\mathcal{X}$ to a group of annotators previously trained on a set of annotation guidelines $\mathcal{G}$.
Annotators label data $\mathcal{X}$ via a given class set $\mathcal{C}$.
The main issue of this approach is that annotators define the mapping from data $\mathcal{X}$ to the class set $\mathcal{C}$ via the guidelines $\mathcal{G}$, while machine learning models are trained to learn the same mapping without guidelines $\mathcal{G}$.
Consequently, these models can learn any kind of mapping from $\mathcal{X}$ to $\mathcal{C}$ that better fits given data.
Our idea is to directly provide guidelines $\mathcal{G}$ to models without any access to class labels during training.&lt;/p>
&lt;p>&lt;strong>Contact:&lt;/strong> &lt;a href="mailto:federico.ruggeri6@unibo.it">Federico Ruggeri&lt;/a>&lt;/p>
&lt;p>&lt;strong>References:&lt;/strong>&lt;/p>
&lt;p>&lt;strong>Let Guidelines Guide You: A Prescriptive Guideline-Centered Data Annotation Methodology&lt;/strong>&lt;br>
Federico Ruggeri, Eleonora Misino, Arianna Muti, Katerina Korre, Paolo Torroni, Alberto Barrón-Cedeño&lt;br>
September 2024&lt;br>
&lt;a href="https://arxiv.org/abs/2406.14099" target="_blank" rel="noopener">PDF&lt;/a>&lt;/p></description></item></channel></rss>