Research & Publications

Peer-reviewed research contributions in AI, NLP, and machine learning, published in top-tier conferences and journals.

Published Papers

N/A

Citations

Venues

Collaborators

Featured Publications

Published

2023

Why Aren't We NER Yet? Artifacts of ASR Errors in Named Entity Recognition in Spontaneous Speech Transcripts

Piotr Szymanski and Lukasz Augustyniak and Mikolaj Morzy and Adrian Szymczak and Krzysztof Surdyk and Piotr Zelasko

ACL

🎙️💥 Ever wondered why AI still messes up names in conversations? This paper reveals the spectacular failure of name detection in spontaneous speech, showing it's not just speech recognition errors - spoken language is inherently messy and breaks traditional AI models! Bonus: they prove everyone's been measuring success wrong this whole time. 🤖📊

View full abstract

Transcripts of spontaneous human speech present a significant obstacle for traditional NER models. The lack of grammatical structure of spoken utterances and word errors introduced by the ASR make downstream NLP tasks challenging. In this paper, we examine in detail the complex relationship between ASR and NER errors which limit the ability of NER models to recover entity mentions from spontaneous speech transcripts. Using publicly available benchmark datasets (SWNE, Earnings-21, OntoNotes), we present the full taxonomy of ASR-NER errors and measure their true impact on entity recognition. We find that NER models fail spectacularly even if no word errors are introduced by the ASR. We also show why the F1 score is inadequate to evaluate NER models on conversational transcripts.

NERSpeech Processing

Research & Publications

Featured Publications

Why Aren't We NER Yet? Artifacts of ASR Errors in Named Entity Recognition in Spontaneous Speech Transcripts

Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark

This is the way: designing and compiling LEPISZCZE, a comprehensive {NLP

WER we are and WER we think we are

All Publications

Why Aren't We NER Yet? Artifacts of ASR Errors in Named Entity Recognition in Spontaneous Speech Transcripts

Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark

Electoral Agitation Data Set: The Use Case of the Polish Election

Electoral Agitation Dataset: The Use Case of the Polish Election

This is the way: designing and compiling LEPISZCZE, a comprehensive {NLP

Assessment of Massively Multilingual Sentiment Classifiers

Assessment of Massively Multilingual Sentiment Classifiers

Fact-checking: relevance assessment of references in the Polish political domain

Comprehensive analysis of aspect term extraction methods using various text embeddings

WER we are and WER we think we are

Punctuation Prediction in Spontaneous Conversations: Can We Mitigate {ASR

Political Advertising Dataset: the use case of the Polish 2020 Presidential Elections

Aspect Detection using Word and Char Embeddings with (Bi) {LSTM

Avaya Conversational Intelligence: {A

Towards Better Understanding of Spontaneous Conversations: Overcoming Automatic Speech Recognition Errors With Intent Recognition

Aspect Detection using Word and Char Embeddings with (Bi)LSTM and {CRF

Extracting Aspects Hierarchies using Rhetorical Structure Theory

WordNet2Vec: Corpora agnostic word vectorization method

Method for Aspect-Based Sentiment Annotation Using Rhetorical Analysis

Fast and Accurate - Improving Lexicon-Based Sentiment Classification with an Ensemble Methods

Comprehensive Study on Lexicon-based Ensemble Classification Sentiment Analysis

Sentiment Analysis for Polish Using Transfer Learning Approach

Belief Propagation Method for Word Sentiment in WordNet 3.0

Simpler is better? Lexicon-based ensemble sentiment classification beats supervised methods

An Approach to Sentiment Analysis of Movie Reviews: Lexicon Based vs. Classification

Research Interests