Search Results for author: Jussi Karlgren

Found 20 papers, 1 paper with code

Challenging the Assumption of Structure-based embeddings in Few- and Zero-shot Knowledge Graph Completion

1 code implementation • LREC 2022 • Filip Cornell, Chenda Zhang, Jussi Karlgren, Sarunas Girdzijauskas

In this paper, we report experiments on Few- and Zero-shot Knowledge Graph completion, where the objective is to add missing relational links between entities into an existing Knowledge Graph with few or no previous examples of the relation in question.

Descriptive, Knowledge Graph Completion

Are We Wasting Time? A Fast, Accurate Performance Evaluation Framework for Knowledge Graph Link Predictors

no code implementations • 25 Jan 2024 • Filip Cornell, Yifei Jin, Jussi Karlgren, Sarunas Girdzijauskas

First, we empirically find and theoretically motivate why sampling uniformly at random vastly overestimates the ranking performance of a method.

Cem Mil Podcasts: A Spoken Portuguese Document Corpus For Multi-modal, Multi-lingual and Multi-Dialect Information Access Research

no code implementations • 23 Sep 2022 • Ekaterina Garmash, Edgar Tanaka, Ann Clifton, Joana Correia, Sharmistha Jat, Winstead Zhu, Rosie Jones, Jussi Karlgren

In this paper we describe the Portuguese-language podcast dataset we have released for academic research purposes.

Descriptive, Genre Classification

Unsupervised Speaker Diarization that is Agnostic to Language, Overlap-Aware, and Tuning Free

no code implementations • 25 Jul 2022 • M. Iftekhar Tanveer, Diego Casabuena, Jussi Karlgren, Rosie Jones

Podcasts are conversational in nature and speaker changes are frequent -- requiring speaker diarization for content understanding.

Speaker Diarization

The Contribution of Lyrics and Acoustics to Collaborative Understanding of Mood

no code implementations • 31 May 2022 • Shahrzad Naseri, Sravana Reddy, Joana Correia, Jussi Karlgren, Rosie Jones

We find that a pretrained transformer-based language model in a zero-shot setting -- i.e., out of the box with no further training on our data -- is powerful for capturing song-mood associations.

Language Modelling

Conventions and Mutual Expectations -- understanding sources for web genres

no code implementations • 1 May 2022 • Jussi Karlgren

Genres can be understood in many different ways.

Textual Stylistic Variation: Choices, Genres and Individuals

no code implementations • 1 May 2022 • Jussi Karlgren

This chapter argues for more informed target metrics for the statistical processing of stylistic variation in text collections.

Information Retrieval, Retrieval

Podcast Metadata and Content: Episode Relevance and Attractiveness in Ad Hoc Search

no code implementations • 25 Aug 2021 • Ben Carterette, Rosie Jones, Gareth F. Jones, Maria Eskevich, Sravana Reddy, Ann Clifton, Yongze Yu, Jussi Karlgren, Ian Soboroff

We describe a set of diverse podcast information needs and different approaches to assessing retrieved content for relevance.

Current Challenges and Future Directions in Podcast Information Access

no code implementations • 17 Jun 2021 • Rosie Jones, Hamed Zamani, Markus Schedl, Ching-Wei Chen, Sravana Reddy, Ann Clifton, Jussi Karlgren, Helia Hashemi, Aasish Pappu, Zahra Nazari, Longqi Yang, Oguz Semerci, Hugues Bouchard, Ben Carterette

Podcasts are spoken documents across a wide range of genres and styles, with growing listenership across the world, and a rapidly lowering barrier to entry for both listeners and creators.

How Lexical Gold Standards Have Effects On The Usefulness Of Text Analysis Tools For Digital Scholarship

no code implementations • 31 May 2021 • Jussi Karlgren

This paper describes how the current lexical similarity and analogy gold standards are built to conform to certain ideas about what the models they are designed to evaluate are used for.

High-dimensional distributed semantic spaces for utterances

no code implementations • 1 Apr 2021 • Jussi Karlgren, Pentti Kanerva

High-dimensional distributed semantic spaces have proven useful and effective for aggregating and processing visual, auditory, and lexical information for many tasks related to human-generated data.

Vocal Bursts Intensity Prediction

TREC 2020 Podcasts Track Overview

no code implementations • 29 Mar 2021 • Rosie Jones, Ben Carterette, Ann Clifton, Maria Eskevich, Gareth J. F. Jones, Jussi Karlgren, Aasish Pappu, Sravana Reddy, Yongze Yu

The Podcast Track is new at the Text Retrieval Conference (TREC) in 2020.

Information Retrieval, Retrieval

100,000 Podcasts: A Spoken English Document Corpus

no code implementations • COLING 2020 • Ann Clifton, Sravana Reddy, Yongze Yu, Aasish Pappu, Rezvaneh Rezapour, Hamed Bonab, Maria Eskevich, Gareth Jones, Jussi Karlgren, Ben Carterette, Rosie Jones

Paired with the audio files, they are also a resource for speech processing and the study of paralinguistic, sociolinguistic, and acoustic aspects of the domain.

Automatic Speech Recognition (ASR)

Text Mining for Processing Interview Data in Computational Social Science

no code implementations • 28 Nov 2020 • Jussi Karlgren, Renee Li, Eva M Meyersson Milgrom

We use commercially available text analysis technology to process interview text data from a computational social science study.

Clustering, Named Entity Recognition

The Spotify Podcast Dataset

no code implementations • 8 Apr 2020 • Ann Clifton, Aasish Pappu, Sravana Reddy, Yongze Yu, Jussi Karlgren, Ben Carterette, Rosie Jones

Podcasts are a relatively new form of audio media.

Team Harry Friberg at SemEval-2019 Task 4: Identifying Hyperpartisan News through Editorially Defined Metatopics

no code implementations • SEMEVAL 2019 • Nazanin Afsarmanesh, Jussi Karlgren, Peter Sumbler, Nina Viereckel

This report describes the starting point for a simple rule-based hypothesis testing exercise on identifying hyperpartisan news items carried out by the Harry Friberg team from Gavagai.

Two-sample testing

Inferring the location of authors from words in their texts

no code implementations • WS 2015 • Max Berggren, Jussi Karlgren, Robert Östling, Mikael Parkvall

This paper describes a series of experiments to determine how positionally annotated microblog posts can be used to learn location-indicating words which then can be used to locate blog texts and their authors.

Viewpoint and Topic Modeling of Current Events

no code implementations • 14 Aug 2016 • Kerry Zhang, Jussi Karlgren, Cheng Zhang, Jens Lagergren

There are multiple sides to every story, and while statistical topic models have been highly successful at topically summarizing the stories in corpora of text documents, they do not explicitly address the issue of learning the different sides, the viewpoints, expressed in the documents.

Topic Models

The Gavagai Living Lexicon

no code implementations • LREC 2016 • Magnus Sahlgren, Amaru Cuba Gyllensten, Fredrik Espinoza, Ola Hamfors, Jussi Karlgren, Fredrik Olsson, Per Persson, Akshay Viswanathan, Anders Holst

This paper presents the Gavagai Living Lexicon, which is an online distributional semantic model currently available in 20 different languages.

New Measures to Investigate Term Typology by Distributional Data

no code implementations • WS 2013 • Jussi Karlgren
