1 code implementation • LREC 2022 • Filip Cornell, Chenda Zhang, Jussi Karlgren, Sarunas Girdzijauskas
In this paper, we report experiments on Few- and Zero-shot Knowledge Graph completion, where the objective is to add missing relational links between entities into an existing Knowledge Graph with few or no previous examples of the relation in question.
no code implementations • 25 Jan 2024 • Filip Cornell, Yifei Jin, Jussi Karlgren, Sarunas Girdzijauskas
First, we empirically find and theoretically motivate why sampling uniformly at random vastly overestimates the ranking performance of a method.
no code implementations • 23 Sep 2022 • Ekaterina Garmash, Edgar Tanaka, Ann Clifton, Joana Correia, Sharmistha Jat, Winstead Zhu, Rosie Jones, Jussi Karlgren
In this paper we describe the Portuguese-language podcast dataset we have released for academic research purposes.
no code implementations • 25 Jul 2022 • M. Iftekhar Tanveer, Diego Casabuena, Jussi Karlgren, Rosie Jones
Podcasts are conversational in nature and speaker changes are frequent -- requiring speaker diarization for content understanding.
no code implementations • 31 May 2022 • Shahrzad Naseri, Sravana Reddy, Joana Correia, Jussi Karlgren, Rosie Jones
We find that a pretrained transformer-based language model in a zero-shot setting -- i.e., out of the box with no further training on our data -- is powerful for capturing song-mood associations.
no code implementations • 1 May 2022 • Jussi Karlgren
Genres can be understood in many different ways.
no code implementations • 1 May 2022 • Jussi Karlgren
This chapter argues for more informed target metrics for the statistical processing of stylistic variation in text collections.
no code implementations • 25 Aug 2021 • Ben Carterette, Rosie Jones, Gareth F. Jones, Maria Eskevich, Sravana Reddy, Ann Clifton, Yongze Yu, Jussi Karlgren, Ian Soboroff
We describe a set of diverse podcast information needs and different approaches to assessing retrieved content for relevance.
no code implementations • 17 Jun 2021 • Rosie Jones, Hamed Zamani, Markus Schedl, Ching-Wei Chen, Sravana Reddy, Ann Clifton, Jussi Karlgren, Helia Hashemi, Aasish Pappu, Zahra Nazari, Longqi Yang, Oguz Semerci, Hugues Bouchard, Ben Carterette
Podcasts are spoken documents across a wide range of genres and styles, with growing listenership across the world, and a rapidly lowering barrier to entry for both listeners and creators.
no code implementations • 31 May 2021 • Jussi Karlgren
This paper describes how the current lexical similarity and analogy gold standards are built to conform to certain ideas about what the models they are designed to evaluate are used for.
no code implementations • 1 Apr 2021 • Jussi Karlgren, Pentti Kanerva
High-dimensional distributed semantic spaces have proven useful and effective for aggregating and processing visual, auditory, and lexical information for many tasks related to human-generated data.
no code implementations • 29 Mar 2021 • Rosie Jones, Ben Carterette, Ann Clifton, Maria Eskevich, Gareth J. F. Jones, Jussi Karlgren, Aasish Pappu, Sravana Reddy, Yongze Yu
The Podcast Track is new at the Text Retrieval Conference (TREC) in 2020.
no code implementations • COLING 2020 • Ann Clifton, Sravana Reddy, Yongze Yu, Aasish Pappu, Rezvaneh Rezapour, Hamed Bonab, Maria Eskevich, Gareth Jones, Jussi Karlgren, Ben Carterette, Rosie Jones
Paired with the audio files, they are also a resource for speech processing and the study of paralinguistic, sociolinguistic, and acoustic aspects of the domain.
no code implementations • 28 Nov 2020 • Jussi Karlgren, Renee Li, Eva M Meyersson Milgrom
We use commercially available text analysis technology to process interview text data from a computational social science study.
no code implementations • 8 Apr 2020 • Ann Clifton, Aasish Pappu, Sravana Reddy, Yongze Yu, Jussi Karlgren, Ben Carterette, Rosie Jones
Podcasts are a relatively new form of audio media.
no code implementations • SEMEVAL 2019 • Nazanin Afsarmanesh, Jussi Karlgren, Peter Sumbler, Nina Viereckel
This report describes the starting point for a simple rule-based hypothesis testing exercise on identifying hyperpartisan news items carried out by the Harry Friberg team from Gavagai.
no code implementations • WS 2015 • Max Berggren, Jussi Karlgren, Robert Östling, Mikael Parkvall
This paper describes a series of experiments to determine how positionally annotated microblog posts can be used to learn location-indicating words which then can be used to locate blog texts and their authors.
no code implementations • 14 Aug 2016 • Kerry Zhang, Jussi Karlgren, Cheng Zhang, Jens Lagergren
There are multiple sides to every story, and while statistical topic models have been highly successful at topically summarizing the stories in corpora of text documents, they do not explicitly address the issue of learning the different sides, the viewpoints, expressed in the documents.
no code implementations • LREC 2016 • Magnus Sahlgren, Amaru Cuba Gyllensten, Fredrik Espinoza, Ola Hamfors, Jussi Karlgren, Fredrik Olsson, Per Persson, Akshay Viswanathan, Anders Holst
This paper presents the Gavagai Living Lexicon, which is an online distributional semantic model currently available in 20 different languages.