Mapping the Podcast Ecosystem with the Structured Podcast Research Corpus
Litterer, Benjamin, Jurgens, David, Card, Dallas
–arXiv.org Artificial Intelligence
Podcasts provide highly diverse content to a massive listener base through a unique on-demand modality. However, limited data has prevented large-scale computational analysis of the podcast ecosystem. To fill this gap, we introduce a massive dataset of over 1.1M podcast transcripts that is largely comprehensive of all English language podcasts available through public RSS feeds from May and June of 2020. This data is not limited to text, but rather includes audio features and speaker turns for a subset of 370K episodes, and speaker role inferences and other metadata for all 1.1M episodes. Using this data, we also conduct a foundational investigation into the content, structure, and responsiveness of this ecosystem. Together, our data and analyses open the door to continued computational research of this popular and impactful medium.
arXiv.org Artificial Intelligence
Nov-12-2024
- Country:
- Oceania > Australia (0.04)
- Africa (0.04)
- North America
- Canada (0.04)
- United States
- Michigan (0.04)
- District of Columbia > Washington (0.04)
- Texas (0.04)
- California (0.04)
- New York > New York County
- New York City (0.04)
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Spain > Catalonia
- Asia
- Singapore (0.04)
- India (0.04)
- China (0.04)
- Middle East
- Jordan (0.04)
- Israel (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.14)
- Republic of Türkiye > Batman Province
- Batman (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Leisure & Entertainment > Sports (1.00)
- Information Technology (0.93)
- Banking & Finance (0.68)
- Government (0.67)
- Media
- Health & Medicine > Therapeutic Area
- Infections and Infectious Diseases (0.46)
- Immunology (0.46)
- Education > Curriculum
- Subject-Specific Education (0.68)
- Technology: