Podcast Index Database Dashboard
Authors: Eric Nantz
mastodon
twitter
linkedin
R-Podcast
R Weekly Highlights
Shiny Developer Series
Full Description:
Background
The Podcast Index is an independent and open catalog of podcasts feeds serving as the backbone of what is referred to as the Podcasting 2.0 initiative. The data contained in the Podcast Index is available through a robust REST API as well as a SQLite database updated every week.
The Podcast Index Database Dashboard is my attempt to give back to the amazing Podcasting 2.0 initiative. A key concept that drives the engagement and enthusiasm in this community is the unique ways each of us can contribute time, talent, and treasure to benefit everyone. The objectives of this dashboard are to highlight potential duplicate podcast entries as well as perform quality assessments of the index to highlight potential issues.
You can learn more about the present and future directions of this project in my interview with Adam Curry and Dave Jones on episode 179 of the Podcasting 2.0 show!
Tech Stack
Much like the ethos behind podcasting 2.0, the Podcast Index Database Dashboard is proudly built on the foundations of open-source:
- Quarto technical publishing system with the new capability of dashboards.
- The R project for statistical computing with the following amazing packages:
{reclin2}
: Record linkage toolkit for R.{pointblank}
: Data quality assessment and metadata reporting for data frames and database tables.{reactable}
: Interactive data tables for R, based on the React Table and made with{reactR}
.{reactablefmtr}
: Streamlined table styling and formatting for{reactable}
tables.{dplyr}
: A grammar of data manipulation
Analysis Pipeline
The duplicate records and data quality analysis pipelines are executed weekly (after the Podcast Index SQLite database is refreshed) as scheduled GitHub Action workflows. Visit the GitHub repository at https://github.com/rpodcast/pod-db-checker to find the following scripts:
duplicate_runner.R
: Performs duplication analysis and necessary data processing.pointblank_runner.R
: Execute data quality checks with the{pointblank}
package.
Table Type: interactive-HTML
Submission Type: Other
Table: PodcastIndex Dashboard 1.1.0
Code: GitHub - rpodcast/pod-db-dash: Podcast index database quality dashboard
Language: R
Industries: podcasting.
Packages: reactable, reactablefmtr