mirror of
https://github.com/advplyr/audiobookshelf.git
synced 2026-05-30 23:40:40 +02:00
[PR #3996] [MERGED] Improve podcast library page query performance on title, titleIgnorePrefix, and addedAt sort orders #4142
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
📋 Pull Request Information
Original PR: https://github.com/advplyr/audiobookshelf/pull/3996
Author: @mikiher
Created: 2/16/2025
Status: ✅ Merged
Merged: 2/19/2025
Merged by: @advplyr
Base:
master← Head:optimize-podcast-queries📝 Commits (10+)
23a7502Add migration in preparation for podcast query optimizatione2f1aeeAdd numEpisodes to podcast model7282afcAdd podcastId to mediaProgress modelf1de307Update cached user whenever mediaProgress is removedda8fd2dSet podcastId when mediaProgress is createdf1e46a3Separate feed query from podcasts page query2e48ec0Use libraryItem.title[IgnorePrefix] for sorting podcasts page query707533dRemove numEpisodes subquery from podcasst page querycb9fc3eReplace numEpisodesIncomplete subquery with cached user progress calculationbd4f48eAdd required: true to includes in podcast episodes page query📊 Changes
13 files changed (+700 additions, -100 deletions)
View changed files
📝
server/Database.js(+4 -0)📝
server/controllers/PodcastController.js(+7 -1)📝
server/managers/PodcastManager.js(+8 -1)📝
server/migrations/changelog.md(+1 -0)➕
server/migrations/v2.19.4-improve-podcast-queries.js(+219 -0)📝
server/models/MediaProgress.js(+14 -1)📝
server/models/Podcast.js(+13 -1)📝
server/models/PodcastEpisode.js(+9 -1)📝
server/models/User.js(+11 -0)📝
server/scanner/PodcastScanner.js(+86 -69)📝
server/utils/queries/libraryFilters.js(+6 -3)📝
server/utils/queries/libraryItemsPodcastFilters.js(+57 -23)➕
test/server/migrations/v2.19.4-improve-podcast-queries.test.js(+265 -0)📄 Description
Brief summary
This PR tries to optimize some of the podcast library page load and scrolling database queries, following up on what's been done for the book library in #3952
Which issue is fixed?
Fixes #3965
In-depth Description
Podcast library page queries are more complex than book library page queries, because they aggregate data from podcast episodes, specifically
numEpisodes(the number of episodes that the podcast has), andnumEpisodesIncomplete(the number of episodes that the current user has not yet finished listening to).Before this change, these two virtual columns were obtained via per-row subqueries, making the podcast queries very inefficient, leading to very high library page query latency on large podcast libraries. In addition, the queries suffered from the same main issue that caused high latency in the book library page queries, namely the separation between the
libraryItems andpodcasts` tables.Resolution
The following changes affecting the main podcast library page query were made:
libraryItemslibraryItemsPodcastFiltersnumEpisodescolumn to thepodcaststablenumEpisodescould now be removedpodcastIdcolumn to themediaProgressestablenumEpisodesIncompletecould now be removednumEpisodesCompleteis calculated in-memory using the cached user recordnumEpisodesIncomplete = numEpisodes - numEpisodesCompleteIn addition, a couple of other small changes were implemented:
ANALYZEdatabase query at database initializationHow have you tested this?
Performance
The issues described above tend to be more noticable in large podcast libraries, so I tested this on a synthetic podcast library containing 1000 podcast and 130,000 podcast episodes. Like in the previous PR, I ran tests on a Synology 920+ NAS, which has a relatively weak hardware.
No non-default pragma values were applied.
I tested on the same sort orders as in #3952
Other sort orders were not optimized.
I'll post more detailed results later, but the overall effect on latency is very large:
This is an overall drop of ~99% or more in podcast library page load time (roughly similar to what we see in #3952).
Correctness
numPodcastsandnumPodcastsIncompleteare properly calculated/updaed.🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.