mirror of
https://github.com/advplyr/audiobookshelf.git
synced 2026-05-30 23:40:40 +02:00
[Bug]: Audioboks with multiple cds are not properly added #530
Closed
opened 2026-04-24 23:12:10 +02:00 by adam
·
2 comments
No Branch/Tag Specified
master
book_tags_genres_dedupe
episode_download_fallback
Issue-4540-SortBy-StartedDate-and-FinishedDate
episode_meta_tagging
fix_authorize_race_condition
redirect_transcode_requests
progress_updated_sort
fix_ereader_socket_event
fix_change_empty_root_password
fix_podcast_session_track_index
fix_set_token
session_modal_user
localize_durations
fix_oidc_create_user
jwt_auth_refactor
fix_scanner_deleting_single_file_books
fix_mediaprogress_updatedat_2
experimental_next_client
podcast_episode_duration
episode-timestamps-clickable
book_author_secondary_sort_title
podcast_useragents
pathexists_user_access
fix_pathexists_join
book_author_secondary_sort
clean_duplicate_mediaprogress
sanitize_html_description
trix_prevent_attachments
check_path_api_fix
fix_mediaprogress_updatedat
increase_express_json_limit
fix_dockerfile_nunicode
search_episodes
audiobook_tools_update
episode_secondary_sorts
hls_stream_url_update
new_session_track_endpoint
audiobook_tools_enhancements
watcher_rescans_update
player_track_tooltip
fix_exclude_prefixes_crash
socket_item_events
fix_podcast_episode_scanner_promise
new_stats_controller
count_cache_for_userpermissions
parsing-opf-v3
validate_migration_files
fix-quick-match-all-crash
fix-chapter-end-sleep-timer
stringify_sequelize_query
remove-col-ambiguity
fix_next_prev_edit_description
details_trim_whitespace
fix_content_url_basepath
fix_logger_fatal
progress_bar_visibility
batch-edit-populate-map-details
feed_generator_updates
bookmark-modal-updates
migrate-library-item-in-scanner
migrate-new-library-items
migrate-podcasts-new-library-item-2
migrate-podcasts-new-library-item
fix-remove-episode-from-playlist
playback-session-use-new-library-item
refactor-library-item
fix-heatmap-caption
feed-episodes-upsert
share-media-player-media-session-api
remove-old-playlist
remove_old_collection_object
plugin-implementation-demo
feed_migration
refactor-feeds-from-item
fix_remove_authors_no_books
v2.17.3-fk-constraints-migration
migrations-first-upgrade
sqlite_2
feature/nuxt-target-server
waveform
sqlite
playlists
video
v2.35.1
v2.35.0
v2.34.0
v2.33.2
v2.33.1
v2.33.0
v2.32.1
v2.32.0
v2.31.0
v2.30.0
v2.29.0
v2.28.0
v2.27.0
v2.26.3
v2.26.2
v2.26.1
v2.26.0
v2.25.1
v2.25.0
v2.24.0
v2.23.0
v2.22.0
v2.21.0
v2.20.0
v2.19.5
v2.19.4
v2.19.3
v2.19.2
v2.19.1
v2.19.0
v2.18.1
v2.18.0
v2.17.7
v2.17.6
v2.17.5
v2.17.4
v2.17.3
v2.17.2
v2.17.1
v2.17.0
v2.16.2
v2.16.1
v2.16.0
v2.15.1
v2.15.0
v2.14.0
v2.13.4
v2.13.3
v2.13.2
v2.13.1
v2.13.0
v2.12.3
v2.12.2
v2.12.1
v2.12.0
v2.11.0
v2.10.1
v2.10.0
v2.9.0
v2.8.1
v2.8.0
v2.7.2
v2.7.1
v2.7.0
v2.6.0
v2.5.0
v2.4.4
v2.4.3
v2.4.2
v2.4.1
v2.4.0
v2.3.5
v2.3.4
v2.3.3
v2.3.2
v2.3.1
v2.3.0
v2.2.23
v2.2.22
v2.2.21
v2.2.20
v2.2.19
v2.2.18
v2.2.17
v2.2.16
v2.2.15
v2.2.14
v2.2.13
v2.2.12
v2.2.11
v2.2.10
v2.2.9
v2.2.8
v2.2.7
v2.2.6
v2.2.5
v2.2.4
v2.2.3
v2.2.2
v2.2.1
v2.2.0
v2.1.5
v2.1.4
v2.1.3
v2.1.2
v2.1.1
v2.1.0
v2.0.24
v2.0.23
v2.0.22
v2.0.21
v2.0.20
v2.0.19
v2.0.18
v2.0.17
v2.0.16
v2.0.15
v2.0.14
v2.0.13
v2.0.12
v2.0.11
v2.0.10
v2.0.9
v2.0.8
v2.0.7
v2.0.6
v2.0.5
v2.0.4
v2.0.3
v2.0.2
v2.0.1
v1.7.2
v1.7.1
v1.7.0
v1.6.0
v1.5.5
v1.5.0
v1.4.11
v1.4.9
v1.4.7
v1.4.6
v1.4.4
v1.4.2
v1.4.0
v1.4.1
v1.3.4
v1.3.3
v1.3.1
v1.2.8
v1.2.6
v1.2.5
v1.2.4
v1.2.1
v1.1.15
v1.1.14
v1.1.13
v1.1.12
v1.1.11
v1.1.10
v1.1.9
v1.1.8
v1.0.0
0.9.61-beta.0
0.9.61-beta
Labels
Clear labels
authentication
backlog
bug
chapter editor
config-issue
ebooks
encoding/embedding
enhancement
help wanted
listening sessions & progress
planned
possible plugin
progress sync
pull-request
sorting/filtering/searching
unable to reproduce
upload
users & permissions
waiting
Mirrored from GitHub Pull Request
No Label
bug
Milestone
No items
No Milestone
Projects
Clear projects
No project
Assignees
adam (Adam Melkus)
Clear assignees
No Assignees
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: starred/audiobookshelf#530
Reference in New Issue
Block a user
Blocking a user prevents them from interacting with repositories, such as opening or commenting on pull requests or issues. Learn more about blocking a user.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Originally created by @mediadev123 on GitHub (Jul 16, 2022).
Describe the issue
One audiobook has cd's subfolders as already described in https://github.com/advplyr/audiobookshelf/issues/393
What's special about my case is that the CDs seem to have a space in between them ("CD 1" instead of "CD1"), which causes the algorithm to not correctly label them. Grabbing through the code, it seems like the regex is all over the place and not really updated:
Here are lines where the incorrect regex is used:
https://github.com/advplyr/audiobookshelf/blob/d0af1c3c9ac49ae46d490a87d7f67754111e16f6/server/utils/scandir.js#L60
https://github.com/advplyr/audiobookshelf/blob/d0af1c3c9ac49ae46d490a87d7f67754111e16f6/server/utils/scandir.js#L111
Here an updated regex is used:
https://github.com/advplyr/audiobookshelf/blob/5446aea910b4d51c5320236421aa9f668b65bdc4/server/scanner/AudioFileScanner.js#L25
https://github.com/advplyr/audiobookshelf/blob/5446aea910b4d51c5320236421aa9f668b65bdc4/server/scanner/AudioFileScanner.js#L32
https://github.com/advplyr/audiobookshelf/blob/8894f5243972e10e2b400dd7c3bf338ca8867068/server/scanner/MediaFileScanner.js#L26https://github.com/advplyr/audiobookshelf/blob/8894f5243972e10e2b400dd7c3bf338ca8867068/server/scanner/MediaFileScanner.js#L33
But a few lines below another incorrect one is used again:
https://github.com/advplyr/audiobookshelf/blob/5446aea910b4d51c5320236421aa9f668b65bdc4/server/scanner/AudioFileScanner.js#L38 (no check for disc)
https://github.com/advplyr/audiobookshelf/blob/8894f5243972e10e2b400dd7c3bf338ca8867068/server/scanner/MediaFileScanner.js#L38-L39
I would highly suggest to create a constant for the regex and unify it to something like this:
Those regex would catch the following formats: "CD(any space any more times)(any number, from 0 to 999)", "disk(any space any more times)(any number, from 0 to 999)", "disc(any space any more times)(any number, from 0 to 999)". By using
\sinstead of a space, we allow any other weird space characters (like the non-line breaking, a tab or even a new line -- you never know). This allows multiple spaces (in case someone misnames the CD). This allows the alternate spelling of disc, disk. And by using a named group, the regex is more forward compatible in case there are more groups in the future:For example:
Lastly, it would be great if this regex is configurable so that users can add their own version easily, in case their audiobook uses a regional name for disc, such as disco in Spanish (I think). This is why I used a regex array instead of a more complex regex -- they scale way better IMO.
I would like to help but my javascript is not good enough for doing a refactor.
Steps to reproduce the issue
Audiobookshelf version
2.0.24_amd64.deb
How are you running audiobookshelf?
Debian/PPA
@advplyr commented on GitHub (Jul 23, 2022):
Can you share what your exact file path was that was not properly detected? Spaces are supported already when parsing the CD from the audio file name.
There is some confusion in your post about what the various regexes are doing for CD. One thing to note is the AudioFileScanner was removed and replaced with the MediaFileScanner but the code parsing the CD is the same.
The MediaFileScanner will attempt to parse out a track and/or cd number from the filename. The only purpose of this is to properly order the tracks so for example Track 10 CD 1 would go before Track 1 CD 2.
Example audio filenames supported
audiobook CD01.mp3audiobook CD 01.mp3audiobook Disc 1.mp3audiobook Disc10.mp3In order to support another file structure sometimes used, I added support for detecting CD subfolders.
For example
/Animal Farm/CD01/track1.mp3/Animal Farm/CD2/track1.mp3This is more restrictive in that it doesn't allow a space but that could be updated. I don't think this is what you are asking for though.
I would like to add support for customizing the scanner folder structure but there is a lot of planning that has to go into that since there are so many edge cases. It has been brought up a few times and I like the suggestion to use grok-style syntax for defining path patterns #774
@mediadev123 commented on GitHub (Aug 22, 2022):
Hello, sorry for taking so long to respond. I must have missed your notification. The complete paths are:
/mnt/media/audiobooks/AuthorFirstName AuthorLastName Collection/AuthorLastName, AuthorFirstName - AudiobookTitle/CD {1..13}/01 - Track 1.mp3You have already pointed out the difference between
MediaFileScannerandAudioFileScanner, but what exactly does scandir.js do then? Anyhow, I still think the regex should be more flexible while you work on #774 as it seems to be more complex.Thanks for your detailed answers, though!