Go to file
2024-01-03 15:02:51 +01:00
media_files_extractor.py Fix #6: Replace https://studio.youtube.com/channel/UC/music with https://youtube.com/audiolibrary 2024-01-03 14:43:30 +01:00
metadata_extracter.py Make output of metadata_extracter.py depends on the specified TAB 2024-01-03 14:54:35 +01:00
music.json Add {music, sound_effects}.json 2023-02-04 14:15:10 +01:00
README.md Precise current state of the repository in README.md 2024-01-03 15:02:51 +01:00
sound_effects.json Add {music, sound_effects}.json 2023-02-04 14:15:10 +01:00

YouTube Audio library extractor

If I remember correctly:

https://youtube.com/audiolibrary contains two categories: Music and Sound effects. Thanks to credentials provided in

SAPISIDHASH = 'CENSORED'
SECURE_3PSID = 'CENSORED'
SECURE_3PAPISID = 'CENSORED'
CHANNEL_ID = 'CENSORED'
ON_BEHALF_OF_USER = 'CENSORED'
, metadata_extractor.py extracts to music.json or sound_effects.json metadata of all entries, thanks to pagination, of a given category specified in
metadata_extracter.py Lines 14 to 15 in 5a6382ea00
# `TAB` can either be `Music` or `Sound effects`
TAB = 'Music'
.

Then as it is unclear how to easily proceed with low-level curl/requests to the download of the given sound tracks, media_files_extractor.py use Selenium to leverage previous step retrieved metadata to download one-by-one all the sound tracks to Selenium default downloads folder, which is on Linux ~/Downloads/.

The current issue with this repository is the stability, as at some point the algorithm stops being able to retrieve metadata or download correctly, cf #10.