media_files_extractor.py | ||
metadata_extracter.py | ||
music.json | ||
README.md | ||
sound_effects.json |
YouTube Audio library extractor
If I remember correctly:
https://youtube.com/audiolibrary contains two categories: Music
and Sound effects
.
Thanks to credentials provided in
metadata_extracter.py
Lines 9 to 13 in 5a6382ea00
SAPISIDHASH = 'CENSORED' |
|
SECURE_3PSID = 'CENSORED' |
|
SECURE_3PAPISID = 'CENSORED' |
|
CHANNEL_ID = 'CENSORED' |
|
ON_BEHALF_OF_USER = 'CENSORED' |
metadata_extractor.py
extracts to music.json
or sound_effects.json
metadata of all entries, thanks to pagination, of a given category specified in
metadata_extracter.py
Lines 14 to 15 in 5a6382ea00
# `TAB` can either be `Music` or `Sound effects` |
|
TAB = 'Music' |
Then as it is unclear how to easily proceed with low-level curl/requests to the download of the given sound tracks, media_files_extractor.py
use Selenium to leverage previous step retrieved metadata to download one-by-one all the sound tracks to Selenium default downloads folder, which is on Linux ~/Downloads/
.
The current issue with this repository is the stability, as at some point the algorithm stops being able to retrieve metadata or download correctly, cf #10.