Benjamin_Loison/YouTube_Audio_library_extractor

Go to file

Benjamin Loison 49c5e29c39

Precise current state of the repository in README.md

2024-01-03 15:02:51 +01:00

media_files_extractor.py

Fix #6 : Replace https://studio.youtube.com/channel/UC/music with https://youtube.com/audiolibrary

2024-01-03 14:43:30 +01:00

metadata_extracter.py

Make output of metadata_extracter.py depends on the specified TAB

2024-01-03 14:54:35 +01:00

music.json

Add {music, sound_effects}.json

2023-02-04 14:15:10 +01:00

README.md

Precise current state of the repository in README.md

2024-01-03 15:02:51 +01:00

sound_effects.json

Add {music, sound_effects}.json

2023-02-04 14:15:10 +01:00

README.md

YouTube Audio library extractor

If I remember correctly:

https://youtube.com/audiolibrary contains two categories: Music and Sound effects. Thanks to credentials provided in

		metadata_extracter.py
		Lines 9 to 13 in 5a6382ea00
	
				SAPISIDHASH = 'CENSORED'

				SECURE_3PSID = 'CENSORED'

				SECURE_3PAPISID = 'CENSORED'

				CHANNEL_ID = 'CENSORED'

				ON_BEHALF_OF_USER = 'CENSORED'

, metadata_extractor.py extracts to music.json or sound_effects.json metadata of all entries, thanks to pagination, of a given category specified in

		metadata_extracter.py
		Lines 14 to 15 in 5a6382ea00
	
				# `TAB` can either be `Music` or `Sound effects`

				TAB = 'Music'

.

Then as it is unclear how to easily proceed with low-level curl/requests to the download of the given sound tracks, media_files_extractor.py use Selenium to leverage previous step retrieved metadata to download one-by-one all the sound tracks to Selenium default downloads folder, which is on Linux ~/Downloads/.

The current issue with this repository is the stability, as at some point the algorithm stops being able to retrieve metadata or download correctly, cf #10.