Benjamin_Loison opened issue Benjamin_Loison/YouTube_captions_search_engine#49 2023-02-19 02:26:01 +01:00
Update YouTube operational API commits Wiki page
Benjamin_Loison commented on issue Benjamin_Loison/YouTube_captions_search_engine#47 2023-02-19 02:23:15 +01:00
Retrieving playlist id from PLAYLISTS tab may crash due to type inconsistency

Thanks to a temporary patch, close in favor of #11.

Benjamin_Loison closed issue Benjamin_Loison/YouTube_captions_search_engine#47 2023-02-19 02:23:15 +01:00
Retrieving playlist id from PLAYLISTS tab may crash due to type inconsistency
e86d629597 #48: Modify removeChannelsBeingTreated.py to temporarily solve the issue
Benjamin_Loison opened issue Benjamin_Loison/YouTube_captions_search_engine#48 2023-02-19 02:03:29 +01:00
Redirect compression execution logs for not having them overlapping PRINTs
Benjamin_Loison opened issue Benjamin_Loison/YouTube_captions_search_engine#47 2023-02-19 01:54:59 +01:00
Retrieving playlist id from PLAYLISTS tab may crash due to type inconsistency
Benjamin_Loison closed issue Benjamin_Loison/YouTube_captions_search_engine#45 2023-02-18 12:57:38 +01:00
What does the website returns for a video with two captions matching the query?
Benjamin_Loison commented on issue Benjamin_Loison/YouTube_captions_search_engine#45 2023-02-18 12:57:16 +01:00
What does the website returns for a video with two captions matching the query?

As it's working on a captions file basis it returns two entries which seem the best way to treat this case.

Output for: aquatique le plus haut

  • [UCH0XvUpYcxn4V0iZGnZXMnQ.zip](https://crawl…
78b2bf18fa #35: Make the not automatically generated captions correctly downloaded
5bfceccb8e Change the EXIT_WITH_ERROR to PRINT for channels not having an enumerable uploads playlist
Compare 2 commits »
Benjamin_Loison opened issue Benjamin_Loison/YouTube_captions_search_engine#46 2023-02-17 16:54:09 +01:00
yt-dlp seems able to download more live chat than I do
Benjamin_Loison commented on issue Benjamin_Loison/YouTube_captions_search_engine#31 2023-02-17 01:02:41 +01:00
Make a website with a search engine notably based on the captions extracted

Note that maybe the returned match timestamps aren't as precise as we can (maybe it returns the previous beginning timestamp caption for instance). This should be ideally investigated.

Benjamin_Loison opened issue Benjamin_Loison/YouTube_captions_search_engine#45 2023-02-16 23:32:14 +01:00
What does the website returns for a video with two captions matching the query?
Benjamin_Loison commented on issue Benjamin_Loison/YouTube_captions_search_engine#40 2023-02-16 21:05:06 +01:00
Publish nginx configuration

Note that as I'm hosting multiple websites, to guess which website (here the YouTube operational API one) to talk to, I'm using a private sub domain private.sub.domain. However reaching this…

Benjamin_Loison commented on issue Benjamin_Loison/YouTube_captions_search_engine#35 2023-02-16 13:19:18 +01:00
Prepare the presentation

To verify the correct format of channels.txt, as I ran dos2unix on it while the algorithm was running:

verifyChannels.py:

#!/usr/bin/python3

with open('channels.txt') as f:
  
Benjamin_Loison commented on issue Benjamin_Loison/YouTube_captions_search_engine#35 2023-02-16 13:16:40 +01:00
Prepare the presentation

To verify that the starting set was treated:

isStartingSetTreated.py:

#!/usr/bin/python3

import os

with open('newChannels.txt') as f:
    lines = f.read().splitlines()
    for
Benjamin_Loison commented on issue Benjamin_Loison/YouTube_captions_search_engine#35 2023-02-16 12:12:27 +01:00
Prepare the presentation

Concerning channels/ due to crashes during the unstable process at the time of the process, using:

find -name '*.zip' -exec unzip -t {} \; 
Benjamin_Loison opened issue Benjamin_Loison/YouTube_captions_search_engine#44 2023-02-15 23:52:16 +01:00
Improve indexing of website
Benjamin_Loison commented on issue Benjamin_Loison/YouTube_captions_search_engine#25 2023-02-15 16:20:13 +01:00
Make a not pre-release release

Will publish such a release after having treated all the channels I provided it initially.

Benjamin_Loison opened issue Benjamin_Loison/YouTube_captions_search_engine#43 2023-02-15 00:00:05 +01:00
Could wonder if doing on our own speech-to-text wouldn't make sense for videos
Benjamin_Loison commented on issue Benjamin_Loison/YouTube_captions_search_engine#35 2023-02-14 23:50:53 +01:00
Prepare the presentation

Also verifying quality by verifying debug/*.err content:

cat *.err