What does the website return for a video with two captions matching the query? #45

Closed
opened 2023-02-16 23:32:14 +01:00 by Benjamin_Loison · 1 comment

I would guess that there should be two matches, even if they have the same timestamp.

For [`-lVZ8wluBfc`](https://www.youtube.com/watch?v=-lVZ8wluBfc), both the automatically generated captions and the non-automatically generated ones contain `galement bienvenue`, but the website only returns a single match at `2 s`.

Checking a video that has a match only in the automatically generated captions, and another that has a match only in the non-automatically generated captions, would reassure us that *everything works fine*.
For automatically generated captions there isn't any problem. However, for the video mentioned above, `CLAP BONJOUR` returns no match, because no non-automatically generated French captions were downloaded. This may be due to #35, but that seems unlikely, as the archive was completed 2 days ago, so apparently before the yt-dlp bug arose.
Benjamin_Loison added the quick, discussion, medium priority labels 2023-02-16 23:32:14 +01:00
Benjamin_Loison added bug and removed discussion labels 2023-02-17 01:07:56 +01:00
Author
Owner

As the search works on a per-captions-file basis, it returns two entries, which seems the best way to treat this case.

Output for: `aquatique le plus haut`

- [UCH0XvUpYcxn4V0iZGnZXMnQ.zip](https://crawler.yt.lemnoslife.com/channels/UCH0XvUpYcxn4V0iZGnZXMnQ.zip)
  - [captions/p6rbOYH2tGY/_.fr.vtt](https://crawler.yt.lemnoslife.com/channels/UCH0XvUpYcxn4V0iZGnZXMnQ.zip/captions/p6rbOYH2tGY/_.fr.vtt) [0 s](https://www.youtube.com/watch?v=p6rbOYH2tGY&t=0)
  - [captions/p6rbOYH2tGY/_.fr-orig.vtt](https://crawler.yt.lemnoslife.com/channels/UCH0XvUpYcxn4V0iZGnZXMnQ.zip/captions/p6rbOYH2tGY/_.fr-orig.vtt) [0 s](https://www.youtube.com/watch?v=p6rbOYH2tGY&t=0)
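
To illustrate why searching on a per-captions-file basis yields one entry per file, here is a minimal Python sketch. It is not the project's actual implementation: the `CAPTIONS` dictionary, the cue text inside it, and the `search_file` and `search` helpers are all hypothetical, and the WebVTT parsing is deliberately simplified.

```python
import re

# Hypothetical in-memory stand-in for the caption files of one video.
# The cue text is made up for illustration; only the paths come from the issue.
CAPTIONS = {
    "captions/p6rbOYH2tGY/_.fr.vtt": (
        "WEBVTT\n\n00:00:00.000 --> 00:00:04.000\nle parc aquatique le plus haut\n"
    ),
    "captions/p6rbOYH2tGY/_.fr-orig.vtt": (
        "WEBVTT\n\n00:00:00.000 --> 00:00:04.000\nle parc aquatique le plus haut\n"
    ),
}

# Matches the start of a WebVTT cue timing line such as "00:00:00.000 --> ...".
CUE_TIMING = re.compile(r"(\d{2}):(\d{2}):(\d{2})\.\d{3} --> ")


def search_file(vtt_text, query):
    """Return the start time (in seconds) of each cue line containing the query."""
    matches = []
    start = None
    for line in vtt_text.splitlines():
        timing = CUE_TIMING.match(line)
        if timing:
            hours, minutes, seconds = map(int, timing.groups())
            start = hours * 3600 + minutes * 60 + seconds
        elif start is not None and query in line:
            matches.append(start)
    return matches


def search(captions, query):
    """Search every captions file independently, without merging equal timestamps."""
    results = []
    for path, vtt_text in captions.items():
        for start in search_file(vtt_text, query):
            results.append((path, start))
    return results


results = search(CAPTIONS, "aquatique le plus haut")
for path, start in results:
    print(f"{path} {start} s")
```

Under these assumptions, the query matches once in each file, so two entries are listed even though both point to `0 s`.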
Reference: Benjamin_Loison/YouTube_captions_search_engine#45