Try to find a logic to YouTube search concerning captions #27

Open
opened 2023-01-09 06:15:09 +01:00 by Benjamin_Loison · 1 comment

Concerning [the project proposal](https://gitea.lemnoslife.com/Benjamin_Loison/YouTube_captions_search_engine/wiki/Project-proposal):

> It's not clear to me from the "proof" part whether the video "o8NPllzkFhE" is not returned because of an indexing problem or because it is considered to be a duplicate of the video "Vo9KPk-gqKk". Did you manage to identify a case where a video is not returned even though it is the only match to a query? (Indeed, if the goal of your project is just to work around the fact that some duplicate videos are removed from search results, then it limits a bit the appeal.)

Source: email

An answer is given [here](https://gitea.lemnoslife.com/Benjamin_Loison/YouTube_captions_search_engine/wiki/Home#gjjldnycuyu-https-www-youtube-com-watch-v-gjjldnycuyu-my-kids-have-seen-a-lot-of-cartoons).

Benjamin_Loison added the `medium`, `enhancement`, `medium priority` labels 2023-01-09 06:15:09 +01:00
Benjamin_Loison changed title from Try to find a logic to YouTube search to Try to find a logic to YouTube search concerning captions 2023-01-14 15:29:04 +01:00
Benjamin_Loison added the `captions` label 2023-01-14 15:29:26 +01:00

Even when taking into account the line-by-line matching (see the captions below) that our proposed search engine should support, the behavior of YouTube search does not make sense: it still does not return [`o8NPllzkFhE`](https://www.youtube.com/watch?v=o8NPllzkFhE) when searching for [`is in millions of computers`](https://yt.lemnoslife.com/noKey/search?part=snippet&q=%22is%20in%20millions%20of%20computers%22&maxResults=50), even though the whole phrase fits within a single caption line.

```
00:13.440 --> 00:15.440
this is such a strange thing your

00:14.719 --> 00:18.880
software

00:15.440 --> 00:22.320
uh linux is in millions of computers

00:18.880 --> 00:24.240
it probably powers much of the internet
```

Related to #31.
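
To make the line-by-line aspect concrete, here is a minimal Python sketch, not the project's actual implementation. It parses the caption excerpt quoted in this comment into cues and compares two matching strategies: matching a phrase within a single cue, and matching it across cue boundaries. The names `parse_cues`, `search_line_by_line`, and `search_across_cues` are hypothetical and chosen only for illustration.

```python
import re

# Caption excerpt quoted in this comment (WebVTT-style cues).
CAPTIONS = """\
00:13.440 --> 00:15.440
this is such a strange thing your

00:14.719 --> 00:18.880
software

00:15.440 --> 00:22.320
uh linux is in millions of computers

00:18.880 --> 00:24.240
it probably powers much of the internet
"""

# A cue timing line looks like "<start> --> <end>".
CUE_TIMING = re.compile(r"^(\S+) --> (\S+)$")


def parse_cues(captions):
    """Return a list of (start, end, text) tuples, one per cue."""
    cues = []
    for block in captions.strip().split("\n\n"):
        lines = block.strip().splitlines()
        if not lines:
            continue  # skip empty blocks
        match = CUE_TIMING.match(lines[0])
        if match is None:
            continue  # skip blocks that do not start with a timing line
        cues.append((match.group(1), match.group(2), " ".join(lines[1:])))
    return cues


def search_line_by_line(cues, phrase):
    """Return the cues whose own text contains the phrase (case-insensitive)."""
    phrase = phrase.lower()
    return [cue for cue in cues if phrase in cue[2].lower()]


def search_across_cues(cues, phrase):
    """Return True if the phrase occurs in the text of all cues joined by spaces."""
    full_text = " ".join(text for _, _, text in cues)
    return phrase.lower() in full_text.lower()


if __name__ == "__main__":
    cues = parse_cues(CAPTIONS)
    # The query from this comment fits entirely within one cue.
    print(search_line_by_line(cues, "is in millions of computers"))
    # A phrase split over two cues is only found when cues are joined.
    print(search_line_by_line(cues, "your software"))
    print(search_across_cues(cues, "your software"))
```

With this excerpt, the query `is in millions of computers` matches the cue starting at `00:15.440` under the line-by-line strategy, whereas `your software` is only found when cues are joined. So even a search engine restricted to single caption lines would be expected to return the video for this query.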
Benjamin_Loison added this to the 0.0.1 milestone 2023-02-10 17:14:13 +01:00
Benjamin_Loison removed this from the 0.0.1 milestone 2023-02-10 17:15:00 +01:00
Reference: Benjamin_Loison/YouTube_captions_search_engine#27