Add a wiki page for already existing tools #50

Closed
opened 2023-02-19 16:30:26 +01:00 by Benjamin_Loison · 2 comments

Currently waiting all channels from channels.txt to be treated before treating this issue in order to compare already existing tools and mine.

~~Currently waiting all channels from `channels.txt` to be treated before treating this issue in order to compare already existing tools and mine.~~
Benjamin_Loison added the
waiting presentation
medium
enhancement
medium priority
labels 2023-02-19 16:30:26 +01:00
Author
Owner

I guess that contrarily to existing tools mine is the only-one that is open-source and also crawls YouTube public content (not being captions) in general.

I guess that contrarily to existing tools mine is the only-one that is open-source and also crawls YouTube public content (not being captions) in general.
Author
Owner

Would have used and would switch to a Wiki page if it could support attachments, which isn't the case currently.

Investigating supervisor resources:

Concerning incaptions.com

Search in the wrapped text displayed on the screen but not across displayed captions.

Our solution doesn't have this issue, as shown below:

Even if we workaround this issue it doesn't index all @TED videos cf gJjLdnycuyU:

Our solution indexes it too, as shown below:

Concerning filmot.com

It doesn't have the across displayed captions search issue. However even with is in millions of computers it doesn't return among the results the most viewed video from @TED that is o8NPllzkFhE.

Our solution correctly returns this video.

Concerning youglish.com

When providing is in millions of computers, it it replaces the query with is in millions of computers it so it doesn't found any result.

Our solution correctly returns o8NPllzkFhE.

Would have used and would switch to [a Wiki page if it could support attachments, which isn't the case currently](https://github.com/go-gitea/gitea/issues/574). Investigating supervisor resources: - https://incaptions.com - https://news.ycombinator.com/item?id=34826944 ## Concerning incaptions.com Search in the wrapped text displayed on the screen but not across displayed captions. ![](https://gitea.lemnoslife.com/attachments/cb90cc16-cd6d-4717-ad91-648c7e5fede7) ![](https://gitea.lemnoslife.com/attachments/68650661-e77a-464c-a377-dbeab6cf5703) ![](https://gitea.lemnoslife.com/attachments/68e114d0-70ad-41e4-bb60-bf771d497b17) Our solution doesn't have this issue, as shown below: ![](https://gitea.lemnoslife.com/attachments/a31a1432-646d-4fef-9569-6418a4d0165e) Even if we workaround this issue it doesn't index all [@TED](https://www.youtube.com/@TED) videos cf [`gJjLdnycuyU`](https://www.youtube.com/watch?v=gJjLdnycuyU): ![](https://gitea.lemnoslife.com/attachments/64b1041a-0e20-4b96-bad4-0c5868fe2f84) Our solution indexes it too, as shown below: ![](https://gitea.lemnoslife.com/attachments/5145f7ad-6c82-4a46-b4c9-fb510e4c36fe) ## Concerning filmot.com It doesn't have the across displayed captions search issue. However even with `is in millions of computers` it doesn't return among the results the most viewed video from [@TED](https://www.youtube.com/@TED) that is [`o8NPllzkFhE`](https://www.youtube.com/watch?v=o8NPllzkFhE). Our solution correctly returns this video. ## Concerning youglish.com When providing `is in millions of computers, it` it replaces the query with `is in millions of computers it` so it doesn't found any result. Our solution correctly returns [`o8NPllzkFhE`](https://www.youtube.com/watch?v=o8NPllzkFhE).
Sign in to join this conversation.
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: Benjamin_Loison/YouTube_captions_search_engine#50
No description provided.