Current algorithm doesn't check all channel tabs #11
Labels
No Label
bug
captions
discussion
enhancement
epic
high priority
low priority
medium
medium priority
quick
security
waiting presentation
website
youtube-operational-api
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: Benjamin_Loison/YouTube_captions_search_engine#11
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
As YouTube Data API v3 CommentThreads: list endpoint doesn't support this feature with filter
allThreadsRelatedToChannelId
neitherchannelId
.My YouTube operational API allows to treat them.
Should check that in a similar manner we can't extract channels, even in some cases, from others tabs.
LIVE
(if the channel is livestreaming, or the previous livestreams where we can still access the chat)@LofiGirl
current livestreams (less important as it doesn't seem that we can go up in the chat, even with the ability to go back in time up to 12 hours thanks to DVR the chat still works with current new messages, maybe doable by reverse-engineering). A question I want to raise is for a livestream that started a while ago does a first chat message is still visible after a while if there wasn't any message? Even with the official method I mention in the following item, we can only retrieve current and future chat messages but not previous ones.@LeFatShow
previous livestreamsare good examples to make sure my algorithm works fine with. We can't just use YouTube Data API v3 PlaylistItems: list endpoint to retrieve whether or a not given video is an ended livestream even if they are listed with this endpoint. So we are obliged to use YouTube Data API v3 Videos: list endpoint with
part=liveStreamingDetails
. Note that noactiveLiveChatId
can be returned for this latter video for instance while the first one doesn't have any problem to follow the official method. In the latter case I should rely on my YouTube operational APIliveChats
endpoint.Note that current pagination algorithm doesn't work properly (in addition that the returned data aren't much parsed perfectly).Note that even with reverse-engineering by providing a negativetime
, we can't retrieve all messages in the chat that happened before the livestream (we only have those returned when requestingtime = 0
), an example of livestream having messages before the beginning of it isFuFjLL7gKv4
.I used the following algorithm to find some interesting
LIVE
tabs:PLAYLISTS
(in case of playlist with videos of other channels) - should also keep unlisted videos found that way, seePLe0Nm0KU0Zo2vuYY_DDLHK4iP7SQVOUSm
for such an example) -as far as I know retrievingI solved this issue adding the ability to retrieveSaved playlists
isn't possible with YouTube APIsSaved playlists
, as YouTube official APIs doesn't allow us to do so.Currently having an issue withShould pay attention to not treat twice the same unlisted videos. Should make sure that unlisted videos are treated with our livestream procedure if they are livestreams. We don't treat shows cf #47, but it's unclear if there can be comments on such videos as I'm unable to see when not logged in my comment on@Goldenmoustache
, as there is a YouTube Originals playlist.kZfSRj-cOJk
. YouTube Data API v3 seems to agree with my observation. In fact concerning shows, it doesn't seem that we can extract any channel id that may have indirectly comments as the serie channelUCubjFsqje4qtdyQVZiQAUsg
isn't available.CHANNELS
-have to implement pagination for-@LeFatShow
for instance@cyprien
has a different layout (cf YouTube-operational-API/issues/121) - from@LeFatShow
CHANNELS
tab by proceeding recursively, I retrieved at least 20,538 channels (stopped it by mistake)COMMUNITY
thanks to post comments (have to add its support in YouTube operational API, as I confirm again that YouTube Data API v3 CommentThreads: list endpoint withallThreadsRelatedToChannelId
andchannelId
)Note that in theory we should call YouTube operational API instance at the end of the process with YouTube official APIs otherwise if we do it in the beginning we risk to be detected as having an unusual activity. However should adopt this method only for the long run, as otherwise for testing we would lose quite some time.
Should make another boolean
USE_YT_LEMNOSLIFE_COM_YOUTUBE_OPERATIONAL_API_INSTANCE
likeUSE_YT_LEMNOSLIFE_COM_NO_KEY_SERVICE
to work even if the user doesn't have a YouTube operational API running locally.In fact a
--youtube-operational-api-instance URL
with anURL
likehttps://yt.lemnoslife.com
would be nice and by default the usedURL
would be the one I just mentioned.Note that we could find even more channels/videos by analyzing user messages etc however as we don't have any guarantee that a message contains a channel or a video, we don't treat them. So we limit us to YouTube webpages parts where we are sure that if there is something then it's a YouTube channel/video...
Note that we could also give a try to YouTube Data API v3 endpoints Search: list with notably
relatedToVideoId
filter and Videos: list withpart=suggestions
using OAuth (note that it is maybe to some extent equivalent to Search: list possibility).Treated
@LeParisien
livestreams:To filter video ids out of the logs, I used:
Note that current algorithm doesn't check comments on the `COMMUNITY` tabto Note that current algorithm doesn't check all channel tabsJust writing the id of a channel having unlisted videos doesn't make much sense in my opinion.
Note that current algorithm doesn't check all channel tabsto Current algorithm doesn't check all channel tabsIf you are looking for unlisted videos, you can use:
If you don't mind about listing as many times as an unlisted video appears in playlists, you can use:
Could also get more channels thanks to
HOME
tab, as it may introduce some, however maybe they are repeating with theCHANNELS
ones.