|
|
@@ -2,69 +2,13 @@ |
|
|
|
|
|
|
|
<p style="text-align:center;"><img class="centerImage" src="https://cdn.discordapp.com/attachments/755918971487911947/759063041643118592/unknown.png" width=50%></p> |
|
|
|
|
|
|
|
## What this project is all about |
|
|
|
## [Project Information](https://github.com/Data-Horde/ytcc-archive/wiki/General-Information) |
|
|
|
## [Tutorial](https://github.com/Data-Horde/ytcc-archive/wiki/In-Depth-Tutorial) |
|
|
|
## Stats |
|
|
|
[Graphana](https://atdash.meo.ws/d/attv2/archive-team-tracker-charts-v2?orgId=1&var-project=ext-yt-communitycontribs) |
|
|
|
|
|
|
|
YouTube's community captions... |
|
|
|
[Tracker Data](https://tracker.archiveteam.org/ext-yt-communitycontribs/) |
|
|
|
## See also |
|
|
|
CC-draft retriever: https://github.com/Data-Horde/ytcc-exporter |
|
|
|
|
|
|
|
### Why would YouTube do this? |
|
|
|
|
|
|
|
[A year ago there was a major controversy which led to YouTube restricting the feature.](https://twitter.com/TeamYouTube/status/1167565334917742593) Because people were complaining about spam, they made it so only uploaders could publish submissions |
|
|
|
[2:44 PM] themadprogramer: |
|
|
|
|
|
|
|
[For a more exhaustive explanation click here!](https://datahorde.org/?p=1442) |
|
|
|
|
|
|
|
YouTube announced that they were going to retire the feature on September 28, 2020. This decision seeing as people had [been complaining about the feature](https://www.youtube.com/watch?v=isvAJnuyaN4)and that they did not want to reimplement it for the new editor |
|
|
|
|
|
|
|
### Why should we archive these unpublished contributions? |
|
|
|
|
|
|
|
The best scenario: uploaders approve, |
|
|
|
had previously complained about the feature |
|
|
|
|
|
|
|
### How a "worker" works |
|
|
|
|
|
|
|
<p style="text-align:center;"><img class="centerImage" src="https://cdn.discordapp.com/attachments/755918971487911947/759059477387018250/Architecture.png" width=25%></p> |
|
|
|
|
|
|
|
The Tracker tracks workers, the workers collect information from videos: captions in review, title/description translations in review and caption credits. |
|
|
|
|
|
|
|
An "un-automated" version which https://github.com/Data-Horde/ytcc-exporter |
|
|
|
|
|
|
|
### Stats |
|
|
|
https://atdash.meo.ws/d/attv2/archive-team-tracker-charts-v2?orgId=1&var-project=ext-yt-communitycontribs |
|
|
|
|
|
|
|
https://tracker.archiveteam.org/ext-yt-communitycontribs/ |
|
|
|
*** |
|
|
|
|
|
|
|
## Tutorial |
|
|
|
|
|
|
|
### Getting Started |
|
|
|
|
|
|
|
In order to run these tools you will need to provide "session cookies", you can think of this as a lazy way of logging onto YouTube: |
|
|
|
|
|
|
|
- In a new/guest/Incognito browser profile, create a test Google account. |
|
|
|
(Use a separate browser profile so the cookies don't get associated with your main Google account). |
|
|
|
- **IMPORTANT: Set the default account language to English (United States). https://myaccount.google.com/language** |
|
|
|
- **IMPORTANT: Visit YouTube.com. Set the YouTube site language (found by clicking on the profile image on the top right corner of youtube.com) to English (US).** |
|
|
|
- Open developer tools and go to the Application tab in [Chrome](https://developers.google.com/web/tools/chrome-devtools/storage/cookies), or the Storage tab in Firefox. Click on Cookies and then https://www.youtube.com. Copy the full values for the following cookies on youtube.com: HSID, SSID, and SID. Note these values for when the archiving begins. |
|
|
|
|
|
|
|
The cookie values are needed because a Google account (any Google account) is required to access the community contributions editor, where much of the data is gathered from. |
|
|
|
|
|
|
|
<p style="text-align:center;"><img class="centerImage" src="https://cdn.discordapp.com/attachments/755014597811109898/758618624902168586/unknown.png" width=80%></p> |
|
|
|
|
|
|
|
### Run locally |
|
|
|
|
|
|
|
WIP |
|
|
|
|
|
|
|
### Heroku |
|
|
|
|
|
|
|
If you're familiar with Heroku, you can just deploy the YTCC archiving tool from this template: |
|
|
|
|
|
|
|
[![Deploy](https://www.herokucdn.com/deploy/button.svg)](https://heroku.com/deploy?template=https://github.com/Data-Horde/ytcc-archive-heroku) |
|
|
|
|
|
|
|
<p style="text-align:center;"><img class="centerImage" src="https://cdn.discordapp.com/attachments/755918971487911947/759049420809175050/H1.png" width=50%></p> |
|
|
|
<p style="text-align:center;"><img class="centerImage" src="https://cdn.discordapp.com/attachments/755918971487911947/759049419206295583/H2.png" width=50%></p> |
|
|
|
|
|
|
|
### Docker |
|
|
|
|
|
|
|
WIP |
|
|
|
|
|
|
|
You can also make a new image using the Dockerfile provided in this repo. |
|
|
|
*** |