Browse Source

Ignore /vi/* on YouTube (video thumbnails)

master
JustAnotherArchivist 4 years ago
parent
commit
cd0b3f6214
1 changed files with 2 additions and 1 deletions
  1. +2
    -1
      website-extract-social-media

+ 2
- 1
website-extract-social-media View File

@@ -42,7 +42,8 @@ function fetch_n_extract {
>(
# YouTube
grep -Poi '(youtube\.com/((user|channel|embed)/)?[^/ <"'"'"']+|youtu\.be/[^/ <"'"'"']+)' | \
awk '/^youtube/ { print "https://www." $0 } /^youtu\.be/ { print "https://" $0 }'
awk '/^youtube/ { print "https://www." $0 } /^youtu\.be/ { print "https://" $0 }' | \
grep -vi -e '^https://www\.youtube\.com/vi$'
) \
>/dev/null
} | awk '!seen[$0]++'


Loading…
Cancel
Save