Add support for Instagram posts and ignore spurious links from the CDN

master
JustAnotherArchivist 4 years ago
parent
commit
4f34753788
1 changed files with 3 additions and 2 deletions
+3 -2 website-extract-social-media

website-extract-social-media

@@ -19,8 +19,9 @@ function fetch_n_extract {
 ) \
 >(
 # Instagram
-grep -Poi 'instagram\.com/[^/ <"'"'"']+' | \
-sed 's,^,https://www.,'
+grep -Poi 'instagram\.com/(p/)?[^/ <"'"'"']+' | \
+sed 's,^,https://www.,' | \
+grep -Pvi -e '^https://www\.instagram\.com/v?p$'
 ) \
 >(
 # Telegram
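
To see what the changed Instagram pipeline does, here is a minimal sketch that runs the three new commands on a few sample lines. The function name `extract_instagram` and the sample HTML (including the CDN hostname) are made up for illustration and are not part of the script; the sketch also assumes a grep built with `-P` (PCRE) support, such as GNU grep.

```bash
#!/usr/bin/env bash
# Hypothetical wrapper, not part of website-extract-social-media:
# applies the new Instagram pipeline from this commit to stdin.
extract_instagram() {
	# 1. Pull out profile links and, via the optional "p/" group, post links.
	# 2. Normalise every match to an https://www. URL.
	# 3. Drop bare ".../p" and ".../vp" results, which are not real profiles.
	grep -Poi 'instagram\.com/(p/)?[^/ <"'"'"']+' | \
		sed 's,^,https://www.,' | \
		grep -Pvi -e '^https://www\.instagram\.com/v?p$'
}

# Made-up input: a profile link, a post link, and a CDN image URL.
sample='<a href="https://www.instagram.com/example_user/">profile</a>
<a href="https://www.instagram.com/p/AbC123xyz/">post</a>
<img src="https://scontent.cdninstagram.com/vp/0123abcd/photo.jpg">'

result="$(printf '%s\n' "$sample" | extract_instagram)"
printf '%s\n' "$result"
```

With this input, the profile and post links are kept. The CDN URL is first matched as `instagram.com/vp`, because the pattern is not anchored at the start of the hostname, and the final `grep -v` then discards it.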