The little things give you away... A collection of various small helper stuff
Vous ne pouvez pas sélectionner plus de 25 sujets Les noms de sujets doivent commencer par une lettre ou un nombre, peuvent contenir des tirets ('-') et peuvent comporter jusqu'à 35 caractères.
 
 
 

10 lignes
709 B

  1. #!/bin/bash
  2. # When scraping accounts and hashtags which have some overlap, this can be used to filter out the accounts' tweets from the hashtag scrapes
  3. # Starting with account and hashtag scrapes in twitter-@* and twitter-#*, respectively:
  4. for f in twitter-#*; do comm -23 <(sort <$f) <(cat twitter-@* | sort) > "${f}-fixed"; done
  5. for f in *-fixed; do { grep -vF '/status/' $f; grep -F '/status/' $f | sort -t'/' -k6,6n | tac; } > "${f}-sorted"; done
  6. for f in *-fixed-sorted; do mv $f ${f/-fixed-sorted/-filtered}; done
  7. # sort -r should work, but for some reason it doesn't, hence the tac...
  8. # There's certainly a cleaner way which doesn't involve sorting and then restoring the inverse chronological order.