bellingcat Python script to automatically archive social media posts, videos, and images from a Google Sheets document.
 
 
 
 
 
 
Go to file
Patrick Robertson b997bbea2b
Merge pull request #263 from bellingcat/wrong_steps
When loading modules, check they have been added to the right 'step' in the config
2025-03-20 15:31:38 +00:00
.github/workflows Fix merge conflicts. 2025-03-14 12:11:24 +00:00
docs Add info on building RTD versions + automated building of tagged versions 2025-03-17 12:52:04 +00:00
scripts Fix link to module types in config editor 2025-03-17 09:17:17 +00:00
src/auto_archiver Fix small bug in twitter dropin 2025-03-20 18:55:22 +04:00
tests Fix small bug in twitter dropin 2025-03-20 18:55:22 +04:00
.dockerignore docker initial files 2022-10-31 17:10:55 +00:00
.gitignore Remove schema.json file from git - is auto-generated on release 2025-03-16 10:27:08 +00:00
.pre-commit-config.yaml Fix pre-commit for ruff check 2025-03-14 13:40:57 +00:00
.pylintrc Ignore pylint statements for manifest files 2025-01-21 17:59:52 +01:00
.readthedocs.yaml Fix incorrect path in cp 2025-03-16 10:33:52 +00:00
CONTRIBUTING.md Fix links to docs 2025-02-12 11:41:54 +00:00
Dockerfile Remove geckodriver .tar file 2025-03-07 11:52:14 +00:00
LICENSE Add LICENSE 2021-06-24 16:14:32 +02:00
Makefile Update style_guide.md to clarify pre-commit setup, add Docker commands to Makefile and merge ruff actions. 2025-03-13 20:26:29 +00:00
README.md Add '-it' to the list of docker flags, so that docker gives a colour log output 2025-03-17 09:45:12 +00:00
docker-compose.yaml Add docker-compose for easy building and running of docker image in dev 2025-01-15 14:36:02 +01:00
poetry.lock Merge branch 'main' into opentimestamps 2025-03-14 12:36:03 +00:00
pyproject.toml Version bump 2025-03-18 10:52:23 +00:00

README.md

Auto Archiver

PyPI version Docker Image Version (latest by date) Core Test Status Download Test Status

Auto Archiver is a Python tool to automatically archive content on the web in a secure and verifiable way. It takes URLs from different sources (e.g. a CSV file, Google Sheets, command line etc.) and archives the content of each one. It can archive social media posts, videos, images and webpages. Content can be enriched, then saved either locally or remotely (S3 bucket, Google Drive). The status of the archiving process can be appended to a CSV report, or if using Google Sheets – back to the original sheet.

Read the article about Auto Archiver on bellingcat.com.

Installation

View the Installation Guide for full instructions

Advanced:

To get started quickly using Docker:

docker pull bellingcat/auto-archiver && docker run -it --rm -v secrets:/app/secrets bellingcat/auto-archiver --config secrets/orchestration.yaml

Or pip:

pip install auto-archiver && auto-archiver --help

Contributing

We welcome contributions to the Auto Archiver project! See the Contributing Guide for how to get involved!