bellingcat Python script to automatically archive social media posts, videos, and images from a Google Sheets document.
 
 
 
 
 
 
Go to file
Patrick Robertson e2442b2f6b
Merge pull request #243 from bellingcat/fix-long-path-names
Unit tests for storage types + fix storage too long issues for local storage
2025-03-11 10:05:09 +00:00
.github/workflows Add poetry.lock and pyproject.toml paths to trigger tests. 2025-03-07 18:01:08 +00:00
docs Update the release description to tag on release 2025-03-07 18:04:51 +00:00
scripts Don't force enable a module under all types - allows modules to be set as only feeder/only extractor etc. 2025-03-07 14:59:11 +00:00
src/auto_archiver Merge pull request #243 from bellingcat/fix-long-path-names 2025-03-11 10:05:09 +00:00
tests Merge pull request #243 from bellingcat/fix-long-path-names 2025-03-11 10:05:09 +00:00
.dockerignore
.gitignore Fix sorting of steps in the outputted file 2025-03-04 11:51:26 +00:00
.pylintrc
.readthedocs.yaml Overwrite settings_base file 2025-02-27 22:02:44 +00:00
CONTRIBUTING.md Fix links to docs 2025-02-12 11:41:54 +00:00
Dockerfile Remove geckodriver .tar file 2025-03-07 11:52:14 +00:00
LICENSE
README.md Adds new extractor for tiktok via unofficial API (#237) 2025-03-10 11:56:45 +00:00
docker-compose.yaml
poetry.lock Update the release process docs and the latest version in pyproject.toml 2025-03-07 17:33:32 +00:00
pyproject.toml Update the release process docs and the latest version in pyproject.toml 2025-03-07 17:33:32 +00:00

README.md

Auto Archiver

PyPI version Docker Image Version (latest by date) Core Test Status Download Test Status

Auto Archiver is a Python tool to automatically archive content on the web in a secure and verifiable way. It takes URLs from different sources (e.g. a CSV file, Google Sheets, command line etc.) and archives the content of each one. It can archive social media posts, videos, images and webpages. Content can be enriched, then saved either locally or remotely (S3 bucket, Google Drive). The status of the archiving process can be appended to a CSV report, or if using Google Sheets – back to the original sheet.

Read the article about Auto Archiver on bellingcat.com.

Installation

View the Installation Guide for full instructions

Advanced:

To get started quickly using Docker:

docker pull bellingcat/auto-archiver && docker run --rm -v secrets:/app/secrets bellingcat/auto-archiver --config secrets/orchestration.yaml

Or pip:

pip install auto-archiver && auto-archiver --help

Contributing

We welcome contributions to the Auto Archiver project! See the Contributing Guide for how to get involved!