bellingcat Python script to automatically archive social media posts, videos, and images from a Google Sheets document.
 
 
 
 
 
 
Go to file
Miguel Sozinho Ramalho fe07ed7b22
Merge branch 'dev' into v1.1c-dev-upstream
2025-06-17 16:14:02 +01:00
.github dependabot switch to monthly 2025-06-10 16:37:52 +01:00
docs fixes bugs and limited configuration of multi-level logs 2025-06-17 14:10:46 +01:00
scripts npm update on scripts/settings 2025-06-10 16:33:47 +01:00
src/auto_archiver Merge branch 'dev' into v1.1c-dev-upstream 2025-06-17 16:14:02 +01:00
tests generic_extractor download tests adaptations 2025-06-11 20:05:35 +01:00
.dockerignore
.gitignore Gitgnore to include launch.json and installtion docs to include build script. 2025-06-16 14:37:21 +01:00
.pre-commit-config.yaml Fix pre-commit for ruff check 2025-03-14 13:40:57 +00:00
.pylintrc
.readthedocs.yaml Fix incorrect path in cp 2025-03-16 10:33:52 +00:00
CONTRIBUTING.md Fix links to docs 2025-02-12 11:41:54 +00:00
Dockerfile Merge pull request #315 from bellingcat/dependabot/docker/webrecorder/browsertrix-crawler-1.6.2 2025-06-10 16:34:30 +01:00
LICENSE
Makefile Update style_guide.md to clarify pre-commit setup, add Docker commands to Makefile and merge ruff actions. 2025-03-13 20:26:29 +00:00
README.md update repo badges 2025-03-31 16:19:29 +01:00
docker-compose.yaml Cleanup docker-compose 2025-03-20 16:48:30 +04:00
poetry.lock dependencies update 2025-06-17 12:50:35 +01:00
pyproject.toml removes dev specific instructions 2025-06-17 13:16:36 +01:00

README.md

Auto Archiver

Documentation Status PyPI version Docker Image Version (latest by date) Core Test Status

Auto Archiver is a Python tool to automatically archive content on the web in a secure and verifiable way. It takes URLs from different sources (e.g. a CSV file, Google Sheets, command line etc.) and archives the content of each one. It can archive social media posts, videos, images and webpages. Content can be enriched, then saved either locally or remotely (S3 bucket, Google Drive). The status of the archiving process can be appended to a CSV report, or if using Google Sheets – back to the original sheet.

Read the article about Auto Archiver on bellingcat.com.

Installation

View the Installation Guide for full instructions

Advanced:

To get started quickly using Docker:

docker pull bellingcat/auto-archiver && docker run -it --rm -v secrets:/app/secrets bellingcat/auto-archiver --config secrets/orchestration.yaml

Or pip:

pip install auto-archiver && auto-archiver --help

Contributing

We welcome contributions to the Auto Archiver project! See the Contributing Guide for how to get involved!