bellingcat Python script to automatically archive social media posts, videos, and images from a Google Sheets document.
Patrick Robertson d28d99daa6 Docs tidy ups and re-organising 2025-02-12 11:16:17 +00:00

Auto Archiver

Auto Archiver is a Python tool that automatically archives content on the web. It takes URLs from different sources (e.g. a CSV file, Google Sheets, or the command line) and archives the content of each one. It can archive social media posts, videos, images and webpages. Content can be enriched, then saved either locally or remotely (S3 bucket, Google Drive). The status of the archiving process can be appended to a CSV report or, if using Google Sheets, written back to the original sheet.
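
To make the flow described above concrete, here is a minimal conceptual sketch of a feed, archive, enrich, store and report pipeline. It is not Auto Archiver's API: every name in it (`ArchiveResult`, `feed_from_csv`, `archive`, `enrich`, `store`, `report`, `run_pipeline`) is hypothetical, the archiving step is a stand-in that performs no network access, and the CSV layout (URL in the first column) is an assumption made for illustration only.

```python
import csv
import io
from dataclasses import dataclass, field


@dataclass
class ArchiveResult:
    """Hypothetical record of one processed URL (not an Auto Archiver type)."""
    url: str
    status: str = "pending"
    metadata: dict = field(default_factory=dict)


def feed_from_csv(text: str) -> list[str]:
    """Read URLs from CSV text; assumes the URL sits in the first column."""
    rows = csv.reader(io.StringIO(text))
    return [row[0].strip() for row in rows if row and row[0].strip()]


def archive(url: str) -> ArchiveResult:
    """Stand-in for the real archiving step; nothing is downloaded here."""
    return ArchiveResult(url=url, status="archived")


def enrich(result: ArchiveResult) -> ArchiveResult:
    """Stand-in enrichment: attach one simple derived field."""
    result.metadata["url_length"] = len(result.url)
    return result


def store(result: ArchiveResult, storage: dict) -> None:
    """Stand-in storage: keep the result in an in-memory dict keyed by URL."""
    storage[result.url] = result


def report(results: list[ArchiveResult]) -> str:
    """Produce a CSV report with one status row per URL."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["url", "status"])
    for r in results:
        writer.writerow([r.url, r.status])
    return out.getvalue()


def run_pipeline(csv_text: str, storage: dict) -> str:
    """Run every URL from the feed through archive, enrich and store."""
    results = []
    for url in feed_from_csv(csv_text):
        result = enrich(archive(url))
        store(result, storage)
        results.append(result)
    return report(results)


if __name__ == "__main__":
    saved = {}
    print(run_pipeline("https://example.com/post/1\nhttps://example.com/video/2\n", saved))
```

In the real tool these stages are configured rather than hand-written; see the linked documentation for the actual modules and options.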

**[See the Auto Archiver documentation for more information.](https://auto-archiver.readthedocs.io/en/latest/)**

Read the article about Auto Archiver on bellingcat.com.

Installation

View the Installation Guide for full instructions.

To get started quickly using Docker:

docker pull bellingcat/auto-archiver && docker run bellingcat/auto-archiver

Or pip:

pip install auto-archiver && auto-archiver --help
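
After a pip install, it can be useful to confirm from a script that the command is actually on your `PATH`. The sketch below uses only the `auto-archiver --help` invocation shown above; the helper name `cli_help` is hypothetical, and the function simply returns `None` when the command cannot be found.

```python
import shutil
import subprocess


def cli_help(command: str = "auto-archiver"):
    """Return the command's --help output, or None if it is not on PATH."""
    path = shutil.which(command)
    if path is None:
        return None
    # check=False: a non-zero exit code is not treated as an error here.
    completed = subprocess.run(
        [path, "--help"], capture_output=True, text=True, check=False
    )
    return completed.stdout


if __name__ == "__main__":
    text = cli_help()
    print("auto-archiver not found on PATH" if text is None else text)
```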

Contributing

We welcome contributions to the Auto Archiver project! See the Contributing Guide for how to get involved!