2025-02-10 22:51:04 +00:00
# Module Documentation
2025-02-10 22:58:52 +00:00
These pages describe the core modules that come with `auto-archiver` and provide the main functionality for archiving websites on the internet. There are five core module types:
2025-02-10 22:51:04 +00:00
1. Feeders - these 'feed' information (the URLs) from various sources to the `auto-archiver` for processing
2. Extractors - these 'extract' the page data for a given URL that is fed in by a feeder
3. Enrichers - these 'enrich' the data extracted in the previous step with additional information
4. Storage - these 'store' the data in a persistent location (on disk, Google Drive etc.)
5. Databases - these 'store' the status of the entire archiving process in a log file or database.
```{include} modules/autogen/module_list.md
```
```{toctree}
:maxdepth: 1
:caption: Core Modules
:hidden:
2025-02-11 14:06:53 +00:00
modules/config_cheatsheet
2025-02-10 22:51:04 +00:00
modules/feeder
modules/extractor
modules/enricher
modules/storage
modules/database
2025-02-18 19:10:09 +00:00
modules/formatter
2025-02-10 22:51:04 +00:00
```