Wykres commitów

  • 89d4ea15c1
    Merge 36fb0f0d82 into cd6a2b6031 Dave Mateer 2025-06-17 10:10:19 +0100
  • 36fb0f0d82 metadata.json hardcode in storage. add new metadata_json_enricher. log level change in orchestrator Dave Mateer 2025-06-17 09:51:19 +0100
  • ec74670dba
    Merge b3adc5603a into cd6a2b6031 Dave Mateer 2025-06-17 08:51:37 +0000
  • b3adc5603a metadata.json hardcode in storage. add new metadata_json_enricher. log level change in orchestrator Dave Mateer 2025-06-17 09:51:19 +0100
  • 24fbbd0644
    Merge 4388be4b23 into 6735fa890b dependabot[bot] 2025-06-16 21:56:13 +0000
  • 4388be4b23
    Bump the actions group in /scripts/settings with 7 updates dependabot/npm_and_yarn/scripts/settings/actions-5e6945255f dependabot[bot] 2025-06-16 21:56:10 +0000
  • 679e870a86
    Merge b66de3156c into 6735fa890b dependabot[bot] 2025-06-16 21:42:04 +0000
  • b66de3156c
    Bump webrecorder/browsertrix-crawler from 1.6.1 to 1.6.3 dependabot/docker/webrecorder/browsertrix-crawler-1.6.3 dependabot[bot] 2025-06-16 21:42:02 +0000
  • 48e59e4d98
    Merge db250fd68d into 6735fa890b dependabot[bot] 2025-06-16 21:30:33 +0000
  • db250fd68d
    Bump the python group with 2 updates dependabot/pip/python-b83113665b dependabot[bot] 2025-06-16 21:30:30 +0000
  • ba3f1a52e8 Logging each_level_in_separate_file feature Dave Mateer 2025-06-16 16:15:54 +0100
  • a60d800b31 Changed log level for media Dave Mateer 2025-06-16 15:07:39 +0100
  • f2e80758a7 typo on authentication docs. Updated install docs. Dave Mateer 2025-06-16 14:59:55 +0100
  • f07fdbc500 Custom local version comment in toml file Dave Mateer 2025-06-16 14:54:15 +0100
  • b236f2510d Updates to installation docs Dave Mateer 2025-06-16 14:40:40 +0100
  • 529d8b60bf Gitgnore to include launch.json and installtion docs to include build script. Dave Mateer 2025-06-16 14:37:21 +0100
  • d13ecfa85b
    Merge cd6a2b6031 into 6735fa890b Miguel Sozinho Ramalho 2025-06-11 20:05:40 +0100
  • cd6a2b6031
    generic_extractor download tests adaptations dev msramalho 2025-06-11 20:05:35 +0100
  • dfb361e3a0
    reset generic_extractor description in result msramalho 2025-06-11 19:55:54 +0100
  • 3d31c7605b
    Merge pull request #319 from bellingcat/feat/linkedin-antibot Miguel Sozinho Ramalho 2025-06-11 19:42:38 +0100
  • d7a48e465b
    fix copypasta msramalho 2025-06-11 18:04:49 +0100
  • aaa9ead39d
    adds documentation for dropins msramalho 2025-06-11 17:58:53 +0100
  • f5be7a50c1
    Testing Linkedin Dropin for Antibot msramalho 2025-06-11 16:52:03 +0100
  • 2adcf231f7
    new LinkedIn Dropin for Antibot msramalho 2025-06-11 16:51:52 +0100
  • cd19181d8f
    minor improvements msramalho 2025-06-11 16:51:42 +0100
  • b60469767a
    more flexibility to antibot dropins media finding process msramalho 2025-06-11 16:51:22 +0100
  • d60d02c16e
    improves download_from_url msramalho 2025-06-11 16:50:31 +0100
  • e567bba6f9
    improves docs for how-to and migrations msramalho 2025-06-11 13:37:03 +0100
  • 3cf51dd874
    adds tracker remove feature and tests msramalho 2025-06-11 11:56:42 +0100
  • 69ddb72146
    separate reddit tests msramalho 2025-06-11 11:27:11 +0100
  • 1039e9631f
    new reddit tests with .env.test msramalho 2025-06-11 11:22:23 +0100
  • 79f42c3c41
    Merge pull request #318 from bellingcat/feat/antibot-reddit Miguel Sozinho Ramalho 2025-06-10 18:39:34 +0100
  • 8314833ae8
    removes exclude_media_extensions option msramalho 2025-06-10 18:34:33 +0100
  • 6279610a43
    updates docs msramalho 2025-06-10 18:28:45 +0100
  • fc89d96517
    escape sequence msramalho 2025-06-10 18:04:33 +0100
  • 54fda9cad4
    antibot in docker uses a different user_data_dir msramalho 2025-06-10 18:04:27 +0100
  • 71636233cb
    adds migration information and VkDropin info. msramalho 2025-06-10 17:07:10 +0100
  • fdbe96f2e4
    vk and reddit should work without credentials but log the error msramalho 2025-06-10 16:44:14 +0100
  • 22bd8727df
    python dependencies bump msramalho 2025-06-10 16:43:55 +0100
  • 499c272260
    dependabot switch to monthly msramalho 2025-06-10 16:37:52 +0100
  • f232bc45b8
    Merge pull request #315 from bellingcat/dependabot/docker/webrecorder/browsertrix-crawler-1.6.2 Miguel Sozinho Ramalho 2025-06-10 16:34:30 +0100
  • 4270e06728
    npm update on scripts/settings msramalho 2025-06-10 16:33:47 +0100
  • ca00aa302d
    version bump breaking msramalho 2025-06-10 16:31:32 +0100
  • 773fa82f06
    introduces reddit dropin msramalho 2025-06-10 16:31:19 +0100
  • ef0e909a72
    extractor to auto detect best quality msramalho 2025-06-10 16:29:35 +0100
  • 6bbc7fb47a
    improves antibot flow and makes auth_wall detection optional msramalho 2025-06-10 16:29:07 +0100
  • 809b8c7749
    default dropin introduced msramalho 2025-06-10 16:14:42 +0100
  • 6d82655cc4
    manifest improvement for antibot msramalho 2025-06-10 16:14:34 +0100
  • 6bd493a791
    dropin with new ytdlp feature and helper method msramalho 2025-06-10 16:11:55 +0100
  • 287e823f43
    improves twitter URL cleaning and introduces another bestquality check msramalho 2025-06-10 16:09:38 +0100
  • c815488daa
    adds new URLs to ignore msramalho 2025-06-10 15:44:52 +0100
  • 529721c0f4
    Bump requests from 2.32.3 to 2.32.4 in the pip group dependabot[bot] 2025-06-10 10:40:58 +0000
  • e0bd8dad18
    Bump the python group with 2 updates dependabot[bot] 2025-06-09 20:59:19 +0000
  • f53e34d6bd
    Bump webrecorder/browsertrix-crawler from 1.6.1 to 1.6.2 dependabot[bot] 2025-06-09 20:55:07 +0000
  • 31d4c3ae5e
    Bump the actions group in /scripts/settings with 7 updates dependabot[bot] 2025-06-09 20:42:31 +0000
  • 6ec113910b
    Merge 2b4cd37e02 into 4cfbc3008b Dave Mateer 2025-06-08 20:10:19 +0100
  • 4cfbc3008b
    Merge pull request #313 from bellingcat/feat/antibot-auth Miguel Sozinho Ramalho 2025-06-08 14:42:35 +0100
  • 6f02493ff1
    adds clips extraction to VK, though generic_extractor should still be run for those msramalho 2025-06-08 14:36:55 +0100
  • 1f2d637928
    minor improvements msramalho 2025-06-08 14:16:21 +0100
  • 18cc05a2fe
    allows auth_for_site to receive do.main directly msramalho 2025-06-08 14:16:12 +0100
  • c96fd71f35
    minor cleanup msramalho 2025-06-07 20:06:53 +0100
  • b3183510ea
    installs ffmpeg in GH actions msramalho 2025-06-07 20:03:26 +0100
  • d13a5ef003
    adds tests in minor improvements msramalho 2025-06-07 19:58:18 +0100
  • 48c1ab3c1f
    doc improvements msramalho 2025-06-07 19:14:16 +0100
  • b2ee42ee95
    adds the first antibot dropin: VKontakte msramalho 2025-06-07 19:10:01 +0100
  • 07ff5baf07
    adds Dropin flexible integration for antibot msramalho 2025-06-07 19:09:37 +0100
  • d202d79e0f
    lint msramalho 2025-06-07 19:06:14 +0100
  • e2e6490b49
    minimal changes msramalho 2025-06-07 18:15:21 +0100
  • 952487da30
    adds missing bin dependency msramalho 2025-06-07 18:14:42 +0100
  • c7a84bc97a
    generalizes ydl info to filename method for reusing msramalho 2025-06-07 18:14:08 +0100
  • c0be41950d
    Merge branch 'dev' of https://github.com/bellingcat/auto-archiver into dev msramalho 2025-06-04 17:06:42 +0100
  • 2b4cd37e02
    Merge branch 'dev' into musthavefoldername Miguel Sozinho Ramalho 2025-06-04 15:10:23 +0100
  • ae547ef83f
    Merge pull request #308 from bellingcat/dependabot/npm_and_yarn/scripts/settings/actions-a541a3dacb Miguel Sozinho Ramalho 2025-06-04 15:06:59 +0100
  • 8a897cf601
    minimal changes: standard naming msramalho 2025-06-04 15:06:08 +0100
  • 14c8af5cc8
    Merge pull request #310 from djhmateer/waczscreenshot bug fix Miguel Sozinho Ramalho 2025-06-04 15:01:12 +0100
  • 8e2e18ef75
    Merge pull request #311 from bellingcat/feat/seleniumbase Miguel Sozinho Ramalho 2025-06-04 14:53:31 +0100
  • 5491f3e9e7
    fixing s3 storage tests msramalho 2025-06-04 14:41:00 +0100
  • 264ba82ea0
    finish removing screenshot_enricher references msramalho 2025-06-04 14:31:07 +0100
  • 05231445d9
    removes unnecessary ignored files msramalho 2025-06-04 14:19:25 +0100
  • 2c6be4447f
    linting msramalho 2025-06-04 14:17:38 +0100
  • 5f68c151a0
    removes webdriver utils used by screenshot enricher msramalho 2025-06-04 14:17:19 +0100
  • 6d2aec032f
    Merge remote-tracking branch 'origin/main' into dev msramalho 2025-06-04 14:15:14 +0100
  • bc8cf2fb29
    minor TODO msramalho 2025-06-04 14:10:19 +0100
  • f066111d49
    removes geckodriver dependencies following screenshot enricher removal msramalho 2025-06-04 14:09:13 +0100
  • e6f3826a3a
    dropping screenshot enricher msramalho 2025-06-04 12:08:59 +0100
  • e5a78a5d06
    antibot can be used out of the box msramalho 2025-06-04 12:01:42 +0100
  • 258fb4faaf
    visual HTML preview improvements msramalho 2025-06-04 12:00:40 +0100
  • 5ec00f7811
    adds dependencies for seleniumbase msramalho 2025-06-04 12:00:22 +0100
  • 22408e2a98
    adds test for antibot msramalho 2025-06-04 11:59:59 +0100
  • 378b1a6d22
    expand S3 objects content type for better preview results in non-latin languages msramalho 2025-06-04 11:53:41 +0100
  • d130c1b3fa
    WIP attempt at ytdlp impersonation msramalho 2025-06-04 11:53:18 +0100
  • cbd189c97d
    general cleanup msramalho 2025-06-04 11:53:01 +0100
  • d2e8f1a512
    introduces antibot step with seleniumbase msramalho 2025-06-04 11:20:46 +0100
  • 488802b632
    poetry update msramalho 2025-06-04 11:08:44 +0100
  • c772082f0e counter_screenshots to counter_warc_files in wacz_extractor so don't get error about add mulitple items with same id. Dave Mateer 2025-06-03 12:30:18 +0100
  • ee68f3efee
    Merge remote-tracking branch 'origin/main' into feat/seleniumbase msramalho 2025-06-03 11:05:16 +0100
  • b1d63f7188
    Bump the python group with 2 updates dependabot[bot] 2025-06-02 20:26:36 +0000
  • efe2a1a8b6
    Bump the actions group in /scripts/settings with 4 updates dependabot[bot] 2025-06-02 20:21:07 +0000
  • 6735fa890b
    v1.0.1 dependency updates, generic extractor improvements (#307) main v1.0.1 Miguel Sozinho Ramalho 2025-06-02 20:57:12 +0100
  • 69028588b3
    linting msramalho 2025-06-02 20:04:34 +0100