msramalho
216226e7cc
browsertrix version bump
2025-06-17 19:22:20 +01:00
Miguel Sozinho Ramalho
f232bc45b8
Merge pull request #315 from bellingcat/dependabot/docker/webrecorder/browsertrix-crawler-1.6.2
...
Bump webrecorder/browsertrix-crawler from 1.6.1 to 1.6.2
2025-06-10 16:34:30 +01:00
dependabot[bot]
f53e34d6bd
Bump webrecorder/browsertrix-crawler from 1.6.1 to 1.6.2
...
Bumps webrecorder/browsertrix-crawler from 1.6.1 to 1.6.2.
---
updated-dependencies:
- dependency-name: webrecorder/browsertrix-crawler
dependency-version: 1.6.2
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
2025-06-09 20:55:07 +00:00
msramalho
f066111d49
removes geckodriver dependencies following screenshot enricher removal
2025-06-04 14:09:13 +01:00
msramalho
5ec00f7811
adds dependencies for seleniumbase
2025-06-04 12:00:22 +01:00
Miguel Sozinho Ramalho
6735fa890b
v1.0.1 dependency updates, generic extractor improvements ( #307 )
...
* wacz: allow exceptional cases where more than one resource image is available
* improves generic extractor edge-cases and yt-dlp updates
* REMOVES vk_extractor until further notice
* bumps browsertrix in docker image
* npm version bump on scripts/settings
* poetry updates
* Changed log level on gsheet_feeder_db started from warning to info (#301 )
* closes 305 and further fixes finding local downloads from uncommon ytdlp extractors
* use ffmpeg -bitexact to reduce duplicate content storing
* formatting
* adds yt-dlp curl-cffi
* version bump
* linting
---------
Co-authored-by: Dave Mateer <davemateer@gmail.com>
2025-06-02 20:57:12 +01:00
Patrick Robertson
5055402c2a
Bump browsertrix version
2025-03-24 17:39:44 +04:00
Patrick Robertson
e756f1504f
Remove geckodriver .tar file
2025-03-07 11:52:14 +00:00
Patrick Robertson
a47e18ef9a
Bump gecko driver to 0.36.0
2025-03-03 16:00:11 +00:00
Patrick Robertson
dea0a49600
Download correct gecko-driver for the platform + fix setting executable path when running in Docker
...
Fixes #232
2025-03-03 15:41:44 +00:00
Miguel Sozinho Ramalho
a6fc4e1bb1
modifies base docker image to use browsertrix 1.4.2 ( #182 )
...
* modifies base image to newest browsertrix version
* modify browsertrix cmd args based on recent experience
2025-01-24 13:59:29 +00:00
erinhmclark
33686ea851
Update versions for GH Actions and Geckodriver.
2025-01-15 17:35:42 +00:00
erinhmclark
84ee1b422f
Update and restrict versions of Poetry and Python.
2025-01-13 17:42:51 +00:00
erinhmclark
c69a5fa1c9
Refactor Dockerfile for multi-stage builds.
...
Combining environment and runtime stages due to Poetry's dependency on source code.
2025-01-12 12:38:12 +00:00
erinhmclark
cc490f9c10
Updated Dockerfile (not optimised yet)
2025-01-12 12:15:56 +00:00
erinhmclark
660ee82c67
Update Dockerfile for poetry.
...
Note: Review security with curl installation. Currently locked to known version, but additional checks could be added.
2025-01-12 12:15:56 +00:00
msramalho
9c7824de57
browsertrix docker updates
2024-04-15 19:01:55 +01:00
Kai
f7839a99cc
Add configs for path to write and read wacz archives ( #93 )
...
Co-authored-by: msramalho <19508417+msramalho@users.noreply.github.com>
2023-09-14 17:49:37 +01:00
msramalho
0dd45d90f1
fix: docker+wacz troubles
2023-09-08 15:09:50 +01:00
msramalho
17d9bf694f
fix docker image so as not to remove browsertrix files
2023-09-06 17:07:10 +01:00
msramalho
804fcb1204
browsertrix dependencies isolated into dockerfile
2023-08-24 16:57:58 +01:00
msramalho
1695954c98
new metadata enricher
2023-07-28 12:46:30 +01:00
msramalho
31f6aae7b9
fix: screenshots in docker
2023-05-10 13:29:42 +01:00
Miguel Sozinho Ramalho
b67a7b818a
Merge pull request #75 from bellingcat/feature/browsertrix
2023-05-10 10:14:40 +01:00
Logan Williams
9cb73c073f
Simplify entrypoint
2023-05-10 11:08:49 +02:00
msramalho
e150370657
updates docker instructions
2023-05-10 09:51:53 +01:00
Logan Williams
2c5b115fbe
Fix lock file issue
2023-05-09 19:34:16 +02:00
Logan Williams
bda812f850
Clean up comments
2023-05-09 19:34:16 +02:00
Logan Williams
ac82764ffc
Working, but some cleanup still necessary
2023-05-09 19:34:16 +02:00
Logan Williams
0fae7d96fb
Detect running in docker container in WACZ enricher
2023-05-09 19:34:16 +02:00
Logan Williams
2f7181ced6
Use browsertrix base image
2023-05-09 19:34:16 +02:00
msramalho
75459d2880
docker
2023-02-08 11:22:38 +00:00
msramalho
f5b7c3a5ea
mute formatter and docker
2023-01-26 23:38:58 +00:00
msramalho
04263094ad
WIP docker changes for cli and auto_archiver
2022-11-10 17:46:40 +00:00
msramalho
390b84eb22
dockerization complete
2022-11-08 15:55:33 +00:00
msramalho
09f47383a3
dockerfile improvements
2022-11-08 13:59:35 +00:00
msramalho
a9df992f66
WiP
2022-11-02 16:51:32 +00:00
msramalho
c8fa077df7
docker initial files
2022-10-31 17:10:55 +00:00