msramalho
|
abaf86c776
|
Bump version to v0.5.26 for release
|
2023-07-02 18:42:59 +02:00 |
msramalho
|
8005a1955a
|
fixes #82 twitter api walls
|
2023-07-02 18:42:43 +02:00 |
msramalho
|
04f827f183
|
Bump version to v0.5.25 for release
|
2023-06-26 18:15:45 +01:00 |
msramalho
|
a2c6cdc111
|
Bump version to v0.5.24 for release
|
2023-06-26 17:58:47 +01:00 |
msramalho
|
a0971fc601
|
final code review changes
|
2023-06-26 17:32:19 +01:00 |
msramalho
|
0cba2c25c6
|
get all media method
|
2023-06-26 17:28:19 +01:00 |
msramalho
|
7c0b05b276
|
new column
|
2023-06-26 17:27:57 +01:00 |
msramalho
|
3bbfdf6eba
|
fix: excluding screenshots
|
2023-06-26 17:27:49 +01:00 |
msramalho
|
a7a6bda1c2
|
improve missing col behaviour to error log
|
2023-06-26 17:27:37 +01:00 |
msramalho
|
d80145002d
|
formatter to accommodate properties of inner media
|
2023-06-26 17:06:50 +01:00 |
msramalho
|
b4f86d0e8d
|
refactor to hash all images and save hex string
|
2023-06-26 17:06:30 +01:00 |
msramalho
|
6cf3e109ed
|
refactor discovery of inner media elements
|
2023-06-26 17:05:25 +01:00 |
Emiel de Heij
|
9fc09c724b
|
add module for perceptual hashing with pdq
|
2023-06-26 15:25:55 +02:00 |
Logan Williams
|
cc66ee3fd4
|
bump to patch 23
|
2023-06-06 12:24:43 -06:00 |
Logan Williams
|
b3b727b005
|
Fix ValueError
|
2023-06-06 12:13:08 -06:00 |
msramalho
|
ee37b20e6c
|
fix: on missing col
|
2023-05-24 20:25:30 +01:00 |
msramalho
|
a184bf7b97
|
Bump version to v0.5.20 for release
|
2023-05-24 20:24:35 +01:00 |
msramalho
|
e535f44a88
|
optional folder
|
2023-05-24 20:24:15 +01:00 |
msramalho
|
0f28bf0e35
|
Bump version to v0.5.19 for release
|
2023-05-24 19:57:51 +01:00 |
msramalho
|
18a8636552
|
feat: new DB for auto-archiver-api
|
2023-05-24 19:24:53 +01:00 |
msramalho
|
81be65c828
|
Bump version to v0.5.18 for release
|
2023-05-24 11:19:02 +01:00 |
msramalho
|
0a91863212
|
typing fixes
|
2023-05-24 11:18:39 +01:00 |
msramalho
|
3ad8349e3f
|
Bump version to v0.5.17 for release
|
2023-05-23 19:05:53 +01:00 |
msramalho
|
2768225cd1
|
fix: generator not called
|
2023-05-23 19:05:47 +01:00 |
msramalho
|
3e44b9b577
|
Bump version to v0.5.16 for release
|
2023-05-23 18:12:56 +01:00 |
msramalho
|
1a5797d0f8
|
feat: orchestrator fed returns archive result
|
2023-05-23 18:12:04 +01:00 |
msramalho
|
768b8fce9f
|
Bump version to v0.5.15 for release
|
2023-05-19 12:35:26 +01:00 |
msramalho
|
613b1f1e50
|
properly overwrite configs
|
2023-05-19 12:35:19 +01:00 |
msramalho
|
919c37bfb6
|
Bump version to v0.5.14 for release
|
2023-05-19 12:18:02 +01:00 |
msramalho
|
a655b3c987
|
gsheet accepts ID too
|
2023-05-19 12:17:34 +01:00 |
msramalho
|
3da9c9cf8f
|
Bump version to v0.5.13 for release
|
2023-05-19 11:49:38 +01:00 |
msramalho
|
68e9d2a2ce
|
allows yaml config to be overwritten
|
2023-05-19 11:49:02 +01:00 |
Logan Williams
|
c47da0a46f
|
Fix issue with profiles in browsertrix
|
2023-05-11 15:08:27 +02:00 |
msramalho
|
b7c69c0f0d
|
Bump version to v0.5.12 for release
|
2023-05-10 18:58:34 +01:00 |
msramalho
|
45b982ec38
|
fix: max chars on sheets cell
|
2023-05-10 18:57:33 +01:00 |
msramalho
|
e11be449e8
|
fix: delete completed whisper tasks
|
2023-05-10 18:57:17 +01:00 |
Logan Williams
|
7a34915f8e
|
Remove old auto auto archiver file
|
2023-05-10 11:16:54 +02:00 |
msramalho
|
9d078a648f
|
version bump
|
2023-05-10 09:57:47 +01:00 |
Logan Williams
|
ac82764ffc
|
Working, but some cleanup still necessary
|
2023-05-09 19:34:16 +02:00 |
Logan Williams
|
0fae7d96fb
|
Detect running in docker container in WACZ enricher
|
2023-05-09 19:34:16 +02:00 |
msramalho
|
9c25b33f1c
|
fix: multiple storages with folder column
|
2023-05-09 12:14:07 +01:00 |
msramalho
|
ae3e607705
|
fix: deprecating thumbnail_index
|
2023-05-09 11:29:05 +01:00 |
msramalho
|
c1a60fde8a
|
fix: deprecates duration column
|
2023-05-09 11:26:19 +01:00 |
msramalho
|
875e1de589
|
feat: re-enable HASH on gsheet
|
2023-05-09 11:17:44 +01:00 |
msramalho
|
8f3d4e05c3
|
fixing bug in whisper enricher
|
2023-05-04 09:36:10 +01:00 |
msramalho
|
3bd6bed825
|
Bump version to v0.5.10 for release
|
2023-05-02 19:44:00 +01:00 |
msramalho
|
2659675f06
|
skip trim
|
2023-05-02 19:06:10 +01:00 |
msramalho
|
9d44f4b207
|
content append instead of replace
|
2023-05-02 19:06:00 +01:00 |
msramalho
|
5b0bff612e
|
whisper transcripts to content
|
2023-05-02 19:05:32 +01:00 |
msramalho
|
ae7ceba0e5
|
better debug
|
2023-05-02 19:05:18 +01:00 |
msramalho
|
97821a81bc
|
log cleanup
|
2023-05-02 19:05:06 +01:00 |
msramalho
|
9191b38cf2
|
tbot archiver works
|
2023-05-02 19:04:51 +01:00 |
msramalho
|
567edfc35e
|
Bump version to v0.5.8 for release
|
2023-05-02 14:30:49 +01:00 |
msramalho
|
8c22a9df72
|
fixes "url-not-found"
|
2023-05-02 14:30:07 +01:00 |
msramalho
|
d2d6db162b
|
Bump version to v0.5.7 for release
|
2023-04-18 19:28:51 +01:00 |
msramalho
|
5cfbcc0137
|
html template copy ux
|
2023-04-18 19:28:43 +01:00 |
msramalho
|
5fdaa6c739
|
whisper improvements
|
2023-04-18 19:28:36 +01:00 |
msramalho
|
3d389ee05b
|
add url info
|
2023-04-18 19:14:47 +01:00 |
msramalho
|
0ecbed0df0
|
Bump version to v0.5.6 for release
|
2023-04-18 18:49:08 +01:00 |
msramalho
|
69bcfea2eb
|
to_json fix
|
2023-04-18 18:48:51 +01:00 |
msramalho
|
2e2e695444
|
whisper enricher
|
2023-03-23 18:50:37 +00:00 |
msramalho
|
493055a8d9
|
cleanup
|
2023-03-23 18:50:30 +00:00 |
msramalho
|
6f6eb2db7a
|
Archiving Context refactor complete
|
2023-03-23 14:28:45 +00:00 |
msramalho
|
906ed0f6e0
|
creating global context and refactoring tmp_dir logic
|
2023-03-23 11:17:38 +00:00 |
msramalho
|
39818e648a
|
Bump version to v0.4.5 for release
|
2023-03-16 15:05:42 +00:00 |
R. Miles McCain
|
6be7536fad
|
Fix hash enricher for flatfile output (closes #71)
|
2023-03-14 13:37:54 -07:00 |
msramalho
|
0654e8c5c6
|
hash calculation in chunks to avoid exhausting RAM
|
2023-03-10 11:34:29 +00:00 |
msramalho
|
0e3c427371
|
Bump version to v0.4.3 for release
|
2023-02-27 10:30:06 +01:00 |
msramalho
|
7497bc08c0
|
Bump version to v0.4.2 for release
|
2023-02-23 17:14:29 +01:00 |
msramalho
|
49863768fe
|
vk updates
|
2023-02-22 18:35:15 +01:00 |
msramalho
|
cd81cae559
|
auth wall for WACZ
|
2023-02-20 16:08:45 +00:00 |
msramalho
|
23894fad51
|
normalize columns
|
2023-02-20 16:08:35 +00:00 |
msramalho
|
876988b587
|
detect invalid url messages instagram bot
|
2023-02-20 12:22:52 +00:00 |
msramalho
|
f95293b84b
|
support for multiple media instagram
|
2023-02-20 11:25:02 +00:00 |
msramalho
|
2fbcbe4e8b
|
double session issues
|
2023-02-20 11:11:39 +00:00 |
msramalho
|
1970fa3c82
|
new instagram archiver via telegram bot
|
2023-02-17 16:15:25 +00:00 |
msramalho
|
aa5430451e
|
instagram archiver via telegram bot
|
2023-02-17 15:46:29 +00:00 |
msramalho
|
5505255ea3
|
url auth wall detect
|
2023-02-17 15:45:58 +00:00 |
msramalho
|
da17b3f68a
|
name fix
|
2023-02-17 15:45:35 +00:00 |
msramalho
|
db45e0980e
|
Bump version to v0.3.0 for release
|
2023-02-08 22:13:46 +00:00 |
msramalho
|
2a7ece5dcc
|
cleanups and docs
|
2023-02-08 22:13:19 +00:00 |
msramalho
|
d14adf0242
|
Bump version to v0.2.24 for release
|
2023-02-08 11:22:53 +00:00 |
msramalho
|
75459d2880
|
docker
|
2023-02-08 11:22:38 +00:00 |
msramalho
|
94406bda7a
|
Bump version to v0.2.23 for release
|
2023-02-08 10:42:12 +00:00 |
msramalho
|
6244f35cff
|
Bump version to v0.2.22 for release
|
2023-02-08 09:50:36 +00:00 |
msramalho
|
adb3a7332f
|
version
|
2023-02-08 09:49:48 +00:00 |
msramalho
|
0d903fa196
|
Bump version to v0.2.21 for release
|
2023-02-08 09:42:26 +00:00 |
msramalho
|
57e7023f64
|
Bump version to v0.2.20 for release
|
2023-02-08 09:27:53 +00:00 |
msramalho
|
be9e4b2032
|
Bump version to v0.2.19 for release
|
2023-02-08 00:02:55 +00:00 |
msramalho
|
59603d1136
|
Bump version to v0.2.18 for release
|
2023-02-07 23:59:45 +00:00 |
msramalho
|
d31b3dda52
|
Bump version to v0.2.17 for release
|
2023-02-07 23:56:42 +00:00 |
msramalho
|
fa593ee9e2
|
Bump version to v0.2.16 for release
|
2023-02-07 23:49:12 +00:00 |
msramalho
|
9d2f14d3a1
|
Bump version to v0.2.15 for release
|
2023-02-07 23:44:04 +00:00 |
msramalho
|
f81ff14faa
|
license to publish
|
2023-02-07 23:43:50 +00:00 |
msramalho
|
3a70036e71
|
Bump version to v0.2.13 for release
|
2023-02-07 23:31:56 +00:00 |
msramalho
|
4060f3dfb2
|
Bump version to v0.2.12 for release
|
2023-02-07 23:27:44 +00:00 |
msramalho
|
8a419d34d5
|
Bump version to v0.2.11 for release
|
2023-02-07 23:24:51 +00:00 |
msramalho
|
8bbe7e2057
|
back to setup
|
2023-02-07 23:24:44 +00:00 |
msramalho
|
98f4702b9c
|
Bump version to v0.2.10 for release
|
2023-02-07 23:21:26 +00:00 |
msramalho
|
e19a4c85ed
|
pipfile test
|
2023-02-07 23:21:18 +00:00 |
msramalho
|
676bc905c6
|
Bump version to v0.2.9 for release
|
2023-02-07 23:15:56 +00:00 |
msramalho
|
1b51f49d8f
|
Bump version to v0.2.8 for release
|
2023-02-07 23:02:56 +00:00 |
msramalho
|
fb8bb684fe
|
Bump version to v0.2.7 for release
|
2023-02-07 23:00:12 +00:00 |
msramalho
|
67037ab291
|
Bump version to v0.2.6 for release
|
2023-02-07 22:56:56 +00:00 |
msramalho
|
d205846d1d
|
Bump version to v0.2.5 for release
|
2023-02-07 22:54:07 +00:00 |
msramalho
|
c198257e23
|
Bump version to v0.2.4 for release
|
2023-02-07 22:33:01 +00:00 |
msramalho
|
5c9ca9da1d
|
Bump version to v0.2.3 for release
|
2023-02-07 22:28:53 +00:00 |
msramalho
|
3acb1b5f64
|
Bump version to v0.2.2 for release
|
2023-02-07 22:20:53 +00:00 |
msramalho
|
217ec40921
|
Bump version to v0.2.1 for release
|
2023-02-07 22:15:43 +00:00 |
msramalho
|
9b4a41e654
|
Bump version to v0.2.0 for release
|
2023-02-07 22:07:23 +00:00 |
msramalho
|
29680b0be5
|
gsheet_db bug fix on missing thumbnail
|
2023-02-07 21:59:41 +00:00 |
msramalho
|
51a3134065
|
adds gd_drive storage
|
2023-02-07 21:59:24 +00:00 |
msramalho
|
32a8db1223
|
disable bot_token
|
2023-02-02 14:01:08 +00:00 |
msramalho
|
4854929a1d
|
thumbnail and bot token
|
2023-02-02 13:49:56 +00:00 |
msramalho
|
e758bd076b
|
test
|
2023-02-02 12:43:23 +00:00 |
msramalho
|
9bcca427a0
|
wacz in gsheets
|
2023-02-02 12:41:06 +00:00 |
msramalho
|
77a8c290f7
|
logs
|
2023-02-02 12:24:04 +00:00 |
msramalho
|
2f7b6dfc44
|
revert
|
2023-02-02 12:23:43 +00:00 |
msramalho
|
ab4bce6602
|
test
|
2023-02-02 12:20:30 +00:00 |
msramalho
|
8b8845d607
|
bot_token
|
2023-02-02 12:15:57 +00:00 |
msramalho
|
80b4f207d9
|
logs
|
2023-02-02 12:11:46 +00:00 |
msramalho
|
9159f0abd5
|
logs
|
2023-02-02 12:05:23 +00:00 |
msramalho
|
cf4be2f339
|
logs
|
2023-02-02 11:59:53 +00:00 |
msramalho
|
d8a79b930b
|
improve logs
|
2023-02-02 11:55:22 +00:00 |
msramalho
|
11eda6d03e
|
staticmethod fix
|
2023-02-02 11:26:00 +00:00 |
msramalho
|
5b0593ce82
|
arg parse fix
|
2023-02-02 11:00:24 +00:00 |
msramalho
|
39bfde2026
|
thumbnails bug fix
|
2023-02-01 00:35:48 +00:00 |
msramalho
|
d1e4dde3f6
|
fixing imports
|
2023-01-27 00:19:58 +00:00 |
msramalho
|
ac000d5943
|
cleanup
|
2023-01-27 00:03:30 +00:00 |
msramalho
|
f5b7c3a5ea
|
mute formatter and docker
|
2023-01-26 23:38:58 +00:00 |
msramalho
|
c261361ac8
|
try/catch enrichers
|
2023-01-26 23:03:51 +00:00 |
msramalho
|
2508bb8a1b
|
cleanup + rearchivable logic
|
2023-01-26 23:01:34 +00:00 |
msramalho
|
9dd8afed8c
|
minor improvements
|
2023-01-22 23:15:54 +00:00 |
msramalho
|
092ffdb6d8
|
replaywebpage
|
2023-01-22 00:48:09 +00:00 |
msramalho
|
746f6a333e
|
further cleanup
|
2023-01-21 19:57:54 +00:00 |
msramalho
|
b763fc4188
|
final naming cleanup + new feeders/dbs
|
2023-01-21 19:44:12 +00:00 |
msramalho
|
753039240f
|
pyproject
|
2023-01-21 19:01:02 +00:00 |
msramalho
|
ea2c266fa2
|
clean up and wacz WIP
|
2023-01-19 00:27:11 +00:00 |
msramalho
|
9bbc13e9be
|
vk and yt-dlp
|
2023-01-18 23:15:25 +00:00 |
msramalho
|
176ce7e8da
|
vk cleanup
|
2023-01-18 21:37:29 +00:00 |
msramalho
|
eb0859fbaf
|
vk archiver
|
2023-01-18 21:34:40 +00:00 |
msramalho
|
085376f63f
|
telegram archiver
|
2023-01-18 21:14:20 +00:00 |
msramalho
|
63d1abbe4b
|
tiktok archiver though info is no longer working
|
2023-01-18 16:56:35 +00:00 |
msramalho
|
1def8bb03d
|
instagram archiver
|
2023-01-18 16:16:23 +00:00 |
msramalho
|
725bab8240
|
twitter archivers
|
2023-01-18 00:15:18 +00:00 |
msramalho
|
f1bc83818d
|
template updates
|
2023-01-17 17:01:25 +00:00 |
msramalho
|
47dc788143
|
thumbnails enricher
|
2023-01-17 16:29:27 +00:00 |
msramalho
|
74e50eccf1
|
hash enricher and media refactor
|
2023-01-13 02:12:08 +00:00 |
msramalho
|
6ca46417fe
|
local storage + multiple storage support
|
2023-01-12 02:09:39 +00:00 |
msramalho
|
0cb593fd21
|
wayback enricher ready
|
2023-01-11 00:03:47 +00:00 |