Commit graph

265 Commits (main)

Author SHA1 Message Date
msramalho abaf86c776 Bump version to v0.5.26 for release 2023-07-02 18:42:59 +02:00
msramalho 8005a1955a fixes #82 twitter api walls 2023-07-02 18:42:43 +02:00
msramalho 04f827f183 Bump version to v0.5.25 for release 2023-06-26 18:15:45 +01:00
msramalho a2c6cdc111 Bump version to v0.5.24 for release 2023-06-26 17:58:47 +01:00
msramalho a0971fc601 final code review changes 2023-06-26 17:32:19 +01:00
msramalho 0cba2c25c6 get all media method 2023-06-26 17:28:19 +01:00
msramalho 7c0b05b276 new column 2023-06-26 17:27:57 +01:00
msramalho 3bbfdf6eba fix: excluding screenshots 2023-06-26 17:27:49 +01:00
msramalho a7a6bda1c2 improve missing col behaviour to error log 2023-06-26 17:27:37 +01:00
msramalho d80145002d formatter to accommodate properties of inner media 2023-06-26 17:06:50 +01:00
msramalho b4f86d0e8d refactor to hash all images and save hex string 2023-06-26 17:06:30 +01:00
msramalho 6cf3e109ed refactor discovery of inner media elements 2023-06-26 17:05:25 +01:00
Emiel de Heij 9fc09c724b add module for perceptual hashing with pdq 2023-06-26 15:25:55 +02:00
Logan Williams cc66ee3fd4 bump to patch 23 2023-06-06 12:24:43 -06:00
Logan Williams b3b727b005 Fix ValueError 2023-06-06 12:13:08 -06:00
msramalho ee37b20e6c fix: on missing col 2023-05-24 20:25:30 +01:00
msramalho a184bf7b97 Bump version to v0.5.20 for release 2023-05-24 20:24:35 +01:00
msramalho e535f44a88 optional folder 2023-05-24 20:24:15 +01:00
msramalho 0f28bf0e35 Bump version to v0.5.19 for release 2023-05-24 19:57:51 +01:00
msramalho 18a8636552 feat: new DB for auto-archiver-api 2023-05-24 19:24:53 +01:00
msramalho 81be65c828 Bump version to v0.5.18 for release 2023-05-24 11:19:02 +01:00
msramalho 0a91863212 typing fixes 2023-05-24 11:18:39 +01:00
msramalho 3ad8349e3f Bump version to v0.5.17 for release 2023-05-23 19:05:53 +01:00
msramalho 2768225cd1 fix: generator not called 2023-05-23 19:05:47 +01:00
msramalho 3e44b9b577 Bump version to v0.5.16 for release 2023-05-23 18:12:56 +01:00
msramalho 1a5797d0f8 feat: orchestrator fed returns archive result 2023-05-23 18:12:04 +01:00
msramalho 768b8fce9f Bump version to v0.5.15 for release 2023-05-19 12:35:26 +01:00
msramalho 613b1f1e50 properly overwrite configs 2023-05-19 12:35:19 +01:00
msramalho 919c37bfb6 Bump version to v0.5.14 for release 2023-05-19 12:18:02 +01:00
msramalho a655b3c987 gsheet accepts ID too 2023-05-19 12:17:34 +01:00
msramalho 3da9c9cf8f Bump version to v0.5.13 for release 2023-05-19 11:49:38 +01:00
msramalho 68e9d2a2ce allows yaml config to be overwritten 2023-05-19 11:49:02 +01:00
Logan Williams c47da0a46f Fix issue with profiles in browsertrix 2023-05-11 15:08:27 +02:00
msramalho b7c69c0f0d Bump version to v0.5.12 for release 2023-05-10 18:58:34 +01:00
msramalho 45b982ec38 fix: max chars on sheets cell 2023-05-10 18:57:33 +01:00
msramalho e11be449e8 fix: delete completed whisper tasks 2023-05-10 18:57:17 +01:00
Logan Williams 7a34915f8e Remove old auto auto archiver file 2023-05-10 11:16:54 +02:00
msramalho 9d078a648f version bump 2023-05-10 09:57:47 +01:00
Logan Williams ac82764ffc Working, but some cleanup still necessary 2023-05-09 19:34:16 +02:00
Logan Williams 0fae7d96fb Detect running in docker container in WACZ enricher 2023-05-09 19:34:16 +02:00
msramalho 9c25b33f1c fix: multiple storages with folder column 2023-05-09 12:14:07 +01:00
msramalho ae3e607705 fix: depreacating thumbnail_index 2023-05-09 11:29:05 +01:00
msramalho c1a60fde8a fix: deprecates duration column 2023-05-09 11:26:19 +01:00
msramalho 875e1de589 feat: re-enable HASH on gsheet 2023-05-09 11:17:44 +01:00
msramalho 8f3d4e05c3 fixing bug in whisper wnericher 2023-05-04 09:36:10 +01:00
msramalho 3bd6bed825 Bump version to v0.5.10 for release 2023-05-02 19:44:00 +01:00
msramalho 2659675f06 skip trim 2023-05-02 19:06:10 +01:00
msramalho 9d44f4b207 content append instead of replace 2023-05-02 19:06:00 +01:00
msramalho 5b0bff612e whisper transcripts to content 2023-05-02 19:05:32 +01:00
msramalho ae7ceba0e5 better debug 2023-05-02 19:05:18 +01:00
msramalho 97821a81bc log cleanup 2023-05-02 19:05:06 +01:00
msramalho 9191b38cf2 tbot archiver works 2023-05-02 19:04:51 +01:00
msramalho 567edfc35e Bump version to v0.5.8 for release 2023-05-02 14:30:49 +01:00
msramalho 8c22a9df72 fixes "url-not-found" 2023-05-02 14:30:07 +01:00
msramalho d2d6db162b Bump version to v0.5.7 for release 2023-04-18 19:28:51 +01:00
msramalho 5cfbcc0137 html template copy ux 2023-04-18 19:28:43 +01:00
msramalho 5fdaa6c739 whisper improvements 2023-04-18 19:28:36 +01:00
msramalho 3d389ee05b add url info 2023-04-18 19:14:47 +01:00
msramalho 0ecbed0df0 Bump version to v0.5.6 for release 2023-04-18 18:49:08 +01:00
msramalho 69bcfea2eb to_json fix 2023-04-18 18:48:51 +01:00
msramalho 2e2e695444 whisper enricher 2023-03-23 18:50:37 +00:00
msramalho 493055a8d9 cleanup 2023-03-23 18:50:30 +00:00
msramalho 6f6eb2db7a Archiving Context refactor complete 2023-03-23 14:28:45 +00:00
msramalho 906ed0f6e0 creating global context and refactoring tmp_dir logic 2023-03-23 11:17:38 +00:00
msramalho 39818e648a Bump version to v0.4.5 for release 2023-03-16 15:05:42 +00:00
R. Miles McCain 6be7536fad Fix hash enricher for flatfile output (closes #71) 2023-03-14 13:37:54 -07:00
msramalho 0654e8c5c6 hash calculation in chunks to avoid exhausting RAM 2023-03-10 11:34:29 +00:00
msramalho 0e3c427371 Bump version to v0.4.3 for release 2023-02-27 10:30:06 +01:00
msramalho 7497bc08c0 Bump version to v0.4.2 for release 2023-02-23 17:14:29 +01:00
msramalho 49863768fe vk updates 2023-02-22 18:35:15 +01:00
msramalho cd81cae559 auth wall for WACZ 2023-02-20 16:08:45 +00:00
msramalho 23894fad51 normalize columns 2023-02-20 16:08:35 +00:00
msramalho 876988b587 detect invalid url messages instagram bot 2023-02-20 12:22:52 +00:00
msramalho f95293b84b support for multiple media instagram 2023-02-20 11:25:02 +00:00
msramalho 2fbcbe4e8b double session issues 2023-02-20 11:11:39 +00:00
msramalho 1970fa3c82 new instagram archiver via telegram bot 2023-02-17 16:15:25 +00:00
msramalho aa5430451e instagram archiver via telegram bot 2023-02-17 15:46:29 +00:00
msramalho 5505255ea3 url auth wall detect 2023-02-17 15:45:58 +00:00
msramalho da17b3f68a name fix 2023-02-17 15:45:35 +00:00
msramalho db45e0980e Bump version to v0.3.0 for release 2023-02-08 22:13:46 +00:00
msramalho 2a7ece5dcc cleanups and docs 2023-02-08 22:13:19 +00:00
msramalho d14adf0242 Bump version to v0.2.24 for release 2023-02-08 11:22:53 +00:00
msramalho 75459d2880 docker 2023-02-08 11:22:38 +00:00
msramalho 94406bda7a Bump version to v0.2.23 for release 2023-02-08 10:42:12 +00:00
msramalho 6244f35cff Bump version to v0.2.22 for release 2023-02-08 09:50:36 +00:00
msramalho adb3a7332f version 2023-02-08 09:49:48 +00:00
msramalho 0d903fa196 Bump version to v0.2.21 for release 2023-02-08 09:42:26 +00:00
msramalho 57e7023f64 Bump version to v0.2.20 for release 2023-02-08 09:27:53 +00:00
msramalho be9e4b2032 Bump version to v0.2.19 for release 2023-02-08 00:02:55 +00:00
msramalho 59603d1136 Bump version to v0.2.18 for release 2023-02-07 23:59:45 +00:00
msramalho d31b3dda52 Bump version to v0.2.17 for release 2023-02-07 23:56:42 +00:00
msramalho fa593ee9e2 Bump version to v0.2.16 for release 2023-02-07 23:49:12 +00:00
msramalho 9d2f14d3a1 Bump version to v0.2.15 for release 2023-02-07 23:44:04 +00:00
msramalho f81ff14faa license to publish 2023-02-07 23:43:50 +00:00
msramalho 3a70036e71 Bump version to v0.2.13 for release 2023-02-07 23:31:56 +00:00
msramalho 4060f3dfb2 Bump version to v0.2.12 for release 2023-02-07 23:27:44 +00:00
msramalho 8a419d34d5 Bump version to v0.2.11 for release 2023-02-07 23:24:51 +00:00
msramalho 8bbe7e2057 back to setup 2023-02-07 23:24:44 +00:00
msramalho 98f4702b9c Bump version to v0.2.10 for release 2023-02-07 23:21:26 +00:00
msramalho e19a4c85ed pipfile test 2023-02-07 23:21:18 +00:00
msramalho 676bc905c6 Bump version to v0.2.9 for release 2023-02-07 23:15:56 +00:00
msramalho 1b51f49d8f Bump version to v0.2.8 for release 2023-02-07 23:02:56 +00:00
msramalho fb8bb684fe Bump version to v0.2.7 for release 2023-02-07 23:00:12 +00:00
msramalho 67037ab291 Bump version to v0.2.6 for release 2023-02-07 22:56:56 +00:00
msramalho d205846d1d Bump version to v0.2.5 for release 2023-02-07 22:54:07 +00:00
msramalho c198257e23 Bump version to v0.2.4 for release 2023-02-07 22:33:01 +00:00
msramalho 5c9ca9da1d Bump version to v0.2.3 for release 2023-02-07 22:28:53 +00:00
msramalho 3acb1b5f64 Bump version to v0.2.2 for release 2023-02-07 22:20:53 +00:00
msramalho 217ec40921 Bump version to v0.2.1 for release 2023-02-07 22:15:43 +00:00
msramalho 9b4a41e654 Bump version to v0.2.0 for release 2023-02-07 22:07:23 +00:00
msramalho 29680b0be5 gsheet_db bug fix on missing thumbnail 2023-02-07 21:59:41 +00:00
msramalho 51a3134065 adds gd_drive storage 2023-02-07 21:59:24 +00:00
msramalho 32a8db1223 disable bot_token 2023-02-02 14:01:08 +00:00
msramalho 4854929a1d thumbnail and bot token 2023-02-02 13:49:56 +00:00
msramalho e758bd076b test 2023-02-02 12:43:23 +00:00
msramalho 9bcca427a0 wacz in gsheets 2023-02-02 12:41:06 +00:00
msramalho 77a8c290f7 logs 2023-02-02 12:24:04 +00:00
msramalho 2f7b6dfc44 revert 2023-02-02 12:23:43 +00:00
msramalho ab4bce6602 test 2023-02-02 12:20:30 +00:00
msramalho 8b8845d607 bot_token 2023-02-02 12:15:57 +00:00
msramalho 80b4f207d9 logs 2023-02-02 12:11:46 +00:00
msramalho 9159f0abd5 logs 2023-02-02 12:05:23 +00:00
msramalho cf4be2f339 logs 2023-02-02 11:59:53 +00:00
msramalho d8a79b930b imrpove logs 2023-02-02 11:55:22 +00:00
msramalho 11eda6d03e staticmethod fix 2023-02-02 11:26:00 +00:00
msramalho 5b0593ce82 arg parse fix 2023-02-02 11:00:24 +00:00
msramalho 39bfde2026 thumbnails bug fix 2023-02-01 00:35:48 +00:00
msramalho d1e4dde3f6 fixing imports 2023-01-27 00:19:58 +00:00
msramalho ac000d5943 cleanup 2023-01-27 00:03:30 +00:00
msramalho f5b7c3a5ea mute formatter and docker 2023-01-26 23:38:58 +00:00
msramalho c261361ac8 try/catch enrichers 2023-01-26 23:03:51 +00:00
msramalho 2508bb8a1b cleanup + rearchivable logic 2023-01-26 23:01:34 +00:00
msramalho 9dd8afed8c minor improvements 2023-01-22 23:15:54 +00:00
msramalho 092ffdb6d8 replaywebpage 2023-01-22 00:48:09 +00:00
msramalho 746f6a333e further cleanup 2023-01-21 19:57:54 +00:00
msramalho b763fc4188 final naming cleanup + new feeders/dbs 2023-01-21 19:44:12 +00:00
msramalho 753039240f pyproject 2023-01-21 19:01:02 +00:00
msramalho ea2c266fa2 clean up and wacz WIP 2023-01-19 00:27:11 +00:00
msramalho 9bbc13e9be vk and yt-dlp 2023-01-18 23:15:25 +00:00
msramalho 176ce7e8da vk cleanup 2023-01-18 21:37:29 +00:00
msramalho eb0859fbaf vk archiver 2023-01-18 21:34:40 +00:00
msramalho 085376f63f telegram archiver 2023-01-18 21:14:20 +00:00
msramalho 63d1abbe4b tiktok archiver though info is no longer working 2023-01-18 16:56:35 +00:00
msramalho 1def8bb03d instagram archiver 2023-01-18 16:16:23 +00:00
msramalho 725bab8240 twitter archivers 2023-01-18 00:15:18 +00:00
msramalho f1bc83818d template updates 2023-01-17 17:01:25 +00:00
msramalho 47dc788143 thumbnails enricher 2023-01-17 16:29:27 +00:00
msramalho 74e50eccf1 hash enricher and media refactor 2023-01-13 02:12:08 +00:00
msramalho 6ca46417fe local storage + multiple storage support 2023-01-12 02:09:39 +00:00
msramalho 0cb593fd21 wayback enricher ready 2023-01-11 00:03:47 +00:00