Wykres commitów

50 Commity (252d6ee6fdf938d7c86a64845c61da627d466924)

Autor SHA1 Wiadomość Data
dgtlmoon 252d6ee6fd
Trigger text/wait (#187)
Re #71 - Ability to set filters
2021-08-16 13:13:17 +02:00
dgtlmoon 9e08f326be
Chrome/Webdriver support for Javascript websites (#114)
JS Support via fetching the page over WebDriver/Selenium network
Refactor forms (Split into logical tabs)
2021-08-12 12:05:59 +02:00
dgtlmoon e2304b2ce0
Re #154 Ldjson extract parse (#158)
* Use parsable JSON hiding in <script type="application/ld+json"> where possible, if it matches the filter rule, use it.
* Update README.md
2021-07-25 07:02:19 +02:00
Richard Schwab b008269a70
Partially revert 47e5a7cf09 (#138)
Copy HTTP headers from the global template instead of updating the global template when fetching a site.

fixes #137
2021-07-18 10:12:23 +10:00
dgtlmoon 83daa6f630 Re #132 - Make a list of the JSONpath results instead of using only the first value 2021-07-14 11:15:32 +10:00
dgtlmoon 655a350f50 Re #117 - dont re-encode single value types, looks better in the diff 2021-07-12 18:27:03 +10:00
dgtlmoon e073521f4d
Re #117 Jsonpath based JSON change detection filter (#125)
* Re #117 - Experimental JSON selector support by using 'json:' prefix and any JSONpath rule
2021-07-11 22:07:39 +10:00
dgtlmoon 25185e6d00
Auto extract html title as title (#102)
* Auto extract <title> as watch title, Minor refactor for html tooling
2021-06-24 19:10:19 +10:00
dgtlmoon f215adbbe5 CSS Filter - Smarter is to just extract the HTML blob and continue with inscriptus, so we have almost the same output as not using the filter 2021-06-23 20:40:01 +10:00
dgtlmoon 8d59ef2e10 CSS Filter - restore nicer linefeeds 2021-06-23 12:52:04 +10:00
dgtlmoon e3a9847f74 @todo Comment - BS4's element.get_text() seems to lose the indentation format no-matter what 2021-06-23 12:49:53 +10:00
dgtlmoon 47f7698b32 CSS Filter - strip text of whitespacing, preserve new lines where applicable, remove extra newlines 2021-06-23 12:29:14 +10:00
dgtlmoon 854520005d
#81 - Regex support (#90)
* Re #81 - Regex support
* minor cleanup
2021-06-21 17:17:22 +10:00
Leonardo Brondani Schenkel cec45a7ad7
Strip surrounding whitespace from elements (#89) 2021-06-16 13:57:22 +10:00
dgtlmoon 2346b42ef2
CSS selector filter (#73)
* Re #9 CSS Selector filtering,  Adding test for #9
2021-05-30 21:22:26 +10:00
Leigh Morresi e0578acca2 Tidy up thread logic and version check 2021-03-01 14:29:21 +01:00
Leigh Morresi 47fcb8b4f8 Move logic 2021-02-27 22:01:42 +01:00
Leigh Morresi f1da8f96b6 When new ignore text is specified, reprocess the checksum 2021-02-27 20:30:06 +01:00
Leigh Morresi 468184bc3a Issue #14 - Tweaks to edit, create ignore text, tests for ignore text, integrate ignore text 2021-02-26 20:07:26 +01:00
Leigh Morresi 96221598e7 Tidy up return logic 2021-02-21 20:23:50 +01:00
Leigh Morresi e200cd3289 Fixing a few more easy lint wins 2021-02-21 14:26:19 +01:00
Leigh Morresi 63eea2d6db Linting fixups 2021-02-21 14:21:14 +01:00
Leigh Morresi b0c5dbd88e Just use the current/previous md5 2021-02-21 13:46:16 +01:00
Leigh Morresi 1718e2e86f Finalse pytest methods 2021-02-21 13:41:00 +01:00
Leigh Morresi 87f4347fe5 hack of pytest implementation - doesnt work yet 2021-02-16 21:35:28 +01:00
Leigh Morresi 93ee65fe53 Tidy up a few broken datastore paths 2021-02-12 19:43:05 +01:00
Leigh Morresi 9f964b6d3f WIP, separate out the Flask from everything else, get pytest working 2021-02-12 19:24:30 +01:00
Leigh Morresi 47e5a7cf09 Avoid accidently using Python's objects that are copied - but land as a 'soft reference', need to use a better dict struct in the future #6 2021-02-05 18:43:35 +01:00
Leigh Morresi d07cf53a07 Minor fix to 'last changed' field, simplify template and logic 2021-02-04 13:15:39 +01:00
Leigh Morresi 5e31ae86d0 Use a thread locker and cleaner separation of concerns between main thread and site status fetch 2021-02-04 12:38:48 +01:00
Leigh Morresi 07f41782c0 Adding SEND_FILE_MAX_AGE_DEFAULT to ensure backups etc dont get old 2021-02-03 09:45:58 +01:00
Leigh Morresi f1c2ece32f Use a pool of thread workers, better for huge lists of watchers 2021-02-02 16:29:06 +01:00
Leigh Morresi eecc620386 https://github.com/psf/requests/issues/4525 - brotli compression is not yet supported in requests, be sure that users cant accidently use this content type encoding in the headers 2021-02-02 11:49:43 +01:00
Leigh Morresi 81534d9367 Add [diff] mechanism 2021-01-31 19:55:35 +01:00
Leigh Morresi 43c7ccb3fe Use a single thread for writing the sync json 2021-01-31 18:49:14 +01:00
Leigh Morresi bfcb17ca24 Remove import for old lib 2021-01-30 10:29:39 +01:00
Leigh Morresi 98f6f4619f Switch to inscriptis
prepare config backend struct
2021-01-30 10:14:19 +01:00
Leigh Morresi fbe20d45cc Support for custom headers per watch 2021-01-29 19:12:39 +01:00
Leigh Morresi 324c54fe46 Use requests's r.text so we dont have to deal with charsets 2021-01-29 13:05:31 +01:00
Leigh Morresi b7a0c2dbcd Add edit UI
Move to keyed structure instead of list
2021-01-29 10:49:05 +01:00
Leigh Morresi 9c0c8bf6aa Remove actual :// links, dont consider these as part of the changes, often they include variables/trackingscript ref etc 2021-01-28 14:45:01 +01:00
Leigh Morresi b574a28f1f Tweak comments 2021-01-28 11:12:42 +01:00
Leigh Morresi 01359e4811 Store a history of changes, used for future lookup/diff/explore changes UI 2021-01-28 10:39:14 +01:00
Leigh Morresi 93562afb02 Adding README amd docker info 2021-01-27 20:51:55 +01:00
Leigh Morresi f455f14efd Primitive support for extra headers 2021-01-27 20:09:40 +01:00
Leigh Morresi a4f1f6ab69 Handle titles and links 2021-01-27 19:54:17 +01:00
Leigh Morresi 1968d400fe Store the html2text version too 2021-01-27 19:04:39 +01:00
Leigh Morresi 0515aca7dd small fixes 2021-01-27 17:35:32 +01:00
Leigh Morresi 646a54945a Handle errors better, use the plaintext output 2021-01-27 15:56:59 +01:00
Leigh Morresi 2f018ac04c Workon threads 2021-01-27 15:12:31 +01:00