dgtlmoon
|
252d6ee6fd
|
Trigger text/wait (#187)
Re #71 - Ability to set filters
|
2021-08-16 13:13:17 +02:00 |
dgtlmoon
|
9e08f326be
|
Chrome/Webdriver support for Javascript websites (#114)
JS Support via fetching the page over WebDriver/Selenium network
Refactor forms (Split into logical tabs)
|
2021-08-12 12:05:59 +02:00 |
dgtlmoon
|
e2304b2ce0
|
Re #154 Ldjson extract parse (#158)
* Use parsable JSON hiding in <script type="application/ld+json"> where possible, if it matches the filter rule, use it.
* Update README.md
|
2021-07-25 07:02:19 +02:00 |
Richard Schwab
|
b008269a70
|
Partially revert 47e5a7cf09 (#138)
Copy HTTP headers from the global template instead of updating the global template when fetching a site.
fixes #137
|
2021-07-18 10:12:23 +10:00 |
dgtlmoon
|
83daa6f630
|
Re #132 - Make a list of the JSONpath results instead of using only the first value
|
2021-07-14 11:15:32 +10:00 |
dgtlmoon
|
655a350f50
|
Re #117 - don't re-encode single value types, looks better in the diff
|
2021-07-12 18:27:03 +10:00 |
dgtlmoon
|
e073521f4d
|
Re #117 Jsonpath based JSON change detection filter (#125)
* Re #117 - Experimental JSON selector support by using 'json:' prefix and any JSONpath rule
|
2021-07-11 22:07:39 +10:00 |
dgtlmoon
|
25185e6d00
|
Auto extract html title as title (#102)
* Auto extract <title> as watch title, Minor refactor for html tooling
|
2021-06-24 19:10:19 +10:00 |
dgtlmoon
|
f215adbbe5
|
CSS Filter - It is smarter to just extract the HTML blob and continue with inscriptis, so the output is almost the same as without the filter
|
2021-06-23 20:40:01 +10:00 |
dgtlmoon
|
8d59ef2e10
|
CSS Filter - restore nicer linefeeds
|
2021-06-23 12:52:04 +10:00 |
dgtlmoon
|
e3a9847f74
|
@todo Comment - BS4's element.get_text() seems to lose the indentation format no matter what
|
2021-06-23 12:49:53 +10:00 |
dgtlmoon
|
47f7698b32
|
CSS Filter - strip whitespace from text, preserve new lines where applicable, remove extra newlines
|
2021-06-23 12:29:14 +10:00 |
dgtlmoon
|
854520005d
|
#81 - Regex support (#90)
* Re #81 - Regex support
* minor cleanup
|
2021-06-21 17:17:22 +10:00 |
Leonardo Brondani Schenkel
|
cec45a7ad7
|
Strip surrounding whitespace from elements (#89)
|
2021-06-16 13:57:22 +10:00 |
dgtlmoon
|
2346b42ef2
|
CSS selector filter (#73)
* Re #9 CSS Selector filtering, Adding test for #9
|
2021-05-30 21:22:26 +10:00 |
Leigh Morresi
|
e0578acca2
|
Tidy up thread logic and version check
|
2021-03-01 14:29:21 +01:00 |
Leigh Morresi
|
47fcb8b4f8
|
Move logic
|
2021-02-27 22:01:42 +01:00 |
Leigh Morresi
|
f1da8f96b6
|
When new ignore text is specified, reprocess the checksum
|
2021-02-27 20:30:06 +01:00 |
Leigh Morresi
|
468184bc3a
|
Issue #14 - Tweaks to edit, create ignore text, tests for ignore text, integrate ignore text
|
2021-02-26 20:07:26 +01:00 |
Leigh Morresi
|
96221598e7
|
Tidy up return logic
|
2021-02-21 20:23:50 +01:00 |
Leigh Morresi
|
e200cd3289
|
Fixing a few more easy lint wins
|
2021-02-21 14:26:19 +01:00 |
Leigh Morresi
|
63eea2d6db
|
Linting fixups
|
2021-02-21 14:21:14 +01:00 |
Leigh Morresi
|
b0c5dbd88e
|
Just use the current/previous md5
|
2021-02-21 13:46:16 +01:00 |
Leigh Morresi
|
1718e2e86f
|
Finalise pytest methods
|
2021-02-21 13:41:00 +01:00 |
Leigh Morresi
|
87f4347fe5
|
hack of pytest implementation - doesn't work yet
|
2021-02-16 21:35:28 +01:00 |
Leigh Morresi
|
93ee65fe53
|
Tidy up a few broken datastore paths
|
2021-02-12 19:43:05 +01:00 |
Leigh Morresi
|
9f964b6d3f
|
WIP, separate out the Flask from everything else, get pytest working
|
2021-02-12 19:24:30 +01:00 |
Leigh Morresi
|
47e5a7cf09
|
Avoid accidentally using Python objects that are copied but land as a 'soft reference'; need to use a better dict struct in the future #6
|
2021-02-05 18:43:35 +01:00 |
Leigh Morresi
|
d07cf53a07
|
Minor fix to 'last changed' field, simplify template and logic
|
2021-02-04 13:15:39 +01:00 |
Leigh Morresi
|
5e31ae86d0
|
Use a thread locker and cleaner separation of concerns between main thread and site status fetch
|
2021-02-04 12:38:48 +01:00 |
Leigh Morresi
|
07f41782c0
|
Adding SEND_FILE_MAX_AGE_DEFAULT to ensure backups etc. don't get stale
|
2021-02-03 09:45:58 +01:00 |
Leigh Morresi
|
f1c2ece32f
|
Use a pool of thread workers, better for huge lists of watchers
|
2021-02-02 16:29:06 +01:00 |
Leigh Morresi
|
eecc620386
|
https://github.com/psf/requests/issues/4525 - brotli compression is not yet supported in requests, so make sure users can't accidentally use this content encoding in the headers
|
2021-02-02 11:49:43 +01:00 |
Leigh Morresi
|
81534d9367
|
Add [diff] mechanism
|
2021-01-31 19:55:35 +01:00 |
Leigh Morresi
|
43c7ccb3fe
|
Use a single thread for writing the sync json
|
2021-01-31 18:49:14 +01:00 |
Leigh Morresi
|
bfcb17ca24
|
Remove import for old lib
|
2021-01-30 10:29:39 +01:00 |
Leigh Morresi
|
98f6f4619f
|
Switch to inscriptis
prepare config backend struct
|
2021-01-30 10:14:19 +01:00 |
Leigh Morresi
|
fbe20d45cc
|
Support for custom headers per watch
|
2021-01-29 19:12:39 +01:00 |
Leigh Morresi
|
324c54fe46
|
Use requests' r.text so we don't have to deal with charsets
|
2021-01-29 13:05:31 +01:00 |
Leigh Morresi
|
b7a0c2dbcd
|
Add edit UI
Move to keyed structure instead of list
|
2021-01-29 10:49:05 +01:00 |
Leigh Morresi
|
9c0c8bf6aa
|
Remove actual :// links, don't consider these as part of the changes; they often include variables, tracking script refs, etc.
|
2021-01-28 14:45:01 +01:00 |
Leigh Morresi
|
b574a28f1f
|
Tweak comments
|
2021-01-28 11:12:42 +01:00 |
Leigh Morresi
|
01359e4811
|
Store a history of changes, used for future lookup/diff/explore changes UI
|
2021-01-28 10:39:14 +01:00 |
Leigh Morresi
|
93562afb02
|
Adding README and docker info
|
2021-01-27 20:51:55 +01:00 |
Leigh Morresi
|
f455f14efd
|
Primitive support for extra headers
|
2021-01-27 20:09:40 +01:00 |
Leigh Morresi
|
a4f1f6ab69
|
Handle titles and links
|
2021-01-27 19:54:17 +01:00 |
Leigh Morresi
|
1968d400fe
|
Store the html2text version too
|
2021-01-27 19:04:39 +01:00 |
Leigh Morresi
|
0515aca7dd
|
small fixes
|
2021-01-27 17:35:32 +01:00 |
Leigh Morresi
|
646a54945a
|
Handle errors better, use the plaintext output
|
2021-01-27 15:56:59 +01:00 |
Leigh Morresi
|
2f018ac04c
|
Work on threads
|
2021-01-27 15:12:31 +01:00 |