hulb
|
01f678c3b1
add proxy arg in scraper.Fetch
|
4 years ago |
Darius
|
9242350f0e
Add per feed cookies option
|
5 years ago |
Frédéric Guillot
|
ec3c604a83
Add option to allow self-signed or invalid certificates
|
5 years ago |
Frédéric Guillot
|
c394a61a4e
Add Prometheus exporter
|
5 years ago |
Frédéric Guillot
|
16b7b3bc3e
http client: remove dependency on global config options
|
5 years ago |
cinput
|
8e1ed8bef3
Return outer HTML when scraping elements
|
6 years ago |
Frédéric Guillot
|
311a133ab8
Refactor manual entry scraper
|
7 years ago |
Frédéric Guillot
|
3b6e44c331
Allow the scraper to parse XHTML documents
|
7 years ago |
Frédéric Guillot
|
5870f04260
Simplify feed parser and format detection
|
7 years ago |
Patrick
|
2538eea177
Add the possibility to override default user agent for each feed
|
7 years ago |
Frédéric Guillot
|
dbcc5d8a97
Use canonical imports
|
7 years ago |
Frédéric Guillot
|
1eba1730d1
Move HTTP client to its own package
|
8 years ago |
aniran
|
322b265d7a
Scrape parent element for iframe
|
8 years ago |
Frédéric Guillot
|
3c3f397bf5
Make sure the scraper parse only HTML documents
|
8 years ago |
Frédéric Guillot
|
1d8193b892
Add logger
|
8 years ago |
Frédéric Guillot
|
c6d9eb3614
Improve content scraper
|
8 years ago |
Frédéric Guillot
|
84d912c979
Rewrite imports
|
8 years ago |
Frédéric Guillot
|
ef097f02fe
Add the possibility to enable crawler for feeds
|
8 years ago |
Frédéric Guillot
|
87ccad5c7f
Add scraper rules
|
8 years ago |
Frédéric Guillot
|
7a35c58f53
Add readability package to fetch original content
|
8 years ago |