Frédéric Guillot
|
3b6e44c331
Allow the scraper to parse XHTML documents
|
7 years ago |
Frédéric Guillot
|
5870f04260
Simplify feed parser and format detection
|
7 years ago |
Patrick
|
2538eea177
Add the possibility to override default user agent for each feed
|
7 years ago |
Frédéric Guillot
|
dbcc5d8a97
Use canonical imports
|
7 years ago |
Frédéric Guillot
|
1eba1730d1
Move HTTP client to its own package
|
8 years ago |
aniran
|
322b265d7a
Scrape parent element for iframe
|
8 years ago |
Frédéric Guillot
|
3c3f397bf5
Make sure the scraper parse only HTML documents
|
8 years ago |
Frédéric Guillot
|
1d8193b892
Add logger
|
8 years ago |
Frédéric Guillot
|
c6d9eb3614
Improve content scraper
|
8 years ago |
Frédéric Guillot
|
84d912c979
Rewrite imports
|
8 years ago |
Frédéric Guillot
|
ef097f02fe
Add the possibility to enable crawler for feeds
|
8 years ago |
Frédéric Guillot
|
87ccad5c7f
Add scraper rules
|
8 years ago |
Frédéric Guillot
|
7a35c58f53
Add readability package to fetch original content
|
8 years ago |