Commit History

Author SHA1 Message Date
  Frédéric Guillot 3b6e44c331 Allow the scraper to parse XHTML documents 7 years ago
  Frédéric Guillot 5870f04260 Simplify feed parser and format detection 7 years ago
  Patrick 2538eea177 Add the possibility to override default user agent for each feed 7 years ago
  Frédéric Guillot dbcc5d8a97 Use canonical imports 7 years ago
  Frédéric Guillot 1eba1730d1 Move HTTP client to its own package 8 years ago
  aniran 322b265d7a Scrape parent element for iframe 8 years ago
  Frédéric Guillot 3c3f397bf5 Make sure the scraper parse only HTML documents 8 years ago
  Frédéric Guillot 1d8193b892 Add logger 8 years ago
  Frédéric Guillot c6d9eb3614 Improve content scraper 8 years ago
  Frédéric Guillot 84d912c979 Rewrite imports 8 years ago
  Frédéric Guillot ef097f02fe Add the possibility to enable crawler for feeds 8 years ago
  Frédéric Guillot 87ccad5c7f Add scraper rules 8 years ago
  Frédéric Guillot 7a35c58f53 Add readability package to fetch original content 8 years ago