Frédéric Guillot
|
66b269e6cd
feat(readability): avoid removing elements with `content` class
|
8 miesięcy temu |
jvoisin
|
a62b97bddd
refactor(readability): get rid of getClassWeight
|
8 miesięcy temu |
jvoisin
|
1de9cf4241
perf(readability): simplify removeUnlikelyCandidates
|
8 miesięcy temu |
jvoisin
|
7912b9b8fb
perf(readability): avoid materializing text to count commas
|
8 miesięcy temu |
jvoisin
|
2d24f5d04e
refactor(readability): minor code folding
|
8 miesięcy temu |
Frédéric Guillot
|
f02213a168
refactor(readability): use `String` explicitly in debug log instead of `Any`
|
8 miesięcy temu |
Julien Voisin
|
86e2ce6d0b
perf(readability): move `transformMisusedDivsIntoParagraphs` call after `removeUnlikelyCandidates`
|
8 miesięcy temu |
jvoisin
|
69a74c4abf
refactor(readability): minor clean up
|
9 miesięcy temu |
jvoisin
|
766d4ab834
refactor(readability): make use of getSelectionLength
|
9 miesięcy temu |
Frédéric Guillot
|
8c3f280f32
test(readability): add test case for `ExtractContent` with broken reader
|
9 miesięcy temu |
jvoisin
|
8a98926674
refactor(readability): add a getSelectionLength function
|
9 miesięcy temu |
jvoisin
|
89c32d518d
perf(readability): significantly improve transformMisusedDivsIntoParagraphs
|
9 miesięcy temu |
jvoisin
|
2f7b2e7375
perf(readability): improve getLinkDensity
|
9 miesięcy temu |
Frédéric Guillot
|
6eeccae7cd
test(readability): increase test coverage
|
9 miesięcy temu |
jvoisin
|
aed99e65c1
perf(readability): improve getClassWeight speed
|
9 miesięcy temu |
jvoisin
|
4e1f836266
refactor(readability): simplify a bit getArticle
|
9 miesięcy temu |
jvoisin
|
c064891314
perf(readability): Simplify removeUnlikelyCandidates
|
9 miesięcy temu |
Frédéric Guillot
|
6d58052504
fix(readability): do not remove elements within code blocks
|
9 miesięcy temu |
jvoisin
|
8a014c6abc
perf(readability): minor regex improvement
|
9 miesięcy temu |
jvoisin
|
2df59b4865
Refactor internal/reader/readability/testdata
|
1 rok temu |
Julien Voisin
|
6ad5ad0bb2
refactor(readability): various improvements and optimizations
|
1 rok temu |
Julien Voisin
|
e6185b1393
refactor: use min/max instead of math.Min/math.Max
|
1 rok temu |
Julien Voisin
|
1b0b8b9c42
refactor: use a better construct than `doc.Find(…).First()`
|
1 rok temu |
Julien Voisin
|
2671f57edd
refactor(readability): simplify the regexes in `internal/reader/readability/readability.go`
|
1 rok temu |
Frédéric Guillot
|
29387f2d60
feat: implement base element handling in content scraper
|
1 rok temu |
Frédéric Guillot
|
b1e73fafdf
Enable go-critic linter and fix various issues detected
|
2 lat temu |
jvoisin
|
347740dce1
Speed up removeUnlikelyCandidates
|
2 lat temu |
Frédéric Guillot
|
97765b93a9
Revert "Minor internal/reader/readability/readability.go speedup"
|
2 lat temu |
jvoisin
|
4db138d4b8
Minor internal/reader/readability/readability.go speedup
|
2 lat temu |
jvoisin
|
61af08a721
Use .WriteString( instead of .Write([]byte(…
|
2 lat temu |