I want to find users of Heritrix3 engine
from aary@xn--e1aghfa.xn--e1aqbccjfc.xn--p1acf to technology@lemmy.ml on 23 Jun 20:40
https://xn--e1aghfa.xn--e1aqbccjfc.xn--p1acf/post/422
from aary@xn--e1aghfa.xn--e1aqbccjfc.xn--p1acf to technology@lemmy.ml on 23 Jun 20:40
https://xn--e1aghfa.xn--e1aqbccjfc.xn--p1acf/post/422
I have such setup, but with some problems:
- it always does full crawl (doesn’t do deduplication)
- I am unable to control crawl order (don’t have enough knowledge)
threaded - newest