Mainly:
Are you honoring robots.txt when crawling a given website?
If you do, what is the crawler's name (user agent)?
Use case: I work on a number of websites, adding articles (which I want journalist.cafe to use internally), but the robots.txt is not open at the moment, as I want Google etc. out until I have a longer content list.
My robots.txt has exceptions for the tools I use; I would like to tell journalist.cafe it can crawl my site if it is currently blocked.
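For illustration, assuming the crawler identifies itself with a user-agent token such as JournalistCafeBot (a hypothetical name, pending an answer here), such an exception could look like:

```
# Hypothetical token -- replace with the crawler's real user-agent name
User-agent: JournalistCafeBot
Disallow:

# Everyone else (Google etc.) stays blocked for now
User-agent: *
Disallow: /
```

An empty Disallow line grants that agent access to the whole site, while the wildcard group keeps all other crawlers out.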
Closed
Feature Request
Over 2 years ago

Thomas Tomiczek