dexi.io

From the Dashboard or Projects page, click the Create New Robot button.

Select Crawler and enter the required information.

Under the Settings tab, make any desired configuration changes.

- If you wish to use input data to provide the crawler with URLs to visit, activate the Dynamic URL? checkbox.
- To follow the rules of any robots.txt files on the site (recommended), activate the Respect robots.txt checkbox.
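As a rough illustration of what respecting robots.txt means for a crawler, the sketch below uses Python's standard `urllib.robotparser` module to filter a list of candidate URLs. It is not dexi.io's implementation; the robots.txt content, the `example-crawler` user agent, and the `example.com` URLs are all made up for the example, and the candidate list simply stands in for URLs supplied as input data.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content. A real crawler would download this
# from the site's /robots.txt before visiting any page.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Build a parser from robots.txt text without touching the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def filter_allowed(parser: RobotFileParser, user_agent: str, urls: list[str]) -> list[str]:
    """Keep only the URLs that the robots.txt rules allow for this user agent."""
    return [url for url in urls if parser.can_fetch(user_agent, url)]


# Hypothetical input URLs, standing in for URLs provided as input data.
candidates = [
    "https://example.com/docs/index.html",
    "https://example.com/private/report.html",
]

parser = build_parser(ROBOTS_TXT)
allowed = filter_allowed(parser, "example-crawler", candidates)
print(allowed)
```

With these made-up rules, only the URL outside `/private/` is kept. A crawler that ignores robots.txt would visit both.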

Under the Output tab, create any output fields your project requires. These fields will store the output data generated by the crawler.

Under the Page Processors tab, configure any page processors required by your project. See <a href="https://intercom.help/dexiio/what-should-i-know-about-page-processors">What should I know about page processors?</a> for details.

When all necessary page processors are configured, click the blue Save button in the top-right of the page to save the crawler.

On the Projects page, select the crawler and click the Create Run button near the top-right of the page.

Select the new run and click Open in the slide-in panel.

Under the Configuration tab, change settings as needed.

Under the Integrations tab, configure any required integrations.

Under the Executions tab, launch the execution when you are ready, or review information about existing executions.

How can I crawl a Web site directory?
