What should I know if I'm new to Web data extraction?

Web data extraction is the act of copying information displayed on Web pages. Although modern Web pages can be complicated under the hood, dexi.io makes extraction easier by providing a visual tool for designing and configuring robots. In addition to the extractor, we offer the crawler robot for cases where simplicity and performance are the top requirements, and Pipes, a tool for controlling robots, refining the results those robots produce, and connecting that data to end-points for transportation.

Even with the extractor editor's assistance, building a working robot can sometimes be challenging. If not everything comes together the first time around, you may need to adjust the configuration settings the editor recommends. Doing so requires an understanding of HTML, CSS selectors, and regular expressions, and occasionally JavaScript and jQuery selectors.
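To give a feel for how these skills fit together, here is a minimal sketch in Python, using only the standard library. It is not dexi.io code and does not use any dexi.io API. The sample HTML, the class name "price", and the helper names (ClassTextCollector, extract_prices) are all hypothetical and chosen for illustration. The sketch first picks out the elements that a CSS selector such as ".price" would match, then uses a regular expression to pull the number out of each element's text.

```python
import re
from html.parser import HTMLParser

# Hypothetical page fragment; not taken from any real site.
SAMPLE_HTML = """
<ul>
  <li class="product"><span class="name">Widget</span> <span class="price">Price: $19.99</span></li>
  <li class="product"><span class="name">Gadget</span> <span class="price">Price: $5.00</span></li>
</ul>
"""


class ClassTextCollector(HTMLParser):
    """Collect the text of every element whose class list contains target_class.

    This is roughly what the CSS selector ".price" matches. It is a sketch:
    unclosed void tags such as <br> inside a matching element are not handled.
    """

    def __init__(self, target_class):
        super().__init__()
        self.target_class = target_class
        self.depth = 0  # greater than 0 while inside a matching element
        self.results = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if self.depth:
            self.depth += 1  # nested tag inside a match
        elif self.target_class in classes:
            self.depth = 1
            self.results.append("")

    def handle_endtag(self, tag):
        if self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth:
            self.results[-1] += data


def extract_prices(html):
    """Return the dollar amounts found in elements with class "price"."""
    collector = ClassTextCollector("price")
    collector.feed(html)
    prices = []
    for text in collector.results:
        # Regular expression: a dollar sign, digits, a dot, two digits.
        match = re.search(r"\$(\d+\.\d{2})", text)
        if match:
            prices.append(float(match.group(1)))
    return prices


print(extract_prices(SAMPLE_HTML))
```

The selection step narrows the page down to the right elements, and the regular expression cleans up the text inside them. Adjusting a robot's configuration by hand usually means changing one of these two pieces.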

If you don't have these skills under your belt, but are willing to learn, see the list of resources below for learning opportunities. If, on the other hand, you don't have the time or inclination to learn these things, consider asking us to build a custom robot for you based on your specifications.

Knowledge Resources

You can learn virtually everything you need to know to build crawling and scraping robots at W3Schools, including HTML, CSS, JavaScript, and jQuery. They can even help you understand our JSON output format.

Here are a few specific resources that may come in handy:

