What should I know being new to Web data extraction?
Web data extraction is simply the act of copying information displayed on Web pages. Though modern Web pages can be complicated beneath the hood, dexi.io helps make extraction easier by providing a visual tool to assist in the design and configuration of robots. In addition to the extractor, we also offer the crawler robot for special-purpose occasions when simplicity and performance are at the top of the requirements list; and Pipes, a tool to control robots, refine the results of executing those robots, and connect that data to end-points for transportation.
If you don't have these skills under your belt, but are willing to learn, see the list of resources below for learning opportunities. If, on the other hand, you don't have the time or inclination to learn these things, consider asking us to build a custom robot for you based on your specifications.
Here are a few specific resources that may come in handy:
- W3Schools HTML Tag Reference: http://www.w3schools.com/tags/default.asp
- W3Schools HTML Attribute Reference: http://www.w3schools.com/tags/ref_standardattributes.asp
- W3Schools CSS Selectors Reference: http://www.w3schools.com/cssref/css_selectors.asp
- jQuery Selectors Reference: https://api.jquery.com/category/selectors/