Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Possible issues with the current implementation

  • It may be impossible to use the built-in PhantomJS functionality to render the page into an image as that generates an output file instead of writing to stdout.
  • It may be difficult to retrieve structured data from multiple pages; all of the data would have to be transferred through stdout, most often JSON encoded, which may be suboptimal. A possibility would be to write to an output file instead.
  • Currently, the scripts time out after 20 seconds. For a larger scale crawls it may be insufficient.

Example scripts

For extended function reference see API documentation and examples of PhantomJS.

...