...
Possible issues with the current implementation
- It may be impossible to use the built-in PhantomJS functionality to render the page into an image as that generates an output file instead of writing to stdout.
- It may be difficult to retrieve structured data from multiple pages; all of the data would have to be transferred through stdout, most often JSON encoded, which may be suboptimal. A possibility would be to write to an output file instead.
- Currently, the scripts time out after 20 seconds. For a larger scale crawls it may be insufficient.
Example scripts
For extended function reference see API documentation and examples of PhantomJS.
...