The ScrapeBox Page Scanner addon lets you build a list of footprints to scan for.
Take the footprints you want to look for and load them in. You can load in multiple footprints; for instance, you could put in several footprints for WordPress. You then give your footprint a name.
Say you name it "wordpress". You then load in a list of URLs to check, and the page scanner looks through the HTML source of each page for any of your footprints. If it finds one of the WordPress footprints, the page shows as "wordpress" in the addon.
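If you want a feel for what this kind of scan involves, here is a rough Python sketch. It is not how ScrapeBox works internally. The function name scan_page, the sample footprints, and the choice of case-insensitive substring matching are all assumptions made for illustration.

```python
def scan_page(html, footprint_groups):
    """Return the names of all footprint groups with at least one match in html.

    Assumption: a footprint "matches" when it appears anywhere in the page
    source as a case-insensitive substring. ScrapeBox may match differently.
    """
    source = html.lower()
    matches = []
    for name, footprints in footprint_groups.items():
        if any(fp.lower() in source for fp in footprints):
            matches.append(name)
    return matches


# Hypothetical footprints grouped under the name "wordpress".
footprint_groups = {
    "wordpress": ["wp-content/", "wp-includes/"],
}

# A made-up page source standing in for a downloaded URL.
sample_html = '<link rel="stylesheet" href="/wp-content/themes/x/style.css">'

print(scan_page(sample_html, footprint_groups))  # prints ['wordpress']
```

Because the footprints are plain strings, they can contain spaces or chunks of HTML just as easily as single words.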
Then you can export the results to Excel and sort them however you like.
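As a rough illustration of the export-and-sort step, the Python sketch below writes scan results to a CSV file, which Excel can open. The column names, the write_results function, and the example URLs are hypothetical and do not reflect the addon's actual export format.

```python
import csv
import io


def write_results(results, out):
    """Write (url, footprint_name) pairs as CSV, sorted by name and then URL."""
    writer = csv.writer(out)
    writer.writerow(["url", "footprint"])  # assumed header row
    for url, name in sorted(results, key=lambda r: (r[1], r[0])):
        writer.writerow([url, name])


# Made-up results; in practice these would come from a scan.
results = [
    ("http://b.example/", "wordpress"),
    ("http://a.example/", "joomla"),
]

buffer = io.StringIO()
write_results(results, buffer)
print(buffer.getvalue())
```

To write a real file instead of an in-memory buffer, pass a file opened with open("results.csv", "w", newline="") as the second argument.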
This has many applications. You could search for a particular image name, or a CSS or HTML component on a page. You can look for particular text, or anything else you like. Spaces are supported, as is HTML code.
You could also use this to qualify a list of URLs based on CMS or platform, or you could use it to find contact forms or keywords on a page.