Import your list of scraped URLs into the Malware checker and run it.
Sadly many bloggers and small business owners rarely check their sites for malware and not everyone knows how to setup Google Webmaster tools. If you have Google Webmaster Tools setup on your websites then Google will normally inform you that a site has been infected by malware.
The first free plugin you will need is the Malware and Phishing Filter once you have installed this plugin it allows you to search a list of sites from Scrapebox to find sites that have been compromised by some form of Malware.
txt files with lots of different search terms to put your harvesting on steroids. “submit * guest post” you will find lots of guest blogging opportunities for your niche quickly. Scrapebox allows you to harvest thousands of URL’s from Google and Bing in no time at all and by entering your own custom footprints e.g. Scrapebox currently costs $97 (there are a few coupons on the net for $57 if you search around) and for the amount of time and money this tool will save you it is more than worth the investment.
This link building technique utilises some of the free plugins that you can get from Scrapebox, the main tactic in this technique is to find a compromised or malware infected site and open a dialogue with the site owner in an attempt to receive a link either via a Guest Post or by suggesting the site owner replaces broken links with your own. Well “Soapbox White Hatters” I’m going to show you a way that you can actually use Scrapebox to make the internet a better place… in fact a safer place for all! So what is this Scrapebox Link building technique? I can already imagine several people ready to jump down to the comments and tell me that tools like this are ruining the internet…
A lot of people in the SEO community hate the thoughts of automated link building and the sheer mention of a tool such as Scrapebox makes their skin crawl. I know a lot of my regular readers will have a heart attack at the recommendation of using Scrapebox as a “White Hat” Link Building tool. They usually say things such as “Great Blog Post thanks for sharing” with a keyword rich anchor text link to a site selling fake Ugg boots. If you have ever spent any time reading blogs you will have seen the stereotypical comments on blogs. There are three major differences between FMiner and WebHarvy.Scrapebox is well known in the SEO community as a Grey Hat, Black Hat, Yellow Polka Dotted Hat link building tool that is mainly used by blog commenting spammers.
It visual scraping feature allows you to define extraction rules just like Octoparse and Parsehub. WebHarvy is a desktop application that can scrape website locally (it runs on your computer, not on a cloud server). Simply run the following : docker run -v ~/portia_projects:/app/data/projects:rw -p 9001:9001 scrapinghub/portia You can run it easily thanks to the docker image. Portia is a web application written in Python. This means it allows to create Scrapy spiders without a single line of code, with a visual tool. It's a visual abstraction layer on top of the great Scrapy framework. Portia is another great open source project from ScrapingHub.
What is unique about DataMiner is that it has a lot of features compared to other extensions. DataMiner is one of the most famous Chrome extensions for web scraping (186k installation and counting).