How to salvage links from WMT “Crawl Errors” list?
When someone links to your website, but makes a typo while doing it, those broken inbound links will show up in Google Webmaster Tools in the Crawl Errors section as “Not Found”. Often they are easy to salvage by just adding a 301 redirect in the htaccess file.
But sometimes the typo is really weird, or the link source looks a little scary, and that`s what I need your help with.
First let`s look at the weird typo problem. If it is something easy, like they just lost the last part of the URL, ( such as www.mydomain.com/pagenam ) then I fix it in htaccess this way:
RewriteCond %{HTTP_HOST} ^mydomain.com$ OR
RewriteCond %{HTTP_HOST} ^www.mydomain.com$
RewriteRule ^pagenam$ "http://www.mydomain.com/pagename.html" R=301, L
But what about when the last part of the URL is really screwed up? Especially with non-text characters, like these:
www.mydomain.com/pagename.htmlsale
www.mydomain.com/pagename.htmlhttp://
www.mydomain.com/pagename.html%22
www.mydomain.com/pagename.html/
How is the htaccess Rewrite Rule typed to deal with these oddballs?
Second, is there a quick and easy method or tool to tell us if a linking domain is good or spammy? I have incoming broken links from sites like these:
www.webutation(dot)net
titlesaurus(dot)com
www.webstatsdomain(dot)com
www.ericksontribune(dot)com
www.addondashboard(dot)com
search.wiki.gov(dot)cn
www.mixeet(dot)com
dinasdesignsgraphics(dot)com
If you could type out how the Rewrite should be done, that would be really fantastic. Your help is greatly appreciated. Thanks!
Greg?