Google Hacking
Google is the most popular search engine on the planet, so
much so that its name has become a verb. (As in, “to google.”) The term “google”
was originally “googol,” a term meaning the number “1” followed by 100 zeroes,
created by prominent mathematician Edward Kasner
Google search is a web search engine owned by Google Inc.
and is the most-used search engine on the Web.
A California-based public corporation specializing in online
searches and advertising. Google was created by Stanford students Larry Page
and Sergey Brin and has by now become the world's leading search engine in
terms of reach.
Advance Search Operators
•There are many more advanced operators.
•Combining these creatively is the key to Google Hacking.
allinanchor: All
query words must appear in anchor text of links to the page.
inanchor: Terms must appear in anchor text
of links to the page.
allintext: All
query words must appear in the text of the page.
intext: The terms
must appear in the text of the page.
allintitle: All
query words must appear in the title of the page.
intitle: The terms must appear in the
title of the page.
allinurl: All query words must appear in
the URL.
inurl: The terms must appear in the URL of the
page.
Advance Search Operators(Contd.)
•Advanced Search Operators
•site: (.edu, .gov, foundstone.com,
usc.edu)
•filetype: (txt, xls, mdb, pdf, .log)
•Daterange: (julian date format)
•Intitle / allintitle
•Inurl / allinurl
Some other things to keep in mind:
Google queries are not case sensitive.
The * wildcard represents any word
Example: “*
insurance quote”
Google stems words automatically
Example:
“automobile insurance quote” brings up sites with “auto…”
Countermeasures
•Keep sensitive data off the web!!
•Perform periodic Google Assessments
– Update robots.txt
– Use meta-tags: NO ARCHIVE
How To Protect Your Websites From
Google Hackers.
•In general, be very careful about what content you place on
your Internet-facing websites.
•Do not display detailed error messages.
•Do not allow directory browsing.
●Keep all of your links environment specific.
●Keep your name and email out of HTML comments and don’t
post them on Google Groups.
●Configure your web server to only serve up a list of “safe”
file types and to respond with “File Not Found” for any unsafe types.
●Use a robots.txt file to prevent Google and other search
engines from crawling your site if it shouldn’t be crawled.
But we can use these for well use
like eBook search ,video ,and many more useful works
Ankush
Mohanty