- Create a crawler to crawl and cache all pages on the web.
- Store the index of the cached web on servers. Indexing the whole web needs an enormous amount of storage, so start by caching the pages of a single country; if that succeeds, expand to other countries around the world.
- Build the core components: parser/crawler, index builder, query engine (compiler, builder, matcher), and snippet generation.
- Filter results depending on ranking factors.
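- Google has its own NoSQL (Not only SQL) database system called Bigtable, which runs on the Google File System (GFS), a distributed file system owned by Google.
- HDFS, the Hadoop Distributed File System, is an open-source alternative to GFS. HBase and Hypertable are open-source NoSQL databases modeled on Bigtable.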
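
The components listed above (crawler, parser, index builder, query matcher, ranking, snippet generation) can be sketched in a few lines. This is only a toy illustration, not how any real engine is built: the `PAGES` dictionary is a made-up three-page "web" held in memory instead of real HTTP fetching, all function names are hypothetical, and the ranking factor is assumed to be a plain count of query-word occurrences.

```python
import re
from collections import Counter, defaultdict, deque

# Hypothetical in-memory "web": url -> (page text, outgoing links).
PAGES = {
    "a.example": ("Search engines crawl the web and build an index.",
                  ["b.example", "c.example"]),
    "b.example": ("An inverted index maps each word to pages; "
                  "the index is the core data structure.",
                  ["a.example"]),
    "c.example": ("Ranking orders the matching pages; "
                  "index quality matters for ranking.",
                  []),
}


def tokenize(text):
    """Parser: lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def crawl(seed, pages):
    """Crawler: breadth-first walk over links, caching each page once."""
    cache, queue = {}, deque([seed])
    while queue:
        url = queue.popleft()
        if url in cache or url not in pages:
            continue
        text, links = pages[url]
        cache[url] = text
        queue.extend(links)
    return cache


def build_index(cache):
    """Index builder: inverted index, word -> {url: occurrence count}."""
    index = defaultdict(Counter)
    for url, text in cache.items():
        for word in tokenize(text):
            index[word][url] += 1
    return index


def search(query, index):
    """Query matcher with a toy ranking factor (assumed: term count)."""
    words = tokenize(query)
    if not words:
        return []
    postings = [index.get(word, Counter()) for word in words]
    # Keep only pages that contain every query word.
    matches = set(postings[0])
    for posting in postings[1:]:
        matches &= set(posting)

    def score(url):
        return sum(posting[url] for posting in postings)

    # Highest score first; ties are broken alphabetically by URL.
    return sorted(matches, key=lambda url: (-score(url), url))


def snippet(text, query, width=40):
    """Snippet generation: a text window around the first query word found."""
    lowered = text.lower()
    for word in tokenize(query):
        pos = lowered.find(word)
        if pos != -1:
            start = max(0, pos - width // 2)
            return text[start:start + width]
    return text[:width]


cache = crawl("a.example", PAGES)
index = build_index(cache)
for url in search("index", index):
    print(url, "|", snippet(cache[url], "index"))
```

A real system would replace the dictionary with network fetching, keep the index in a distributed store such as the ones named above, and combine many more ranking factors than a raw word count.
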
- Google
- Bing
- Yahoo
- Ask
- AOL Search
- Wow
- WebCrawler
- MyWebSearch
- InfoSpace
- Info.com
- DuckDuckGo