Google Technologies. This article servers as a point to capture information about Google technologies, e.g. Pagerank, Bigtable, Google Search, etc.
1. Overview
Google uses an amazing set of technologies. I use this article to keep information about this technology and to keep pointers to publicly available information about these technologies.
2. Pagerank
Pagerank is the weight of importance for a webpage calculated by Google.
The calculation process is not known and contains many factors but the principle is simple. A web page is more important if other web pages link to it. The more importance a web pages has (by inbound links from other sites) the more important are outbound links from this webpage.
Please find a good description of the pagerank calculation in the following article http://www.ams.org/featurecolumn/archive/pagerank.html.
3. MapReduce
Please see MapReduce.
4. Google File System
Google has the GFS, a distributed, multi-gigabyte files. This file system is described in the Google File System Whitepaper.
GFS does not handle the replication between different data centers.
5. Bigtable
Please see Bigtable.
6. Memcache
Google provides on the Google App Engine memcache as a caching mechanism.
Memcache is a high-performance, distributed memory object caching system, primarily intended for fast access to cached results of datastore queries.
Similar to Bigtable it works similar to a map with key and objects. If the memory consumption of memcache is to big then memory will automatically released based on a Last-Recently-Used (LRU) strategy.
Google provides an API to put something into memcache and to remove something again from memcache.
7. Links and Literature
7.2. vogella Java example code
If you need more assistance we offer Online Training and Onsite training as well as consulting