!!! Overview [{$pagename}] is an open source, non-relational, distributed database modeled after Google's [BigTable]. [{$pagename}] is developed as part of [Apache] Software Foundation's [Apache Hadoop] project and runs on top of [HDFS] ([Hadoop Distributed File System]), providing [BigTable]-like capabilities for [Hadoop]. That is, it provides a fault-tolerant way of storing large quantities of sparse data (small amounts of information caught within a large collection of empty or unimportant [data], such as finding the 50 largest items in a group of 2 billion records, or finding the non-zero items representing less than 0.1% of a huge collection). [{$pagename}] features compression, in-memory operation, and Bloom filters on a per-column basis as outlined in the original [BigTable] paper. Tables in [{$pagename}] can serve as the input and output for [MapReduce] jobs run in [Hadoop], and may be accessed through the [Java] [API] but also through [REST], Avro or Thrift gateway [APIs]. [{$pagename}] is a column-oriented key-value [data Store|DataStore] and has been idolized widely because of its lineage with [Hadoop] and [HDFS]. [{$pagename}] as described for the [CAP theorem] is a [Consistency]/[Partition] system. !! More Information There might be more information for this subject on one of the following: [{ReferringPagesPlugin before='*' after='\n' }]