Google’s big-data tool, Mesa, holds petabytes of data across multiple servers

Google says its big-data architecture, Mesa, can store petabytes of data, update millions of rows of data per second, and field trillions of queries daily across multiple servers, enabling continuous operation of the data warehouse even if a data center fails. “Mesa ingests data generated by upstream services, aggregates and persists the data internally, and serves the data via user queries,” note Google researchers. They say Mesa was originally constructed to house and analyze critical measurement data for Google’s Internet advertising business, but the technology could be applicable to other, similar data warehouse tasks. Mesa is dependent on other Google technologies, such as the Colossus distributed file system, the BigTable distributed data storage system, and the MapReduce data analysis framework. Google engineers implemented Paxos, a distributed synchronization protocol, to help address query consistency issues. Mesa also can operate on generic servers, making costly specialized hardware unnecessary and enabling Mesa to be run as a cloud service with the advantage of scalability.

Google’s big-data tool, Mesa, holds petabytes of data across multiple servers – Computerworld.

Share

Leave a Reply

Your email address will not be published. Required fields are marked *