Apache CarbonData is a free and open-sourcecolumn-oriented data storage format of the Apache Hadoop ecosystem. It is similar to the other columnar-storage file formats available in Hadoop namely RCFile and ORC. It is compatible with most of the data processing frameworks in the Hadoop environment. It provides efficient data compression and encoding schemes with enhanced performance to handle complex data in bulk.
History
CarbonData was developed at Huawei in 2013.[3][4] The project was donated to the Apache Community in 2015 submitted to the Apache Incubator in June 2016.[3][4] The project won top honors in the BlackDuck 2016 Open Source Rookies of the Year's Big Data category.[5] Apache CarbonData has been a top-level Apache Software Foundation (ASF)-sponsored project since May 1, 2017.[1]