Vertica
Vertica Systems is an analytic database management software company.[1][2] Vertica was founded in 2005 by database researcher Michael Stonebraker and Andrew Palmer. Palmer was the founding CEO. Ralph Breslauer and Christopher P. Lynch served as later CEOs.
Industry | Database management & Data warehousing |
---|---|
Founded | 2005 |
Founder | Andrew Palmer and Michael Stonebraker |
Headquarters | Cambridge, MA, United States |
Key people | Colin Mahony (SVP and General Manager) |
Products | Vertica Analytics Platform Enterprise Edition, Vertica SQL on Hadoop, Vertica Analytics Platform Community Edition |
Parent | Micro Focus |
Website | www |
Lynch joined as Chairman and CEO in 2010 and was responsible for Vertica's acquisition by Hewlett Packard in March 2011.[3][4] The acquisition expanded the HP Software portfolio for enterprise companies and the public sector group.[5] As part of the Micro Focus-Hewlett Packard Enterprise merger, Vertica joined Micro Focus in September, 2017.
Products
The column-oriented Vertica Analytics Platform was designed to manage large, fast-growing volumes of data and provide very fast query performance when used for data warehouses and other query-intensive applications. The product claims to greatly improve query performance over traditional relational database systems, and to provide high availability and exabyte scalability on commodity enterprise servers. Vertica is infrastructure-independent, supporting deployments on multiple cloud platforms (AWS, Google Cloud, Azure), on-premises and natively on Hadoop nodes. Vertica's Eon Mode, available on Amazon Web Services and on premise with Pure Storage Flashblade, separates compute from storage and leverages low cost S3 object storage and the ability to apply compute to variable workloads, capitalizing on cloud economics. Vertica claims that its Eon Mode architecture is the only analytics platform that separates compute from storage and brings the advantages of cloud architecture to on premise data centers.
Its design features include:
- Column-oriented storage organization, which increases performance of sequential record access at the expense of common transactional operations such as single record retrieval, updates, and deletes.[6]
- Massively parallel processing (MPP) architecture to distribute queries on independent nodes and scale performance linearly.
- Standard SQL interface with many analytics capabilities built-in, such as time series gap filling/interpolation, event-based windowing and sessionization, pattern matching, event series joins, statistical computation (e.g., regression analysis), and geospatial analysis.
- In-database machine learning including categorization, fitting and prediction to enhance processing speed by eliminating the need for down-sampling and data movement. Vertica offers a variety of in-database algorithms, including linear regression, logistic regression, k-means clustering, Naive Bayes classification, random forest decision trees, and support vector machine regression and classification. It also allows deployment of ML models to multiple clusters.
- High Compression, possible because columns of homogeneous datatype are stored together and because updates to the main store are batched.[7]
- Shared-nothing architecture, which reduces system contention for shared resources and allows gradual degradation of performance in the face of hardware failure.
- Automated workload management, data replication, server recovery, query optimization, and storage optimization.
- Native integration with open source big data technologies like Apache Kafka and Apache Spark.
- Support for standard programming interfaces, including ODBC, JDBC, ADO.NET, and OLEDB.
- High-performance and parallel data transfer to statistical tools such as built-in machine learning algorithms.[8][9]
Vertica's specialized approach aims to significantly increase query performance in data warehouses, while reducing the total cost of ownership by reducing the hardware footprint.[10]
In late 2011, the Vertica Analytics Platform Community Edition was made available for free with certain limitations, such as a maximum of one terabyte of raw data, three-node (servers) cluster, and community-based support.[11]
Optimizations
The Vertica Analytics Platform runs on clusters of Linux-based commodity servers. It is also available on the Amazon Elastic Compute Cloud , Microsoft Azure and the Google Cloud Platform, ensuring no infrastructure or platform lock in. The product integrates with Hadoop[12] to leverage HDFS via External Tables with ORC and Parquet Readers and can be installed on Hadoop nodes in a co-located manner as Vertica for SQL on Hadoop (a separate offering, priced by per node). These combined capabilities allow users to choose where to analyze their data, including across multiple data lakes.
A range of BI, data visualization, and ETL tools are certified to work with and integrate with the Vertica Analytics Platform. Vertica also offers a certified and secure interface with the popular Kafka message bus, allowing streaming data ingestion. This capability combined with Vertica's high performance analytics supports use cases like Internet of Things, Edge Analytics and near real time Fraud Prevention.
Several of Vertica’s features were originally prototyped within the C-Store column-oriented database, an academic open source research project at MIT and other universities. The system's architecture is described in a 2012 VLDB paper.[13]
Versions and documentation
- Vertica Analytics Platform 10.x[14]
- Vertica Analytics Platform 9.3.x[15]
- Vertica Analytics Platform 9.2.x[16]
- Vertica Analytics Platform 9.1.x[17]
- Vertica Analytics Platform 9.0.x[18]
- Vertica Analytics Platform 8.1.x[19]
- Vertica Analytics Platform 8.0.x[20]
- Vertica Analytics Platform 7.2.x[21]
- Vertica Analytics Platform 7.1.x[22]
- Vertica Analytics Platform 7.0.x[23]
- Vertica Analytics Platform 6.1.x[24]
- Vertica 6.0.x Enterprise Edition[25]
- Vertica 5.1 Enterprise Edition[26]
- Vertica Enterprise Edition 5.0[27]
- Vertica Enterprise Edition 4.1[28]
Company events
In January 2008, Sybase filed a patent-infringement lawsuit against Vertica.[29] In January 2010, Vertica prevailed in a preliminary hearing,[30] and in June, 2010, Sybase and Vertica resolved the suit, with the court dismissing all infringement claims.[31] Under the leadership of Colin Mahony, Vertica has sponsored various technological events in the database industry.[32]
In August 2013, Vertica held its first Big Data conference[33] event in Boston, MA USA. This event was held again in 2014, 2015, 2016, and 2017.
In 2016, Vertica published The Big Data Transformation: Understanding Why Change is Actually Good for Your Business.
References
- Network World staff: "New database company raises funds, nabs ex-Oracle bigwigs”, LinuxWorld, February 14, 2007
- Brodkin, J: "10 enterprise software companies to watch", Archived 2007-05-18 at the Wayback Machine Network World, April 11, 2007
- HP News Release: “HP to Acquire Vertica: Customers Can Analyze Massive Amounts of Big Data at Speed and Scale” Feb. 2011
- HP News Release: “HP Completes Acquisition of Vertica Systems, Inc.” March 22, 2011.
- ComputerWorld.com: “Update: HP to buy Vertica for analytics.” Kanaracus. Feb. 2011.
- Monash, C: "Are row-oriented RDBMS obsolete?" DBMS2, January 22, 2007
- Monash, C: "Mike Stonebraker on database compression – comments”,DBMS2, March 24, 2007
- Gagliordi, Natalie. "HP adds scale to open-source R in latest big data platform". ZDNet. Retrieved 17 February 2015.
- Prasad, Shreya; Fard, Arash; Gupta, Vishrut; Martinez, Jorge; LeFevre, Jeff; Xu, Vincent; Hsu, Meichun; Roy, Indrajit (2015). "Enabling predictive analytics in Vertica: Fast data transfer, distributed model creation and in-database prediction". ACM SIGMOD International Conference on Management of Data.
- One Size Fits All? Part 2: Benchmarking Results (sect. 3.1)
- "Vertica Announces Community Edition Version of Vertica Analytic Database". Archived from the original on July 4, 2015. Retrieved August 17, 2016.
- "Vertica-Hadoop integration". DBMS2. October 12, 2010.
- "The Vertica Analytic Database: C-Store 7 Years Later" (PDF). VLDB. August 28, 2012.
- Documentation https://my.vertica.com/docs/10.0.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/9.3.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/9.2.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/9.1.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/9.0.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/8.1.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/8.0.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/7.2.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/7.1.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/7.0.x/HTML/index.htm
- Documentation https://my.vertica.com/docs/6.1.x/HTML/index.htm
- Documentation http://www.vertica.com/documentation/hp-vertica-documentation-6-0-x/
- Documentation http://www.vertica.com/documentation/hp-vertica-5-1-x-enterprise-edition-product-documentation/
- Documentation http://www.vertica.com/documentation/hp-vertica-enterprise-edition-5-0-product-documentation/
- Documentation http://www.vertica.com/documentation/hp-vertica-documentation-5-1/
- Sybase, Inc. v. Vertica Systems, Inc. (Texas Eastern District Court January 30, 2008).Text
- Monash, C: "Vertica slaughters Sybase in patent litigation”,DBMS2, January 14, 2010
- Vertica Press Release, "Vertica Resolves Sybase Patent Lawsuits" http://www.vertica.com/news/press/vertica-resolves-sybase-patent-lawsuits/
- http://www.vertica.com/news/events/
- HP Vertica Big Data Conference 2013 http://www.vertica.com/hp-vertica-big-data-conference-2013/