http://blog.cloudera.com/blog/2014/01/i ... ass-speed/
Could the 'commercial analytic DBMS' possibly be Vertica?
Impala now faster than 'commercial analytic DBMS' ?
Moderator: NorbertKrupa
-
- Newbie
- Posts: 12
- Joined: Tue May 07, 2013 5:28 pm
- JimKnicely
- Site Admin
- Posts: 1825
- Joined: Sat Jan 21, 2012 4:58 am
- Contact:
Re: Impala now faster than 'commercial analytic DBMS' ?
Jim Knicely
Note: I work for Vertica. My views, opinions, and thoughts expressed here do not represent those of my employer.
Note: I work for Vertica. My views, opinions, and thoughts expressed here do not represent those of my employer.
Re: Impala now faster than 'commercial analytic DBMS' ?
I’m not sure that anything can be concluded from the Quora references. But from a quick scan of the Impala vs. DBMS-Y article, it refers to data being in a column store but says nothing about it being optimized. If it were vertica - is a comparison with unoptimized projections meaningful? I don’t think so.
—Sharon
—Sharon
Sharon Cutter
Vertica Consultant, Zazz Technologies LLC
Vertica Consultant, Zazz Technologies LLC
- JimKnicely
- Site Admin
- Posts: 1825
- Joined: Sat Jan 21, 2012 4:58 am
- Contact:
Re: Impala now faster than 'commercial analytic DBMS' ?
I was just implying that if Cloudera was using Vertica there'd be no need to make the comparison in the first place
Jim Knicely
Note: I work for Vertica. My views, opinions, and thoughts expressed here do not represent those of my employer.
Note: I work for Vertica. My views, opinions, and thoughts expressed here do not represent those of my employer.
-
- Newbie
- Posts: 12
- Joined: Tue May 07, 2013 5:28 pm
Re: Impala now faster than 'commercial analytic DBMS' ?
With Parquet, Impala now has column store as well. I met the guy behind Impala, Marcel Kornacker, a while ago at a conference. They wanted to be as fast as Vertica within a year.
Now I know, wanting something, and doing your own tests, doesn't make it so. For one, HDFS is still a blocker, as far as I know. With Vertica you have control over how you segment your data. You typically replicate your small tables for efficient, local joins. Last time I checked, joins were always distributed in Impala.
Still, it's a player to keep an eye on.
Now I know, wanting something, and doing your own tests, doesn't make it so. For one, HDFS is still a blocker, as far as I know. With Vertica you have control over how you segment your data. You typically replicate your small tables for efficient, local joins. Last time I checked, joins were always distributed in Impala.
Still, it's a player to keep an eye on.