Recommended: Which database is best suited to data analysts?

Database analyst

shujufenxi· 2016-07-01 17:24:52

Data analysts use a database as a data warehouse in which to process and manipulate data, so which database suits analysts best? Many articles online compare databases, but they focus on general architecture, cost, scalability and performance, and seldom consider another key factor: how difficult it is for analysts to write queries in each database. Recently, Benn Stancil, chief analyst at Mode, published an article that looks at which database best suits data analysts from this other angle.


Benn Stancil argues that what slows analysts down when they work with a database is often not its macro-level performance but the details of writing queries. For example, how do you get the current time in Redshift: NOW(), CURDATE(), CURDATE, SYSDATE or WHATDAYISIT? Analysts at Mode write thousands of queries every day in different languages, and the queries run in the Mode editor number in the millions. Starting from this data, Benn Stancil compared eight databases: MySQL, PostgreSQL, Redshift, SQL Server, BigQuery, Vertica, Hive and Impala.

First, Benn Stancil compared the query error rates of the eight databases.
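As a rough illustration of this kind of comparison, the sketch below computes an error rate per database from a query log. It assumes that "error rate" means the share of query runs that fail, which the article does not define. The log entries and the function name `error_rates` are hypothetical, not data or code from Mode.

```python
from collections import defaultdict

# Hypothetical query log: (database, succeeded). Not data from the article.
QUERY_LOG = [
    ("MySQL", True), ("MySQL", True), ("MySQL", False), ("MySQL", True),
    ("Vertica", False), ("Vertica", True), ("Vertica", False), ("Vertica", True),
    ("Hive", True), ("Hive", False), ("Hive", True), ("Hive", True), ("Hive", True),
]


def error_rates(log):
    """Return {database: share of query runs that failed}.

    Assumption: error rate = failed runs / all runs for that database.
    """
    runs = defaultdict(int)
    failures = defaultdict(int)
    for database, succeeded in log:
        runs[database] += 1
        if not succeeded:
            failures[database] += 1
    return {db: failures[db] / runs[db] for db in runs}


if __name__ == "__main__":
    # Print databases from lowest to highest error rate.
    for db, rate in sorted(error_rates(QUERY_LOG).items(), key=lambda kv: kv[1]):
        print(f"{db}: {rate:.0%}")
```

A raw rate like this says nothing about who is writing the queries or how hard their questions are, which matters when reading such numbers.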




Benn Stancil cautions, however, that this result may not be rigorous. Impala, MySQL and Hive are free, open-source products, whereas Vertica, SQL Server and BigQuery are not. Users of the latter three are usually large enterprises with ample analytics budgets, so their higher error rates probably reflect deeper use rather than a language that is "harder to use".

In addition to the error rate, Benn Stancil also looked at query length. Simply comparing final query lengths would be biased, so he examined how queries become more complex as an analysis progresses, that is, the relationship between the number of edits and query length.

His figure shows that after about 20 edits a query is usually twice its original length, and after 100 edits it is three times its original length. So how does the error rate behave as the number of edits grows?






The matrix shows the difference in error rate between the database at the top and the database it is compared against; the higher the number, the worse the database at the top performs. For example, take the "20.2" at the intersection of Hive and BigQuery: among analysts who use both databases, the error rate with Hive is 20.2 higher than with BigQuery. The bottom row, Total, gives the overall result, and it shows that MySQL and PostgreSQL perform better. Vertica jumps from near the bottom to the middle, beating SQL Server and Hive, which suggests that Vertica's high error rate probably reflects the analysts who use it rather than the language itself.
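The idea behind the matrix can be sketched as follows: for each pair of databases, look only at analysts who use both, and average the difference between their error rates. This is a minimal sketch under stated assumptions, not Benn Stancil's actual method. The per-analyst numbers and the function names are hypothetical, and treating the Total row as a simple sum is an assumption the text does not confirm.

```python
from itertools import permutations
from statistics import mean

# Hypothetical per-analyst error rates; not data from the article.
ANALYST_ERROR_RATES = {
    "analyst_a": {"Hive": 40.0, "BigQuery": 20.0, "MySQL": 10.0},
    "analyst_b": {"Hive": 30.0, "BigQuery": 10.0},
    "analyst_c": {"MySQL": 15.0, "BigQuery": 25.0},
}


def pairwise_differences(rates_by_analyst):
    """Map (top, other) to the mean of (top - other) over analysts using both."""
    databases = sorted({db for rates in rates_by_analyst.values() for db in rates})
    matrix = {}
    for top, other in permutations(databases, 2):
        diffs = [
            rates[top] - rates[other]
            for rates in rates_by_analyst.values()
            if top in rates and other in rates  # only analysts who use both
        ]
        if diffs:
            matrix[(top, other)] = mean(diffs)
    return matrix


def totals(matrix):
    """Sum each database's differences against all others (assumed 'Total' row)."""
    result = {}
    for (top, _other), diff in matrix.items():
        result[top] = result.get(top, 0.0) + diff
    return result


if __name__ == "__main__":
    matrix = pairwise_differences(ANALYST_ERROR_RATES)
    print(matrix[("Hive", "BigQuery")])  # positive: Hive errs more than BigQuery here
    print(totals(matrix))
```

Comparing within the same analysts is what separates the language from the people using it: a database whose users simply attempt harder work no longer looks worse for that reason alone.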


Finally, Benn Stancil concludes that, of the eight databases analyzed, MySQL and PostgreSQL have the SQL that is simplest to write and most widely used, but compared with Vertica and SQL Server they are less feature-rich and slower. Taking all factors together, Redshift may be the best choice.

Reposted from: China Statistics Network; http://www.itongji.cn/cms/article/articledetails? Articleid=40

