Big Data Forum Collection: [impala-user] Impala on existed Hadoop clusters

We are considering how to use Impala in our production environment. There are two choice:

1. Building a dedicated clusters

2. Installing Impalad on the existed hadoop clusters

I prefer #2, but it raises another question, how many Impalad daemons should I install on existed hadoop clusers? one impalad per datanode?

Yes, one impalad per datanode. We generally recommend to co-locate the Impalads with the HDFS datanodes to allow scheduling queries for local short-circuit reads (better performance).

Big Data Forum Collection

2014년 12월 30일 화요일

[impala-user] Impala on existed Hadoop clusters

댓글 없음:

댓글 쓰기