Saturday, November 15, 2014

Querying Hive and Impala from Tableau (on Windows)

You need to install the 32 and 64 bit ODBC drivers/Connectors for one of or both Hive and Impala from here:


http://ift.tt/1rB1pwX


You will need to run the ODBC Administrator program that’s built into Windows to configure the “System DSN” that gets created for each one when you install the drivers/Connectors. Just hit the Windows key and type “odbc admin”. There’s a 32-bit one and a 64-bit one. The 32-bit one can only edit the 32-bit DSNs. The 64-bit one can only edit the 64-bit DSNs. Find them on the “System DSN” tab. And, yes, you need to edit both to point at your Impala and/or Hive server. If you’re not sure which node(s) that is, you can find them in Cloudera Manager:


1. Cluster > Hive > Instances > HiveServer2

– Whichever machine is running that role instance is where you need to point all your DSNs

2. The ports to point at are found here:

– Cluster > Impala > Configuration > Ports > “Impala Daemon HiveServer2 Port” (default is 21050)

– Cluster > Hive > Configuration > Ports > “HiveServer2 Port” (default is 10000)





No comments:

Post a Comment