Add support for Hive/Impala
See original GitHub issueIt would be really nice for Caravel to have support for Hive/Impala. There is a nice package called Impyla that provides a SQLAlchemy interface to Hive and Impala.
I am testing it out with Caravel, seems to be working so far with Hive. I can connect to my Hive database, there is a problem where SQLAlchemy tries to run a select version()
as the first query, as this is not a valid Hive function, it causes an immediate failure. Once I resolve this I will open a pull request.
Issue Analytics
- State:
- Created 7 years ago
- Reactions:6
- Comments:14 (6 by maintainers)
Top Results From Across the Web
Tutorial: Using Impala, Hive and Hue with Virtual Private ...
In a Virtural Private Cluster environment, Hue and the impala-shell can be used to setup databases, tables, insert and retrieve data using ...
Read more >Add support for Hive/Impala · Issue #339 · apache/superset
I am testing it out with Caravel, seems to be working so far with Hive. I can connect to my Hive database, there...
Read more >Apache Impala support (with Hive Metastore)
Add new connection ... To connect to Hive Metastore and create new documentation by clicking Add documentation and choosing Database connection.
Read more >The Relationship Between Impala and Hive and its ...
Impala and Hive are both data query tools built on Hadoop, each with different focus on adaptability. From the perspective of client use, ......
Read more >SQL Differences Between Impala and Hive
Impala supports high-performance UDFs written in C++, as well as reusing some Java-based Hive UDFs. Impala supports scalar UDFs and user-defined aggregate ...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
I added an extra in the Superset “create database connection” for passing some connection arguments:
So I introduced a connection URL as following one:
impala://impala_host:21050/impala_database
And it worked
Note: I had run a kinit before to connect.
Raised a pull request to fix this issue with Impala here: https://github.com/cloudera/impyla/pull/191
I have tested the Impyla fix, and verified that I now have a working Hive/Impala connection in Caravel. Once the Impyla fix is in, I will make a pull request against Caravel to add it.