Add: schema loading support for Presto query runner #1233

ninneko · 2016-08-10T10:28:31Z

No description provided.

arikfr · 2016-08-10T13:59:43Z

Thanks!

My only concern is that some users have a lot of tables in Presto, which might result in long schema load time due to the need to run a separate query for each table.

Do you know how fast those queries are? What size of database did you test this with?

hussainbohra · 2016-08-11T20:58:38Z

I had a look into this code - Infact I also implemented the query browser for a presto - I am running only one query and that fetches all schema and tables

def get_schema(self, get_stats=False):
        schema = {}
        query = SELECT table_schema, table_name, column_name FROM information_schema.columns      WHERE table_schema NOT IN ('pg_catalog', 'information_schema')


        results, error = self.run_query(query)
        if error is not None:
            raise Exception("Failed getting schema.")

        results = json.loads(results)

        for row in results['rows']:
            if row['table_schema'] != 'public':
                table_name = '{}.{}'.format(row['table_schema'], row['table_name'])
            else:
                table_name = row['table_name']

            if table_name not in schema:
                schema[table_name] = {'name': table_name, 'columns': []}

            schema[table_name]['columns'].append(row['column_name'])

        return schema.values()

The base class will also remain the same (BaseQueryRunner)

ninneko · 2016-08-21T05:15:39Z

@arikfr I tested on EMR cluster that have 5 schemas and 30 tables, and I didnt feel the problem.

There is two reason that I dont think this code is bad.
First, this queris are also used in the hive plugin and the impala plugin, and these plugins are already commonly used.
Second, I dont think that mete tables should be directly used.

Thank you for comments and sorry for poor English.

arikfr · 2016-08-28T19:32:42Z

I understand the concern of using meta tables, but the method of running a query for each table is just not something we can add. @rohanpd benchmarked your method on their Presto cluster and resulted in thousands of queries and unreasonable time to complete.

I've decided to go with @hussainbohra's solution as implemented by @rohanpd in #1252.

fix schema resolves for presto

8d29bef

arikfr changed the title ~~fix schema resolves for presto~~ Add: schema loading support for Presto query runner Aug 10, 2016

rohanpd mentioned this pull request Aug 23, 2016

Add: Schema loading support for Presto query runner (using information_schema) #1252

Merged

arikfr closed this Aug 28, 2016

snyk-bot mentioned this pull request Sep 15, 2021

[Snyk] Fix for 1 vulnerabilities MaxMood96/redash#23

Open

MaxMood96 mentioned this pull request Nov 5, 2022

[Snyk] Fix for 1 vulnerabilities MaxMood96/redash#62

Open

MaxMood96 mentioned this pull request Dec 26, 2022

[Snyk] Fix for 1 vulnerabilities MaxMood96/redash#89

Open

MaxMood96 mentioned this pull request Nov 28, 2023

[Snyk] Fix for 16 vulnerabilities MaxMood96/redash#129

Open

MaxMood96 mentioned this pull request Dec 20, 2023

[Snyk] Fix for 28 vulnerabilities MaxMood96/redash#133

Open

MaxMood96 mentioned this pull request May 13, 2024

[Snyk] Fix for 2 vulnerabilities MaxMood96/redash#161

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add: schema loading support for Presto query runner #1233

Add: schema loading support for Presto query runner #1233

ninneko commented Aug 10, 2016

arikfr commented Aug 10, 2016

hussainbohra commented Aug 11, 2016 •

edited by arikfr

Loading

ninneko commented Aug 21, 2016

arikfr commented Aug 28, 2016

Add: schema loading support for Presto query runner #1233

Add: schema loading support for Presto query runner #1233

Conversation

ninneko commented Aug 10, 2016

arikfr commented Aug 10, 2016

hussainbohra commented Aug 11, 2016 • edited by arikfr Loading

ninneko commented Aug 21, 2016

arikfr commented Aug 28, 2016

hussainbohra commented Aug 11, 2016 •

edited by arikfr

Loading