Apache Impala

From Wikipedia, the free encyclopedia.
Type: free software
Developer: Cloudera and Apache Software Foundation
Platform: Java Virtual Machine
Operating system: cross-platform
Programmed in: C++, Java and Python
Source code repository: https://github.com/apache/impala
Software versions: 2.7.0, 2.8.0, 2.9.0, 2.10.0, 2.11.0, 2.12.0, 3.0.0, 3.0.1, 3.1.0, 3.2.0, 3.3.0, 3.4.0, 3.4.1, 3.4.2, 4.0.0, 4.1.0, 4.1.1, 4.1.2, 4.2.0, 4.3.0, 4.4.0 and 4.4.1
Website: impala.apache.org
License: Apache License
Copyright status: copyrighted

Apache Impala is an open-source massively parallel processing (MPP) SQL query engine for data stored in a computer cluster running Apache Hadoop. Impala has been described as the open-source equivalent of Google F1, which inspired its development in 2012.

Apache Impala is a query engine that runs on Apache Hadoop. The project was announced in October 2012 with a public beta test distribution and became generally available in May 2013.

Impala brings scalable parallel database technology to Hadoop, enabling users to issue low-latency SQL queries against data stored in HDFS and Apache HBase without requiring data movement or transformation. Impala is integrated with Hadoop so that it uses the same file and data formats, metadata, security and resource management frameworks used by MapReduce, Apache Hive, Apache Pig and other Hadoop software.
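The parallel processing idea behind an MPP engine can be illustrated with a toy scatter-gather aggregation. The sketch below is only a conceptual model written for this article, not Impala's actual implementation: the data, the function names `partial_sum` and `coordinator_sum`, and the use of threads to stand in for cluster nodes are all assumptions. It mimics a query of the form `SELECT country, SUM(n) ... GROUP BY country`, where each worker aggregates the rows it holds locally and a coordinator merges the partial results.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical data: each inner list stands for the rows stored on one worker node.
PARTITIONS = [
    [("us", 3), ("de", 1)],
    [("us", 2), ("fr", 5)],
    [("de", 4), ("fr", 1), ("us", 1)],
]


def partial_sum(rows):
    """Worker step: aggregate only the rows held locally."""
    totals = Counter()
    for key, value in rows:
        totals[key] += value
    return totals


def coordinator_sum(partitions):
    """Coordinator step: fan the work out, then merge the partial aggregates."""
    with ThreadPoolExecutor(max_workers=len(partitions)) as pool:
        partials = list(pool.map(partial_sum, partitions))
    merged = Counter()
    for part in partials:
        merged.update(part)  # Counter.update adds counts key by key
    return dict(merged)


result = coordinator_sum(PARTITIONS)
print(result)
```

Because each worker touches only its own partition, the rows never have to be moved to a single machine before aggregation; only the small partial totals travel to the coordinator.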

Impala is promoted to analysts and data scientists for performing analytics on data stored in Hadoop via SQL or business intelligence tools. The result is that large-scale data processing (via MapReduce) and interactive queries can be done on the same system using the same data and metadata, removing the need to migrate data sets into specialized and/or proprietary systems simply to perform analysis.

Features include:

  • Supports HDFS, S3, ABFS, Apache HBase and Apache Kudu storage
  • Reads Hadoop file formats, including text, LZO, SequenceFile, Avro, RCFile, Parquet and ORC
  • Supports Hadoop security (Kerberos authentication, LDAP)
  • Fine-grained, role-based authorization with Apache Sentry and Apache Ranger
  • Uses metadata, ODBC driver, and SQL syntax from Apache Hive
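As an illustration of how the file formats listed above surface in the Hive-style SQL syntax, the following sketch builds a `CREATE EXTERNAL TABLE ... STORED AS ... LOCATION ...` statement as a string. The helper name `external_table_ddl`, the table name, the columns and the path are hypothetical. LZO-compressed text is left out because it needs additional setup, and the helper does no quoting or escaping, so it is suitable for illustration only.

```python
# STORED AS keywords for formats named in the feature list.
# LZO-compressed text is omitted from this sketch.
STORED_AS = {
    "text": "TEXTFILE",
    "sequencefile": "SEQUENCEFILE",
    "avro": "AVRO",
    "rcfile": "RCFILE",
    "parquet": "PARQUET",
    "orc": "ORC",
}


def external_table_ddl(table, columns, file_format, location):
    """Build DDL for a table over files that already exist at `location`."""
    keyword = STORED_AS[file_format.lower()]
    cols = ", ".join(f"{name} {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols}) "
        f"STORED AS {keyword} LOCATION '{location}'"
    )


# Hypothetical table over existing Parquet files.
ddl = external_table_ddl(
    "web_logs",
    [("ts", "TIMESTAMP"), ("url", "STRING")],
    "parquet",
    "/data/web_logs",
)
print(ddl)
```

Declaring the table as external over an existing directory reflects the point made earlier in the article: the files stay where they are, and only metadata is added.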

In early 2013, a column-oriented file format called Parquet was announced for architectures including Impala. In December 2013, Amazon Web Services announced support for Impala. In early 2014, MapR added support for Impala. In 2015, another format called Kudu was announced, which Cloudera proposed to donate to the Apache Software Foundation along with Impala. Impala graduated to an Apache Top-Level Project (TLP) on 28 November 2017.

See also

  • Apache Drill - a similar open-source project inspired by Dremel
  • Dremel - a similar tool from Google
  • Trino - an open-source SQL query engine created by the creators of Presto
  • Presto - an open-source SQL query engine created by Facebook and supported by Teradata

External links
