Hadoop Interview Questions
Hadoop Interview Questions
sudo
2. Incremental load options in hive , examples ? dwgeek.com
3. Exporting data from mysql to HDFS using sqoop
a. Sqoop
b. Encrypt
c. decrypt
4. Which is better parquet or orc?
ORC (Optimized Row Columnar) indexes are used only for the selection of stripes and row
groups and not for answering queries. AVRO is a row-based storage format
whereas PARQUET is a columnar based storage format. PARQUET is much better for
analytical querying i.e. reads and querying are much more efficient than writing
5.