POC Issues 0327

spark-shell --packages com.hortonworks:shc-core:1.1.1-2.1-s_2.11 \
  --repositories http://repo.hortonworks.com/content/groups/public/ \
  --files /etc/hbase/4.2.5.0-0000/0/hbase-site.xml \
  --jars /usr/iop/4.2.5.0-0000/hbase/lib/hbase-client.jar,/usr/iop/4.2.5.0-0000/hbase/lib/hbase-protocol.jar,/usr/iop/4.2.5.0-0000/hbase/lib/hbase-common.jar,/usr/iop/4.2.5.0-0000/hbase/lib/hbase-server.jar,/usr/iop/4.2.5.0-0000/hbase/lib/guava-12.0.1.jar,/usr/iop/4.2.5.0-0000/hbase/lib/htrace-core-3.1.0-incubating.jar,/usr/iop/4.2.5.0-0000/hbase/lib/zookeeper.jar,/usr/iop/4.2.5.0-0000/hbase/lib/protobuf-java-2.5.0.jar,/usr/iop/4.2.5.0-0000/hbase/lib/hbase-hadoop2-compat.jar,/usr/iop/4.2.5.0-0000/hbase/lib/hbase-hadoop-compat.jar,/usr/iop/4.2.5.0-0000/hbase/lib/metrics-core-2.2.0.jar,/usr/iop/4.2.5.0-0000/hbase/lib/htrace-core-3.1.0-incubating.jar,/usr/iop/4.2.5.0-0000/hbase/lib/hbase-spark.jar,/usr/iop/4.2.5.0-0000/hive2/lib/hive-hbase-handler.jar,/usr/iop/4.2.5.0-0000/hadoop/lib/hadoop-lzo-0.5.1.jar \
  --master yarn

Reading HDFS File

sc.textFile("hdfs:/Data/csc_insights/disability/rdz/cbs/member/preferences/ONETIME_CBS_TCBECSP_20171
(l.substring(0, 10).trim())).toDF.show(5)

Querying HDFS File


val textFile = sc.textFile("hdfs:/Data/csc_insights/disability/rdz/cbs/member/preferences/ONETIME_CBS_TCBECSP_20171018021626.DAT")
textFile.distinct.count
textFile.toDF.registerTempTable("Sample")
val result = sqlContext.sql("select count(distinct(value)) from Sample")
result.show

Filter Condition

HBase table
import org.apache.hadoop.hbase.spark
import org.apache.spark.sql.{SQLContext, _}
import org.apache.spark.sql.execution.datasources.hbase._
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.hadoop.hbase.HBaseConfiguration
import org.apache.hadoop.hbase.mapreduce.TableInputFormat
import org.apache.hadoop.hbase.client.HBaseAdmin
import org.apache.hadoop.hbase.{HTableDescriptor,HColumnDescriptor}
import org.apache.hadoop.hbase.util.Bytes
import org.apache.hadoop.hbase.client.{Put,HTable}
import org.apache.hadoop.fs.{Path, FileAlreadyExistsException, FileSystem}
import org.apache.hadoop.hbase.client._
import org.apache.hadoop.hbase.spark._
val conf = HBaseConfiguration.create()
conf.addResource(new Path("/etc/hbase/4.2.5.0-0000/0/hbase-site.xml"))
conf.addResource(new Path("/etc/hbase/4.2.5.0-0000/0/core-site.xml"))
val hbaseContext = new HBaseContext(sc, conf)
val sqlContext = new org.apache.spark.sql.SQLContext(sc)
import sqlContext.implicits._
val accountMapping = s"""rowkey INTEGER :key, SRC_SYS_NM STRING b:SRC_SYS_NM, RPT_NUM STRING b:RPT_NUM""".stripMargin
val accountdf = sqlContext.read.format("org.apache.hadoop.hbase.spark").option("hbase.columns.mapping", accountMapping).option("hbase.table", "T_RPT_NUM_ACT").load().persist
accountdf.registerTempTable("edms_qa_test")

val result1 =sqlContext.sql("select count(*) from edms_qa_test where SRC_SYS_NM = 'UDS'")


val result1 =sqlContext.sql("select * from edms_qa_test limit 5")

Joining HBase and HDFS file
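No join example survived the export in this section, so the following is a minimal sketch only: it reuses the accountdf DataFrame loaded from HBase above and the HDFS file read earlier, and assumes that the first 10 characters of each flat-file record carry the RPT_NUM join key (that column choice is an assumption, not taken from the original sheet).

val hdfsDF = sc.textFile("hdfs:/Data/csc_insights/disability/rdz/cbs/member/preferences/ONETIME_CBS_TCBECSP_20171018021626.DAT")
  .map(l => l.substring(0, 10).trim)
  .toDF("RPT_NUM")                                   // assumed join key extracted from the flat file
val joined = accountdf.join(hdfsDF, Seq("RPT_NUM"))  // accountdf is the HBase-backed DataFrame loaded above
joined.select("RPT_NUM", "SRC_SYS_NM").show(5)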


CREATE VIEW T_CLM_PY
( ROWKEY VARCHAR PRIMARY KEY,
b."SRC_SYS_NM" VARCHAR,
b."CLM_GUID " VARCHAR,
b."CLM_NUM" VARCHAR,
b."PMT_ID" VARCHAR,
b."PY_HIST_IND" VARCHAR,
e."PAYMENTCONTACTHISTORY" VARCHAR);

CREATE VIEW T_CLM
( ROWKEY VARCHAR PRIMARY KEY,
b."SRC_SYS_NM" VARCHAR,
b."CLM_NUM" VARCHAR,
e."PAYMENTCONTACTHISTORY" VARCHAR);

SELECT COUNT(*) FROM "T_CLM_PY";

SELECT * FROM "T_CLM_PY" WHERE "e"."PAYMENTCONTACTHISTORY" IS NOT NULL LIMIT 5;


The Bravo team has provided a build that will do the below comparisons using Spark:
a. Base object: (Hive or HDFS or HBASE) vs (Hive or HDFS or HBASE)
b. Multiple objects: (Hive & HDFS) vs (Hbase_1 & Hbase_2), and all other combinations as well

Process:
1. The comparison is done using Spark. MapReduce comparison is not available in this new build.
2. As of now, the build supports and reads data from the Hive, HDFS & HBASE components.
3. The process reads the data from Hive, HDFS files or HBASE tables and creates & loads them into Spark-SQL tables (a minimal sketch of this flow is shown after this list).
4. The output files are written to HDFS. Since this is Spark, it is not possible to write log files to the local Unix server [the way our current build works is not applicable here].
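A minimal sketch of this flow, assuming two delimited HDFS files with the same layout (the paths are placeholders, not the build's actual internals):

// Hypothetical sketch: read two sources, register them as Spark SQL tables, and diff them.
val srcDF = sqlContext.read.csv("/user/jmichael2/source.csv")   // placeholder path
val tgtDF = sqlContext.read.csv("/user/jmichael2/target.csv")   // placeholder path
srcDF.registerTempTable("SRC")
tgtDF.registerTempTable("TGT")
// rows present on one side but missing on the other
sqlContext.sql("select * from SRC except select * from TGT").show(5)
sqlContext.sql("select * from TGT except select * from SRC").show(5)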

Below are the prerequisites:


1. Java 1.8
2. Spark 2.1
3. Access to Namenode [NT10] - We are seeing issues with the ZooKeeper services on edge node [ET01]; on Namenode [NT10], it is running successfully
4. Access to write files to HDFS layer

POCs
1. HDFS vs HDFS comparison
/*HDFS vs HDFS*/
spark-submit --master local[*] --class cog.bravo.sparkComp.SparkComp_3 /hadoop/dsa/qa/jmichael2/Bravo_Next_Gen_Biz_QA_Unix/DataLake_Processing_QA/DataLake_Test_Case_Executor/cogBrv3897_Metlife.jar \
/hadoop/dsa/qa/jmichael2/Bravo_Next_Gen_Biz_QA_Unix/DataLake_Processing_QA/DataLake_Test_Case_Executor "logs" "TC_1" \
SparkSQL_Obj "" "" "<SparkSQLObj> <object> <source_type>FILE</ <source>/user/jmich <alias>csv_src</alias><header>false</head<delimiter </object> <sql> select * from csv_src </sql></SparkSQLObj>" "" "" "" "" "" "" "" "" "" "" "," "1" \
SparkSQL_Obj "" "" "<SparkSQLObj> <object> <source_type>FILE</ <source>/user/jmich <alias>csv_src</alias><header>false</head<delimiter </object> <sql> select * fr </sql></SparkSQLObj>" "" "" "" "" "" "" "" "" "" "" "," "1" \
true "true" "/user/jmichael2/test" "" "true"

Output files:
2. HIVE vs HIVE comparison
/*HIVE vs HIVE*/
spark-submit --master local[*] --packages org.apache.hbase:hbase-common:1.0.0,org.apache.hbase:hbase-client:1.0.0,org.apache.hbase:hbase-server:1.0.0 --class cog.bravo.sparkComp.SparkComp_3 /hadoop/dsa/qa/jmichael2/Bravo_Next_Gen_Biz_QA_Unix/DataLake_Processing_QA/DataLake_Test_Case_Executor/cogBrv3897_Metlife.jar \
/hadoop/dsa/qa/jmichael2/Bravo_Next_Gen_Biz_QA_Unix/DataLake_Processing_QA/DataLake_Test_Case_Executor "logs" "TC_1" \
SparkSQL_Obj "" "" "<SparkSQLObj> <object> <source_type>HIVE</<source>select * fr <alias>DPA</object> <sql> select SUB </sql></SparkSQLObj>" "" "" "" "" "" "" "" "" "" "" "," "1" \
SparkSQL_Obj "" "" "<SparkSQLObj> <object> <source_type>HIVE</<source>select * fr <alias>DPA</object> <sql> select SUB </sql></SparkSQLObj>" "" "" "" "" "" "" "" "" "" "" "," "1" \
true "true" "/user/jmichael2/test" "" "true"

Output Files:

3. HBASE vs HIVE
/*HIVE vs HBASE - NT10*/
spark-submit --master local[*] --packages org.apache.hbase:hbase-common:1.0.0,org.apache.hbase:hbase-client:1.0.0,org.apache.hbase:hbase-server:1.0.0 --class cog.bravo.sparkComp.SparkComp_3 /home/METNET/jmichael2/cogBrv3897_Metlife0320.jar \
/home/METNET/jmichael2 "logs" "TC_1" \
SparkSQL_Obj "" "" "<SparkSQLObj> <object> <source_type>HIVE</<source>select * fr <alias>DPA</object> <sql> select SUB </sql></SparkSQLObj>" "" "" "" "" "" "" "" "" "" "" "," "1" \
SparkSQL_Obj "" "" "<SparkSQLObj> <object> <source_type>HBASE<<source>T_RPT_NUM_ <alias>DPA_TGT</alia <columns></object> <sql> select B_R </sql></SparkSQLObj>" "" "" "" "" "" "" "" "" "" "" "," "1" \
true "true" "/user/jmichael2/test" "" "true"
1. Querying Larger data sets
Issue: Error: Operation timed out. (state=TIM01,code=6000)
Root Cause: This is primarily seen with queries running on larger data sets because the default Phoenix configurations are hitting timeout limits.

Resolution:
To resolve this issue, make sure that the HBASE_CONF_PATH environment variable is set before launching sqlline.py. This variable should point to the HBase config directory.

1) Update or add the following configs to hbase-site.xml (rendered as XML property entries after this list):


--> phoenix.query.timeoutMs=1800000
--> hbase.regionserver.lease.period = 1200000
--> hbase.rpc.timeout = 1200000
--> hbase.client.scanner.caching = 1000
--> hbase.client.scanner.timeout.period = 1200000
2) Restart Hbase services to make these changes effective.
3) export HBASE_CONF_PATH=/etc/hbase/conf
4) Launch sqlline.py
5) run the same query that is failing
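For reference, the same settings rendered as hbase-site.xml property entries (values taken from the list above; adjust to your cluster):

<property><name>phoenix.query.timeoutMs</name><value>1800000</value></property>
<property><name>hbase.regionserver.lease.period</name><value>1200000</value></property>
<property><name>hbase.rpc.timeout</name><value>1200000</value></property>
<property><name>hbase.client.scanner.caching</name><value>1000</value></property>
<property><name>hbase.client.scanner.timeout.period</name><value>1200000</value></property>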

2. Running join queries


Issue: Size of hash cache (104857608 bytes) exceeds the maximum allowed size (104857600 bytes) [100 MB]
Root Cause: The cache size for processing the data is not sufficient
Resolution: Need to increase the buffer size of the hash cache (see the note below)
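Note (an assumption to verify against your Phoenix version, not taken from the original sheet): the 100 MB ceiling in the error message matches the default of Phoenix's phoenix.query.maxServerCacheBytes, so raising that property in the hbase-site.xml used by the Phoenix client is the usual fix, for example:

<property><name>phoenix.query.maxServerCacheBytes</name><value>209715200</value></property>  <!-- 200 MB, up from the 100 MB default -->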
3. Unable to load RDZ data

> We need to load RDZ data into Phoenix for doing the RDZ against EOS comparison
> Phoenix supports importing files only in .csv format
> Any tables that are created in Phoenix are also created in HBASE
> Hence, we always need to append a primary key to the RDZ file (a minimal sketch is shown after this list)
> This primary key will be taken as the row_key when the data is inserted into HBASE
> When we load the data using the below two options, we are getting errors
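A minimal sketch of appending a synthetic primary key, assuming the RDZ extract is a delimited text file on HDFS and a sequence number is acceptable as the key (paths are placeholders):

// Prepend a synthetic key to each RDZ record so Phoenix has a row key at load time
val rdz = sc.textFile("/user/jmichael2/test/rdz_extract.csv")              // placeholder input path
val withKey = rdz.zipWithIndex.map { case (line, idx) => s"$idx,$line" }   // key becomes the first column
withKey.coalesce(1).saveAsTextFile("/user/jmichael2/test/rdz_with_key")    // feed this output to psql/CsvBulkLoadTool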

Option 1: Using phoenix-psql method


phoenix-psql -t Sample_edms_qa localhost /home/METNET/jmichael2/ONETIME_UDS_DPA_Conversion_20180119011722.csv

Root cause & Resolution: Unable to find


Option 2: Using phoenix jar and csvBulkLoadTool
hadoop jar /usr/iop/4.2.5.0-0000/phoenix/phoenix-4.8.1-HBase-1.2.0-IBM-21-client.jar org.apache.phoenix.mapreduce.CsvBulkLoadTool -Dfs.permissions.umask-mode=000 --table Sample_edms_qa --input user/jmichael2/test/ONETIME_UDS_DPA_Conversion_trial.csv

Root Cause:
This happens when the user has an incorrect value defined for "zookeeper.znode.parent" in the hbase-site.xml sourced on the client side, or, in the case of a custom API, the "zookeeper.znode.parent" was incorrectly updated to a wrong location.
For example, the default "zookeeper.znode.parent" is set to "/hbase-unsecure", but if you incorrectly specify it as, say, "/hbase" as opposed to what is set up in the cluster, you will encounter this exception while trying to connect to the HBase cluster.

Resolution:
The solution here is to update the hbase-site.xml / source the same hbase-site.xml from the cluster, or update the HBase API to correctly point to the "zookeeper.znode.parent" value as set in the HBase cluster (see the illustrative entry below).
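For illustration, the client-side hbase-site.xml entry that must match the cluster (the value shown is the default mentioned above):

<property><name>zookeeper.znode.parent</name><value>/hbase-unsecure</value></property>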


1. Timeout error during huge count tables
Cause:
The above error (java.util.concurrent.TimeoutException: Futures timed out after [300 seconds]) suggests that there has been a timeout.
The stack shows that the query plan calls broadcast joins; awaitResult has a default timeout value of 300 seconds for the broadcast wait time in broadcast joins.
The above error is displayed when this default timeout value is exceeded.
Solution:
To resolve this issue, increase the default value of 300 for spark.sql.broadcastTimeout to 1200 (a minimal way to apply this is shown below).
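A minimal way to apply this from a Spark shell session (it can also be passed on spark-submit as --conf spark.sql.broadcastTimeout=1200):

// Raise the broadcast wait time from the 300-second default before re-running the count
sqlContext.setConf("spark.sql.broadcastTimeout", "1200")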
