Pages
- Manual Testing Tutorials
- Manual Testing Materials
- Manual Testing Interview Q & A
- ISTQB
- UNIX /Linux
- SQL
- Agile Methodology
- Selenium with Java
- Selenium with Python
- Automation Testing Materials
- API Testing
- Advanced Java
- Cypress Tutorials
- ETL Testing Documents
- ETL Testing videos
- Big Data Hadoop
- SDET Essentials
- Miscellaneous Topics
- Career Guidance
- Mock Interviews
- Resume Templates
- YouTube Videos
- Online Training
- Udemy Courses
Streaming data into Hadoop using Apache Flume
Flume:Flume a hadoop echo system s/w used for streaming the logs file from applications int o HDFS.
In this post let's discuuss about following topics.
- Overview on Flume
- Streaming log files data into HDFS
- Streaming Twitter App logs into HDFS
Overview on Apache Pig
What is pig?
- Implemented by Yahoo.
- Pig Hadoop echo system s/w from apache foundation used for analysing the data.
- Pig uses pig latin language.
- Data flow language.
- handle structured, semi-structured and un-structured
- Replacement of mapreduce(not 100%)
- Pig internally uses MapReduce.
Apache Hive UDF'S (User Defined Functions)
- Creating UDF
- How to packaging UDF(creating jar file)
- Add jar file in to hive
- Test UDF
Steps to create and test UDF's
1) Implement the code for UDF in Java
2) Package java class into jar file copy in some location
3) Add jar file in to Hive CLI
4) Create temporary function in hive
5) Use hive UDF BY using Query.
Prerequiste: Table should have some data.
Problem statement-1
Find the maximum marks obtained out of four subject by an student.
Package java class into jar file copy in some location.
SELECT CLASS IN ECLIPSE-->RIGHT-->EXPORT-->JAVA-->JAR--> BROWSE THE LOCATION-->PROFILE FILENAME WITH .JAR Extension.
Add jar file in to Hive CLI
hive> add jar /home/cloudera/training/HiveUDFS/getMaxMarks.jar;
Create temporary function in hive
hive> create temporary function getmaxmarks as 'udfs.GetMaxMarks';
Use hive UDF BY using Query
hive> select getmaxmarks(10,20,30,40) from dummy; // sanity test
There are 2 types of UDF'S
1) Regular UDF( UDF) ---> Applied on more number of rows in a table
2) User Defined aggregate function (UDAF) --> Group of result sets.
Problem statement-2: Find the mean of marks obtained in maths by all the students.
Package java class into jar file copy in some location
Right click onth package-->export-->java-->provide jar file name.
Add jar file in to Hive CLI
hive> add jar /home/cloudera/training/HiveUDFS/getMeanMarks.jar;
Create temporary function in hive
hive> create temporary function getmeanmarks as 'udaf.GetMeanMarks';
Use functions with queries
hive> select getmeanmarks(social)from t_student_record;
Subscribe to:
Posts (Atom)
Popular Posts
- How To Explain Project In Interview Freshers and Experienced
- Selenium Frequently Asked Questions & Answers Part-6
- API/Webservices Testing using RestAssured (Part 1)
- How to use HashMap in Selenium WebDriver
- Java Programs for Selenium
- Manual & Automation Testing Free Video Tutorials | YouTube Playlists
- Manual Testing Interview Questions & Answers-PART1
- ETL Test Scenarios and Test Cases
- Python Interview Questions and Answers Part-1
Followers
Labels
a Software Tester or a Developer?
(1)
Adhoc Testing
(1)
Agile
(33)
Agile Team
(1)
Agile Testing
(2)
apache poi
(1)
Appium
(1)
Appium FAQ'S
(1)
Banking Domain
(1)
Core Java scripts
(4)
Cross-browser Web Testing
(1)
How to use Java Collections
(1)
ISTQB
(1)
ISTQB Sample Question Paper
(10)
Java
(5)
Java Interview Questions
(2)
Java Programs for Selenium
(1)
Jira
(1)
Linux
(7)
Manual Testing
(48)
Manual Testing Interview Questions
(2)
Maven Questions & Answers
(1)
Mobile Application
(1)
Mobile application testing
(1)
Mobile Application Types
(1)
Mobile Testing
(2)
NoSQL
(1)
ORACLE
(9)
PL/SQL
(1)
Scrum
(1)
SDLC
(33)
Selenium
(6)
Selenium Common Exceptions
(1)
Selenium FAQ
(5)
Selenium FAQ's
(1)
Selenium Grid
(1)
Selenium Interview Questions
(1)
Set Career Goals
(1)
Shell Scripting
(6)
Skills Required for Software Tester
(1)
Software Testing
(43)
Sprint
(1)
SQL
(11)
STLC
(33)
T-SQL
(1)
Testing E-commerce Websites
(1)
Testing Life Cycle
(33)
Testing process
(8)
TestNG
(7)
TestNG Questions & Answers
(1)
TestNG Assertions
(1)
TestNG Scripts
(4)
VI Editor
(6)
Web Services Testing
(1)
Web Testing
(1)
WebDriver
(6)
Webdriver Questions & Answers
(1)
Webservices API Testing
(1)
Writing Good Agile User Stories
(1)
WwebDriver Scripts
(4)