Skip to content

ayanmukhuty/sparktest

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

neurolab-hadoop-spark

image

1. Validate if hadoop is working or not

Open terminal in vscode and execute below command

  1. Create a sample.txt file with content "This is hadoop and spark lab."
echo "This is hadoop and spark lab">sample.txt
  1. Upload your sample.txt file to Hadoop file system
hdfs dfs -put sample.txt /
  1. List uploaded file in hadoop
hdfs dfs -ls /

2. Validate if pyspark is working or not

Execute below command

pyspark

Above command will launch pyspark terminal and you can execute pyspark command in interactive shell.

Close your terminal

3. Now let's run a script file "demo.py"

Execute the spark script.

python demo.py

URL to view data in UI:

Spark Master - <your_app_name>.ineuron.app:8080

History Server - <your_app_name>.ineuron.app:18080

Spark Worker Node - <your_app_name>.ineuron.app:8081

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages