Python Interpreter for parsing html/xml in zepplin #9

mkscala · 2016-04-13T17:35:33Z

I wanted to use the Python APIs like BeautifulSoup, how can we use external python api's along with pyspark
http://omz-software.com/pythonista/docs/ios/beautifulsoup_guide.html
http://apache-spark-user-list.1001560.n3.nabble.com/How-to-consider-HTML-files-in-Spark-td22017.html
https://pypi.python.org/pypi/beautifulsoup4

dylanmei · 2016-04-13T17:54:05Z

%sh pip install beautifulsoup4

I don't know any details about actually using it effectively with Spark. Spark has a very active mailing list and freenode #apache-spark IRC channel which I'm sure will yield better tips.

mkscala · 2016-04-13T19:31:08Z

I get the below error
Process exited with an error: 127 (Exit value: 127)

mkscala · 2016-04-13T19:36:30Z

how to add the basic python interpreter ? have you tried ? within this docker-zepplin?

dylanmei · 2016-04-13T20:16:53Z

There is only the pyspark interpreter. Perhaps you'd get more mileage with this: https://github.com/jupyter/docker-stacks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python Interpreter for parsing html/xml in zepplin #9

Python Interpreter for parsing html/xml in zepplin #9

mkscala commented Apr 13, 2016

dylanmei commented Apr 13, 2016

mkscala commented Apr 13, 2016

mkscala commented Apr 13, 2016

dylanmei commented Apr 13, 2016

Python Interpreter for parsing html/xml in zepplin #9

Python Interpreter for parsing html/xml in zepplin #9

Comments

mkscala commented Apr 13, 2016

dylanmei commented Apr 13, 2016

mkscala commented Apr 13, 2016

mkscala commented Apr 13, 2016

dylanmei commented Apr 13, 2016