Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Running in DataBricks #17

Open
lordoetl opened this issue Aug 21, 2018 · 1 comment
Open

Running in DataBricks #17

lordoetl opened this issue Aug 21, 2018 · 1 comment

Comments

@lordoetl
Copy link

lordoetl commented Aug 21, 2018

I have loaded a dataframe and when I run the command
profile = spark_df_profiling.ProfileReport(df)

I get the following error:

pycache not bottom-level directory

I have confirmed the df is loaded and looking good (it is very large if that matters), not sure where to go next with this, suggestions?

It occurred to me that I was running on a serverless cluster so tried your example code on a Standard just to make sure that wasn't it:

Ran:
import spark_df_profiling
df = sqlContext.createDataFrame([["2",True,None,"8"],
["2",False,None,"8"],
["2",True,"5","7"]], ["a","b","c","d"])
rep = spark_df_profiling.ProfileReport(df)
displayHTML(rep.html)

Error:
pycache not bottom-level directory in ....

@mparkhe
Copy link
Contributor

mparkhe commented Dec 20, 2018

Hey Folks,

I created a PR #22 to display renderable HTML in Databricks notebook.

mani

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants