If you use Spark Connect, e.g. in Databricks, DataFrames are of type pyspark.sql.connect.dataframe.DataFrame.
When df is of type pyspark.sql.connect.dataframe.DataFrame, the call
a = ProfileReport(df)
raises
TypeCheckError: argument "df" (pyspark.sql.connect.dataframe.DataFrame) did not match any element in the union:
pandas.core.frame.DataFrame: is not an instance of pandas.core.frame.DataFrame
pyspark.sql.dataframe.DataFrame: is not an instance of pyspark.sql.dataframe.DataFrame
NoneType: is not an instance of NoneType
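The traceback suggests that the argument check accepts only three types, and that the Spark Connect class is not a subclass of pyspark.sql.dataframe.DataFrame in the PySpark version used here. As a rough illustration of a check that tolerates both Spark flavours, the sketch below compares fully qualified class names instead of relying on isinstance. The helper is_spark_dataframe and the stand-in class are hypothetical and are not part of ydata-profiling or PySpark; the stand-in exists only so the snippet runs without a Spark installation.

```python
from typing import Any

# Fully qualified class names taken from the traceback above.
SPARK_DATAFRAME_TYPES = {
    "pyspark.sql.dataframe.DataFrame",
    "pyspark.sql.connect.dataframe.DataFrame",
}


def is_spark_dataframe(obj: Any) -> bool:
    """Hypothetical helper: True if obj's class (or any base class) is a
    classic or Spark Connect DataFrame, matched by qualified name."""
    return any(
        f"{cls.__module__}.{cls.__qualname__}" in SPARK_DATAFRAME_TYPES
        for cls in type(obj).__mro__
    )


# Stand-in for the real Spark Connect class (assumption, for illustration only).
FakeConnectDataFrame = type(
    "DataFrame", (), {"__module__": "pyspark.sql.connect.dataframe"}
)

print(is_spark_dataframe(FakeConnectDataFrame()))  # stand-in Connect DataFrame
print(is_spark_dataframe([1, 2, 3]))               # plain list
```

Matching by name avoids importing pyspark.sql.connect, which may not be installed everywhere. A fix inside the library could instead add the Connect class to the accepted union behind an optional import.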
Current Behaviour
If you use Spark Connect, e.g. in Databricks, DataFrames are of type pyspark.sql.connect.dataframe.DataFrame.
When df is of that type, a = ProfileReport(df) raises the TypeCheckError shown above.
Related:
https://community.databricks.com/t5/data-engineering/pyspark-sql-connect-dataframe-dataframe-vs-pyspark-sql-dataframe/td-p/71055
Expected Behaviour
The type checking should allow pyspark.sql.connect.dataframe.DataFrame as well.
Data Description
N/A
Code that reproduces the bug
pandas-profiling version
v4.12.0
Dependencies
OS
No response
Checklist