High level API redesign #123
Comments
This is a good idea. Another idea is that,
I kind of started to like these abbreviations after getting familiar with fastai. The learning curve is the same since you have to check docstrings anyway, but with shorter names the code is smaller.
I suggested something similar in #69
I want to move away from the schema, mostly because schema-based features are now all mixed together between Arche, Matt and the standard schema.
I am thinking about fastai-like parameters https://docs.fast.ai/tabular.data.html#TabularList:
a = Arche(data, cat_names=["size"], cont_names=["price"], uniques=["id", ("url", "title", "price")])
So then duplicates will use `uniques`, i.e. check that all `id` values are unique and that all rows have a unique `url` and `title`.
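To make the `uniques` idea concrete, here is a minimal sketch of what such a check could do. It is not Arche code: `find_duplicates` is a hypothetical helper, the rows are made-up sample data, and it assumes that a string entry means "this field must be unique" while a tuple entry means "this combination of fields must be unique".

```python
from collections import Counter


def find_duplicates(rows, uniques):
    """Hypothetical helper: return {key: [duplicated values]} per `uniques` entry.

    Assumption: a string entry is checked as a single field, a tuple entry
    is checked as a combination of fields.
    """
    report = {}
    for key in uniques:
        fields = (key,) if isinstance(key, str) else tuple(key)
        # Count how often each value (or combination of values) occurs.
        counts = Counter(tuple(row.get(f) for f in fields) for row in rows)
        dupes = [values for values, n in counts.items() if n > 1]
        if dupes:
            report[key] = dupes
    return report


# Made-up sample items, only for illustration.
rows = [
    {"id": 1, "url": "a.com/1", "title": "Shoes", "price": 10.0},
    {"id": 2, "url": "a.com/1", "title": "Shoes", "price": 10.0},
    {"id": 2, "url": "a.com/3", "title": "Hat", "price": 5.0},
]
duplicates = find_duplicates(rows, ["id", ("url", "title", "price")])
```

With this reading, a single list covers both the per-field and the combined-field duplicate checks, so no separate schema tags would be needed for them.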
Categories will use `cat_names`.
`cont_names` is just an example, but it can be used to identify numerical data and then plot stats such as deviation, percentiles and the like.
Thoughts?
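As a rough illustration of what `cont_names` could drive, the sketch below computes a few summary stats per continuous field using only the standard library. `describe_continuous` is a hypothetical name, the rows are made-up sample data, and the choice of stats (mean, standard deviation, quartiles) is an assumption, not a decided design.

```python
import statistics


def describe_continuous(rows, cont_names):
    """Hypothetical helper: basic stats for each field listed in `cont_names`."""
    stats = {}
    for name in cont_names:
        # Skip rows where the field is missing.
        values = [row[name] for row in rows if row.get(name) is not None]
        stats[name] = {
            "mean": statistics.mean(values),
            "stdev": statistics.stdev(values),
            # Three cut points: 25th, 50th and 75th percentiles.
            "quartiles": statistics.quantiles(values, n=4),
        }
    return stats


# Made-up sample items, only for illustration.
rows = [
    {"id": 1, "price": 10.0},
    {"id": 2, "price": 12.5},
    {"id": 3, "price": 9.0},
    {"id": 4, "price": 30.0},
]
price_stats = describe_continuous(rows, ["price"])
```

The resulting numbers could then be fed to whatever plotting the report already does.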
@ejulio @raphapassini @victor-torres @alexander-matsievsky