-
Notifications
You must be signed in to change notification settings - Fork 370
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deprecate DataArrays in favor of NullableArrays #994
Comments
I have done some work already, I'll push a branch soon. I think it's indeed not that complicated, as you said. I think one of the things we absolutely need to do in time for 0.5 is JuliaStats/NullableArrays.jl#111. I'd certainly appreciate some help. |
I've pushed the current state of my work to the That's really a rough port at this point, but I think it shows we're not that far. |
Good news: tests all pass now, except those which are already broken on master. The code now needs a lot of cleanup, in particular we need to implement the missing features in NullableArrays. I'll make a detailed list soon, but you can start having a look at the branch to see how it feels. |
@nalimilan it seems as though there are some methods that ought to go into Base, too --- e.g. the |
Yes. Most of these should go into Base, and I'll move the most hacky to |
I agree that they should be useful in many cases. They also seem sensible. What sort of bugs do you think they would hide? Just unexpected behavior? |
Not sure, maybe I'm paranoid. |
My original goal in not defining them was to avoid encouraging people to think about |
The branch has been merged now (#1008). |
I believe NullableArrays has reached a level of maturity where we should seriously consider defaulting to them instead of DataArrays.
Unfortunately, I'm not familiar with the far-reaching ramifications of the switch, so my naive suggestion is to swap it out and see what breaks and start fixing. I'd appreciate some more mature insight on what the potential breakage/things to deal with are. AFAIK, DataFrames should be fairly robust since we've already abstracted DataArrays out on its own.
My long-term plans are to deprecate
Data.Table
in theDataStreams.jl
package in favor of supporting returning a DataFrame as the defactor in-memory Julia table structure in the DataStreams ecosystem (CSV, ODBC, SQLite, Feather).I'm happy to push forward on this if there aren't any red flags/major roadblocks.
The text was updated successfully, but these errors were encountered: