-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fst_table as a serious class #236
Comments
Hi @hope-data-science , thanks for the feature request! Having more options to manipulate and view characteristics of the offline dataset would be very useful indeed. But those can be better served in separate So For example,
Downstream packages could use these features to facilitate their own API's and provide functionality like offline sorting, partial loading, etc... |
I am not so familiar with the implementations underneath, what you mention as "low-level abilities" are acutually quite "high-level" to me. If these abilities could be done in How to access data more efficiently from fst file? How to subset data more flexibly (by group? filter? slice? select?[I think I've handled this part in some way] )? I did make a function named BTW: A small problem, I am tring to get the zero row of fst table but failed. In data.frame or data.table, you can get Thanks! |
Hi @hope-data-science, you're right, # identical
x[1, ]
#> X Y
#> 1 1 2
fst_table[1, ]
#> X Y
#> 1 1 2
# not identical
x[0, ]
#> [1] X Y
#> <0 rows> (or 0-length row.names)
fst_table[0, ]
#> Error in read_fst(meta_info$path, from = min_row, to = max_row): Parameter 'from' should have a numerical value equal or larger than 1. thanks for pointing that out, I'll schedule a fix for the next release! |
added as a separate issue |
I've designed a new tool to work with fst, which is considered to be more memory efficient. |
I find fst_table a very useful class, do not have to read the file physically but could get enough information to know how to process. Perhaps there could be more methods to deal on it, e.g.
is.fst.table
,path.fst.table
,summary.fst.table
, etc. I think this is going to be popular in big data analysis in R.The text was updated successfully, but these errors were encountered: