_rowaddr and _rowid not exposed for `merge`? #3251

oceanusxiv · 2024-12-16T03:39:22Z

I have a simple use case: I want to pull down all rows of a column from a Lance dataset, do some custom processing locally, and add the resulting rows (same length) directly back to the same Lance dataset. (The local processing involves a windowing operation, which is why the straight SQL update syntax won't work.)

It looks like the most supported way to do this is via `merge`, where I can feed in a precomputed data frame and join on index columns. This is great and all, but I couldn't figure out how to do it without manually generating a custom index column that ends up being basically the same as `_rowaddr` anyway, except actually materialized in the dataset instead of being a meta column.

It seems strange to me that an operation as simple as adding a column with the same row count as the existing dataset is so complicated, but if `_rowaddr` were at least exposed to merge operations, then it would save the inconvenience of manually generating a redundant row index column.


chenkovsky · 2024-12-16T14:30:09Z

Is #3254 what you want? I'm facing a similar problem. @oceanusxiv

oceanusxiv · 2024-12-16T14:42:28Z

@chenkovsky Oh wow, that was quick. Yes, that's what I want. I didn't realize the row address was already being read anyway, so it's just an interface limitation.
