Separate packages for handling DB connections and SQL? #72

krlmlr · 2016-01-04T21:02:08Z

Quoting Paul Gilbert's post to R-sig-DB (the next two paragraphs):

[Splitting DBI] might make a lot of sense if you ever want to standardize in layers, for example, if you ever wanted NoSQL to be a possible replacement for SQL.

There are different reasons for wanting separate packages, but the important one in my mind may not be the one you are thinking about: The classes, and the generic methods dbConnect, and dbDisconnect should all be extremely stable. On the other hand, the SQL part is likely to go through some changes. For sake of discussion let me call the two packages DBIclasses and DBIsql. If you make a change in DBIsql my packages TSsdmx, TSmisc, and some others, will not be in the upstream dependencies, and do not need to be tested for a CRAN submission of DBIsql. If DBIclasses and DBIsql are in the one package, DBI, then these packages do need to be checked (not just by me but also by you if you make an API change and intend to submit to CRAN). These packages in turn have a large number of dependencies which can change from time to time on their own. Thus things may be broken for reasons having nothing to do with your changes, and are beyond your control. Then the CRAN checks will fail and your submission will be rejected, or at least require considerable additional work. So, it is advisable to avoid having dependencies that really can be avoided.

If such a split is implemented, probably only the Result class (with all its methods) and the Connection methods dbSendQuery() and dbGetQuery() would currently end up in the DBIsql package. I'm not sure what this means for backward compatibility. An advantage would be that such a DBIsql package could host an array of other methods, for example for creating indexes or views.
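To make the proposed layering concrete, here is a rough, language-neutral sketch in Python rather than R. It is not DBI's actual API: the class and method names (`Connection`, `SqlConnection`, `Result`, `InMemoryConnection`, `send_query`, `get_query`) are hypothetical stand-ins for what a DBIclasses/DBIsql split might look like, and the "backend" is a toy that only understands one statement shape.

```python
from abc import ABC, abstractmethod


# --- Hypothetical "DBIclasses" layer: stable, knows nothing about SQL ---
class Connection(ABC):
    """Minimal connection contract, loosely analogous to dbDisconnect()."""

    @abstractmethod
    def disconnect(self) -> None: ...


# --- Hypothetical "DBIsql" layer: free to evolve, depends on the layer above ---
class Result:
    """Opaque result handle; callers only fetch rows from it."""

    def __init__(self, rows):
        self._rows = list(rows)

    def fetch(self):
        return list(self._rows)


class SqlConnection(Connection):
    """Adds the query methods that would move into the SQL package."""

    @abstractmethod
    def send_query(self, statement: str) -> Result: ...

    def get_query(self, statement: str):
        # Convenience wrapper: send the query and fetch everything at once.
        return self.send_query(statement).fetch()


# --- Toy backend (an assumption, used only to exercise the sketch) ---
class InMemoryConnection(SqlConnection):
    def __init__(self, tables):
        self._tables = tables
        self.open = True

    def disconnect(self) -> None:
        self.open = False

    def send_query(self, statement: str) -> Result:
        # Toy "parser": only understands "SELECT * FROM <table>".
        table = statement.split()[-1]
        return Result(self._tables[table])
```

In this arrangement a package that only needs connections would depend on the first layer alone, so changes to `SqlConnection` or `Result` would not put it in the reverse-dependency set.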

A way around the reverse dependency nightmare outlined by Paul is to design DBI around the open/closed principle -- open for extension, closed for modification. This requires very careful design, but looks doable.
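As an illustration of what "open for extension, closed for modification" could mean here, the following Python sketch uses `functools.singledispatch`, which behaves roughly like a generic with methods registered per class. Everything in it is an assumption for illustration: the `Connection` class, its `execute()` method, and the `create_index()` generic are hypothetical and not part of DBI.

```python
from functools import singledispatch


# "Core" package: defines the connection class and is not edited again.
class Connection:
    def __init__(self):
        self.log = []  # records statements, standing in for a real backend

    def execute(self, statement: str) -> None:
        self.log.append(statement)


# "Extension" package: adds a new generic without touching the core.
@singledispatch
def create_index(conn, table: str, column: str) -> None:
    # Fallback for connection types that have no registered method.
    raise NotImplementedError(
        f"create_index not supported for {type(conn).__name__}"
    )


@create_index.register
def _(conn: Connection, table: str, column: str) -> None:
    # Method for the core Connection class, registered from outside the core.
    conn.execute(f"CREATE INDEX idx_{table}_{column} ON {table} ({column})")
```

New operations are added by defining new generics and registering methods, so existing packages that depend only on the core never see an API change.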

hannes · 2016-01-06T08:11:09Z

And then the Result class would only be used once, to fetch its contents into a data frame... Also, don't NoSQL systems support some sort of query language as well? The result object is already opaque w.r.t. its contents.

krlmlr · 2016-01-13T21:48:29Z

Separate tests into no-SQL and SQL parts
Specify extension strategy for DBI

krlmlr · 2019-08-23T07:14:33Z

SQL is wired fairly deeply into DBI and DBItest.

krlmlr added the action:design label Jan 4, 2016

nbenn mentioned this issue Oct 24, 2017

Think about splitting dbWriteTable() #74

Closed

krlmlr closed this as completed Aug 23, 2019

github-actions bot locked and limited conversation to collaborators Oct 9, 2020
