Skip to content

Organize datasets #247

@nmprista

Description

@nmprista

this post will be updated as work evolves

We presently have many datasets, some in data, others in tests, not all of them used. This complicates updates when RDBES is upgraded. There is not a clear path to know what needs done and by what order

We decide to work with an API in mind [the API will facilitate steps 1 and 2 below:

  1. text book and other examples need to be uploaded to RDBES (also so the data is validated) under ZW
  2. ZW data will be downloaded to data-raw.
  3. A function will be placed in data-raw that splits the ZW zip into the data used in the package.
  4. some data used in tests\testthat will be organized into hierarchy folders in the test folder - these will be mostly csv and zips to test the importing functions - and be named only as H1, H2, ... so paths inside the tests do not have to be updated.
  5. most data will be placed in the data directory - most tests will source from there and so will all vignettes
  6. data and files that appear not to be used, will be moved to data-raw/old

Sub-issues

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions