Call for Papers: Data Science and Reproducibility

Systems researchers increasingly rely on large datasets to carry out their experiments and prove their hypotheses. Whether is data obtained for training ML models, data that comes from monitoring production systems, or data that is produced by large-scale (or long-running) experiments; managing the lifecycle of experiments and their associated data is vital when looking from the point of view of reproducibility. In this context, the "Data Science and Reproducibility" area of JSys welcomes contributions that address a wide variety of topics in the area of reproducibility in the research, design, development, and operation of data-intensive workloads and systems.

Topics of Interests

  • Experiment dependency management.
  • Data versioning and preservation.
  • Provenance of data-intensive experiments.
  • Tools and techniques for incorporating provenance into publications.
  • Automated experiment execution and validation.
  • Experiment portability for code, performance, and related metrics.
  • Experiment discoverability for re-use.
  • Cost-benefit analysis frameworks for reproducibility.
  • Usability and adaptability of reproducibility frameworks into already-established domain-specific tools.
  • Long-term artifact archiving for future reproducibility.
  • Reproducibility-aware computational infrastructure.

If you are unsure whether your work is a good fit for this area, please send a short abstract or description to the area chair(s); they will be happy to give some initial feedback.

Area Board

The Data Science and Reproducibility area is chaired by

and the reviewers are:

Former Members