iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://doi.org/10.1007/978-3-319-98379-0_6
Simulated Domain-Specific Provenance | SpringerLink
Skip to main content

Simulated Domain-Specific Provenance

  • Conference paper
  • First Online:
Provenance and Annotation of Data and Processes (IPAW 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11017))

Included in the following conference series:

  • 738 Accesses

Abstract

The main driver for provenance adoption is the need to collect and understand knowledge about the processes and data that occur in some environment. Before analytical and storage tools can be designed to address this challenge, exemplar data is required both to prototype the analytical techniques and to design infrastructure solutions. Previous attempts to address this requirement have tried to use existing applications as a source; either by collecting data from provenance-enabled applications or by building tools that can extract provenance from the logs of other applications. However, provenance sourced this way can be one-sided, exhibiting only certain patterns, or exhibit correlations or trends present only at the time of collection, and so may be of limited use in other contexts. A better approach is to use a simulator that conforms to explicitly specified domain constraints, and generate provenance data synthetically, replicating the patterns, rules and trends present within the target domain; we describe such a constraint-based simulator here. At the heart of our approach are templates - abstract, reusable provenance patterns within a domain that may be instantiated by concrete substitutions. Domain constraints are configurable and solved using a Constraint Satisfaction Problem solver to produce viable substitutions. Workflows are represented by sequences of templates using probabilistic automata. The simulator is fully integrated within our template-based provenance server architecture, and we illustrate its use in the context of a clinical trials software infrastructure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. David Allen, M., Chapman, A., Blaustein, B.: Engineering choices for open world provenance. In: Ludäscher, B., Plale, B. (eds.) IPAW 2014. LNCS, vol. 8628, pp. 242–253. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16462-5_25

    Chapter  Google Scholar 

  2. Belhajjame, K., Chapman, A.: 2nd ProvBench: Benchmarking Provenance Management Systems (2014). https://sites.google.com/site/provbench/home/provbench-provenance-week-2014

  3. Belhajjame, K., Zhao, J.: 1st ProvBench: Benchmarking Provenance Management Systems. In: Proceedings of the Joint EDBT/ICDT 2013 Workshops (2013)

    Google Scholar 

  4. Belhajjame, K., Zhao, J., Garijo, D., et al.: A workflow PROV-corpus based on Taverna and Wings. In: Proceedings of the Joint EDBT/ICDT 2013 Workshops, ProvBench: Provenance Benchmark Challenge (2013)

    Google Scholar 

  5. Curcin, V., Fairweather, E., Danger, R., et al.: Templates as a method for implementing data provenance in decision support systems. J. Biomed. Inform. 65 (2017)

    Google Scholar 

  6. Delaney, B., Curcin, V., Andreasson, A., et al.: Translational medicine and patient safety in Europe: TRANSFoRm - Architecture for the Learning Health System in Europe. Biomed Research Int., special edition on Improving Performance of Clinical Research: Development and Interest of Electronic Health Records (2015)

    Google Scholar 

  7. Fairweather, E., Alper, P., Porat, T., et al.: Architecture for Building Provenance Documents using Templates (2017). https://elliot.fairweather.eu/resources/ArchProvTemp.pdf

  8. Firth, H., Missier, P.: ProvGen: generating synthetic PROV graphs with predictable structure. In: Ludäscher, B., Plale, B. (eds.) IPAW 2014. LNCS, vol. 8628, pp. 16–27. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16462-5_2

    Chapter  Google Scholar 

  9. Gehani, A., Tariq, D.: Cross-platform provenance. In: Proceedings of the Joint EDBT/ICDT 2013 Workshops (2013)

    Google Scholar 

  10. Houkjær, K., Torp, K., Wind, R.: Simple and realistic data generation. In: Proceedings of the 32nd International Conference on Very Large Data Bases (2006)

    Google Scholar 

  11. Khalek, S. A., Elkarablieh, B., Laleye, Y. O. et al.: Query-aware test generation using a relational constraint solver. In: Proceedings of the 2008 23rd IEEE/ACM International Conference on Automated Software Engineering (2008)

    Google Scholar 

  12. Knublauch, H., Kontokostas, D.: Shapes constraint language (SHACL). Technical report, W3C (2017)

    Google Scholar 

  13. Leskovec, J., Chakrabarti, D., Kleinberg, J., Faloutsos, C.: Realistic, mathematically tractable graph generation and evolution, using kronecker multiplication. In: Jorge, A.M., Torgo, L., Brazdil, P., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 133–145. Springer, Heidelberg (2005). https://doi.org/10.1007/11564126_17

    Chapter  Google Scholar 

  14. Michaelides, D., Huynh, T. D., Moreau, L.: PROV-TEMPLATE: A template system for PROV documents (2014). https://provenance.ecs.soton.ac.uk/prov-template/

  15. Prud’homme, C., Fages, J.-G., Lorca, X.: Choco Documentation (2016)

    Google Scholar 

  16. Rossi, F., van Beek, P., Walsh, T.: Handbook of Constraint Programming (Foundations of Artificial Intelligence). Elsevier Science Inc. (2006)

    Google Scholar 

  17. Shuai, H.-H., Yang, D.-N., Yu, P.S., et al.: On Pattern Preserving Graph Generation. In: 2013 IEEE 13th International Conference on Data Mining (2013)

    Google Scholar 

  18. Soltana, G., Sannier, N., Sabetzadeh, M., et al.: A model-based framework for probabilistic simulation of legal policies. In: 18th ACM/IEEE International Conference on Model Driven Engineering Languages and Systems (MoDELS 2015) (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Elliot Fairweather .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Alper, P., Fairweather, E., Curcin, V. (2018). Simulated Domain-Specific Provenance. In: Belhajjame, K., Gehani, A., Alper, P. (eds) Provenance and Annotation of Data and Processes. IPAW 2018. Lecture Notes in Computer Science(), vol 11017. Springer, Cham. https://doi.org/10.1007/978-3-319-98379-0_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-98379-0_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-98378-3

  • Online ISBN: 978-3-319-98379-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics