Abstract
CQL is an open-source query and data integration scripting language that can be applied to common challenges in the field of computational science. It can preserve structure as it moves data from one database to another, and thus allows users who share their data to be sure only the correct subset of their data will be used by others who draw from it. This feature of CQL migrations allows those who draw from public databases to be sure they will get the expected results only if they meet a certain specification. We argue some open problems in the area of managing scientific datasets could benefit from this paradigm.