Biodiversity Atlas Sweden Uppsala April 12, 9:00 - 16:45 Open Development
April, 2018
Biodiversity Atlas Sweden Uppsala April 12, 9:00 - 16:45 Open Development
Examples of web-friendly frontends for geospatial analysis available under Open Source Licenses:
Not only about code, data and systems - actually mostly about "People and Processes" that collaborate rather than compete, sharing data and code. To enable creativity and innovation through freedom and rights to make changes provided through Open Source Licenses.
Estonia: Uses FOSS solutions as much as possible across the board for tax payer financed software - a gov decision, so required to use it by law, reaches all the way out even consultants through procurement rules
Economic rationale:
Open | Non-Open |
---|---|
Public / government / research - data is common good | Private / corporate / business - data is secret and a competitive advantage |
Source code lives on GitHub and apps on Docker Hub -> "Look Under The Hood" Transparency | Source code lives on Internal Network with no external public access -> "Walled Garden" with "Trust Me Blindly" Opaqueness |
FOSS-licenses set at inception | Non-FOSS licenses or Not (Yet) Known |
Avoids lock-ins | Proud over Lock-ins - "we got great rebates from vendors" |
Polyglot - tolerance for any languages running on FOSS stack | Monoculture - dictates use of a specific language |
Open | Non-Open |
---|---|
Modules - independent units loosely coupled | Monorepos - tight dependencies |
Decentralized Distributed Micro-Services (nodes with in-built redundancy) | Centralized (one-stop-shop monolithic website with SPOF, Single Point of Failure and Single Sign On loginwall) |
Portable, immutable infrastructure - Runs on laptop/cloud/anywhere | Stateful Servers (often with vendor lock-in) - Runs only on specific servers |
Local build possible from source | "Depend on and Trust this EXE or binary blob" |
Agile - developed with "diffs" from current state with continuous delivery - Bazaar Building Style adapting quickly to changes in environment | Waterfall - Five-Year Long Central Plans - Cathedral Building Style waiting often long times for the Final Delivery |
EPA: "Choose Open Standards without licensing costs instead of proprietary or vendor-specific formats" (Naturvårdsverket 2016)
Avoid problematic vendor lock-in and enable long-term maintenance of data using standardized archive formats
Long-term sustainable file formats and software projects are fundamental prerequisites for long-term maintenance of biodiveristy information over longer periods of time (decades) - data archives won't change as often as software or APIs
Swedish Land Survey Office about using the Creative Commons license CC0 for publishing open data.
Principles for managing data over a long life cycles - five recommendations:
Make use of sustainable communities and projects - consider socio-technical perspectives. Are there communities with a life length more than a couple of years that maintains standards/formats/tools and "are there stable releases over time publicly available"? The GBIF and Atlas communities offer this.
Don't "bikeshed" - use already existing appropriate open standards/formats (such as DarwinCore Archives with zipped CSV, DCAT-AP standard for national and European data sharing) rather than making interoperable systems elsewhere dependant on custom XML/JSON from local homegrown APIs
Use appropriate licenses (technical, legal) - such as Creative Commons licenses for content and Affero GPLv3 for code.
Use suitable available libraries for reading and writing the formats (supporting automation using various languages - examples include dwcio libraries, name parsers etc, conversion tools)
Use relevant and adequate FOSS tools - ALA and GBIF offer many software components (such as IPT, Darwin Core validator tools and perhaps also various APIs that extend use to HTTP and which are expected to change more frequently than underlying archival file formats). There are also CKAN with DCAP-AP-extensions for Open Data exchange along with various format conversion tools (ipt-to-dcatapswe), EML parser etc.