OPEN ARCHITECTURE

An SDMX based modular platform covering the complete end-to-end data lifecycle (GSBPM) from collection through to dissemination.

The .Stat Suite covers a broader part of the data lifecycle (GSBPM steps: Design; Collect; Process; Analyse; Disseminate) helping to resolve the limitations observed today by providing a set of back-office components that can be assembled in a way that is tailored for the specific context and processes of each organisation.

By adding SDMX full compliance (on both the input and output side), as well as connectors with main market tools, this is greatly facilitating an end-to-end integration with other data processing platforms already in place, as well as excel or other similar tools.

This is helping dissemination teams in statistical organisations to manage in a much more cost efficient way the different data sources they need to aggregate and set up the proper workflows ensuring quality outputs.

By taking an approach that is both modular and open (architecture) by design, complemented with the proper technical capacity building, this enables teams to be mobilised from around the world in a way that is efficient and to synergise developments done by other organisations to avoid reinventing the wheel.

OPEN standards

COMMON STATISTICAL PRODUCTION ARCHITECTURE

CSPA is a reference architecture for the statistical industry covering statistical production across the different processes as defined by the GSBPM to promote greater interoperability among the different stakeholders across ecosystem of official statistics. CSPA main objective is to provide guidelines for defining, developing and deploying statistical services more efficiently.

STATISTICAL DATA AND METADATA EXCHANGE

SDMX is an international agreed standard that goes beyond just it’s name facilitating for the creation of a set of tools and services from a underlying common library. Assisting in the standardisation of data modeling and reporting for a solid foundation to the statistical system.

GENERIC STATISTICAL BUSINESS PROCESS MODEL

GSBPM describes all the activities undertaken by a statistical organisation, including those undertaken to conceptualise, design, build and maintain information and application assets used in the production of statistical outputs. GSBPM defines the Information, Application and Technology architectures for a statistical organisation.

GENERIC STATISTICAL INFORMATION MODEL

GSIM is a reference framework of information objects, which enables generic descriptions of the definition, management and use of data and metadata throughout the statistical production process. It also describes the information standards and frameworks that underpin the statistical information. GSIM facilitates discoverability and accessibility, leading to greater reuse and sharing.

Open technology

REUSABLE COMPONENTS FOR THE WEB

Reusable Components for the Web (RCW) are a set of JavaScript components covering all aspects of end-user web applications, from APIs to User Interfaces.

In order to efficiently reuse components, they are versioned and managed by the most well established package manager of the JavaScript ecosystem: npm.

Collaboration is the key concern of RCWs. To ease it, source code is handled by CVS (like git) and hosted by services like github.

The technical stack on which the components are based is React for the UI and Redux for the behaviour.

The visualisation is a very specific part of the User Interface that is powered by D3. A versatile dedicated chart engine has been built on top of it, with an optional integration with React.

SDMXSOURCE AND TOOLS

The Community works in close partnership with Eurostat enabling the reuse of common modules from the SDMX Reference Infrastructure (RI) package. This includes a web service supporting information exchanges with other modules, and consumption of internal and external flows in sdmx-ml, sdmx-json, and sdmx-csv formats, a web interface to support mappings between different data structures, and a storage for sdmx structural artefacts.

SDMX-RI and SDMXSOURCE are released under open source license, and used within more than 50 countries worldwide. It is available in both Java and .NET and can be used with different database engines back-ends such as Mariadb, MSSQLServer, and Oracle.

SEARCH ENGINE

An out-the-box and exchangeable search engine, Solr (Apache Lucene), provides storage for search documents and returning of a relevant subset by search queries. Storage of accumulated data for dynamic facet generation as well as non-search related information such as configuration settings.

Solr is an enterprise open source search engine, written in Java, that uses a NoSQL database to enable extended search capabilities.

Interested? Contact us!