#Data

General Overview

The components that are responsible for data storage, storage management, data sharing, standardisation and exchange.

The #data components within the framework of NOUS play a pivotal role in addressing the intricate dynamics of the data life cycle. NOUS is committed to establishing a comprehensive infrastructure for data life cycle management, integrating key elements such as governance, accessibility, and privacy considerations. A primary focus of NOUS is on researching and implementing cutting-edge, low-power distributed ledger technologies to ensure the integrity of stored data. This involves expanding upon innovative concepts like directed acyclic graphs or tangles, wherein only cryptographic summaries or hashes of the data are registered, respecting individuals’ right to be forgotten. Moreover, NOUS envisions its data storage components as a central hub for aggregating information from diverse Data Spaces and platforms. The project aims to develop syndication tools that facilitate the seamless combination of data from various sources. As a practical contribution to the developer and engineering community, NOUS will introduce essential tools. Firstly, an auto standardizer will be devised to transform data from any format into a standardized, universally recognized format, particularly in Mobility, Energy, and Green Deal data spaces. Secondly, NOUS plans to introduce a Virtual Lab designed for the collaborative sharing of knowledge on advanced data analytics techniques. These practical tools are anticipated to empower developers and engineers working within data spaces, fostering a more streamlined and efficient data ecosystem.

Applied tools and Methodology

Blockchain-based systems and databases, Data standardiser, Dataspace connectors

Solution

The data component of project will be focused on delivering a blockchain implementation, research standardisation processes and connecting with Data Spaces. Distributed Ledger Technologies such as tangles and green blockchains will be used in the project for data life cycle transparency and monitoring. While this is mentioned as a blockchain part, the project is more interested into the distributed ledger and the way it is implemented in the Blockchain. Thus, the project will determine if and how the distributed ledger can be used for the aforementioned goals, designed and validated given the use cases’ defined problems. Furthermore, the use-cases will be used extract information on how data life cycle is currently handled in real-world scenarios (2nd function of use cases). The standardisation processes will be researched for the domains of mobility, energy and green deal in the scope of first, determining harmonisation possibilities between the standards per data space and with EIRA and second, creating an auto-standardiser (AS). The AS aims to assist in a highly practical issue; organisations and companies that want to standardise their data, need to spend a significant amount of money to do, since not all domains have standardisation tools. The AS is a data-lake concept where data are “thrown” in the “buckets” and the AS attempts to relate the data elements of the provided dataset to data elements of a standard. When it fails, it will make specific requests to humans to manually correlate the data elements. This algorithm is expected to be heuristic and recursive, making use of imitation learning techniques to learn from human manual correlations and progressively improve their performance. Lastly, the connection with Data Spaces aims to address the call request for the successful deployment of Data Spaces. NOUS is a conductor of data analysis. It is where the actual data analytics take place because the cloud suites have computational resources. Within the NOUS data should flow freely in two ways, first, data can be uploaded by users or created by users’ applications (e.g., a web blog that allows registrations and has videos) or data can flow from other data pools/warehouses. The latter part is what is of interest for the project while relating it with data standardisation and data sovereignty (see blockchain above). Thus, the latter part of #data aims to address the entirety of the data life cycle in the NOUS considering that data can flow in from external data warehouses.

Impact

The integration of NOUS with diverse Data Spaces and the blockchain solution marks a significant enhancement, imparting additional value to the data. This is attributed to the high interoperability achieved and the ability to leverage information across various sectors. The NOUS approach is poised to swiftly establish practical solutions characterized by reasonable investments and rapid time-to-market deployment. The anticipated outcome is the emergence of more competitive and innovative data-driven services and products, positioning NOUS as a catalyst for transformative advancements in the digital landscape.

NOUS, through its dynamic framework, will facilitate seamless data access, fusion, and sharing within digital infrastructures. This streamlined data processing and exchange will be conducted with a keen focus on ensuring compliance with security, privacy, and data space requirements. NOUS achieves this through the implementation of cross-domain standards and a decentralized blockchain database. A key objective of the project is to enhance data-sharing capabilities and improve access to cloud, High-Performance Computing (HPC) services, and Data Spaces. This aligns with the overarching mission of NOUS to expedite the development of data-driven and AI-based digital services, facilitating their deployment by both public and private users. The project aspires to be a driving force in accelerating the evolution of digital services and technologies, ushering in a new era of efficiency and innovation.