Tools and data management

PreviousQuantitative Analysis NextResults

Last updated 7 years ago

Tools and data management

In this part of the thesis, some of the more technical high level tools are introduced, these are mainly made of programming languages, libraries and protocols that were essential in reaching the goals for the analysis.

Data Storage and Retrieval

The first tool that is worth mentioning is the graph database where all of the patents, scientific publications and projects were stored, work developed in the context of the AMICa pathfinder project at DTU. Graph databases are a special type of database that use network structures composed of nodes and edges, to represent and store data. This is opposed to the relational database model where data is separated into different tables. The advantage of “graph databases” lies in the relationships, and the fact that these are explicit.

The database used is managed in , an open source graph database management system that allows not only to easily run a database server in a machine, but also allows the direct interaction and querying of the data.

To query the data, Neo4j requires the usage of a programming language know as . Cypher can be understood as the graph database equivalent of SQL, and allows for fast relational queries to the data. As a simple example to the cypher language, let us retrieve all of the technological assets located in Denmark:

A snippet of the response follows:

Data Handling and Analysis

For handling the data, an extensive list of python libraries was used, the most important ones follow:

For visualizations, three toolkits were recurrently used:

Code Management and Presentation

PreviousQuantitative Analysis NextResults

Last updated 7 years ago