Semantic Analytics Stack (SANSA)

SANSA is a big data engine for scalable processing of large-scale RDF data; see the SDA web pages for details. SANSA builds on Spark and Flink, which offer fault-tolerant, highly available, and scalable processing of massive datasets. SANSA provides facilities for semantic data representation, querying, inference, and analytics.

SANSA-Stack’s core is a data flow processing engine that provides data distribution and fault tolerance for distributed computations over large-scale RDF datasets.
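
To make the idea of data distribution concrete, the following is a minimal sketch in plain Python. It is not SANSA's API: the function name partition_by_subject, the tuple representation of triples, and the choice of hashing on the subject are all assumptions made for illustration. It shows one common way to split RDF triples across workers so that every triple about the same subject lands in the same partition.

```python
import zlib
from collections import defaultdict


def partition_by_subject(triples, num_partitions):
    """Group (subject, predicate, object) tuples into hash partitions.

    Hypothetical helper for illustration only; not part of SANSA.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")
    partitions = defaultdict(list)
    for s, p, o in triples:
        # crc32 is deterministic across runs, unlike Python's built-in hash()
        index = zlib.crc32(s.encode("utf-8")) % num_partitions
        partitions[index].append((s, p, o))
    return dict(partitions)
```

Keeping all triples of one subject together means that lookups and joins on the subject can run inside a single partition; an engine such as Spark or Flink then takes care of scheduling the partitions and recomputing them after a failure.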

SANSA includes several libraries for creating applications:

Read/Write RDF/OWL library for RDF/OWL operations,
Querying library that supports a query language on top of the distributed RDF/OWL library,
Inference library that implements rule-based reasoning on RDF/OWL data,
ML: Machine Learning core library.
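
As an illustration of what rule-based reasoning on RDF data means, here is a small single-machine sketch in plain Python. It does not use SANSA's inference library; the function name infer_rdfs_subset and the prefixed-string representation of terms are assumptions. It applies two standard RDFS entailment rules (transitivity of rdfs:subClassOf, and propagation of rdf:type along rdfs:subClassOf) by naive forward chaining until no new triples appear.

```python
RDF_TYPE = "rdf:type"
SUBCLASS_OF = "rdfs:subClassOf"


def infer_rdfs_subset(triples):
    """Return the input triples plus everything derivable by two RDFS rules.

    Hypothetical helper for illustration only; not part of SANSA.
    """
    closure = set(triples)
    while True:
        subclass_pairs = [(s, o) for s, p, o in closure if p == SUBCLASS_OF]
        derived = set()
        # Rule 1: (A subClassOf B), (B subClassOf C) => (A subClassOf C)
        for a, b in subclass_pairs:
            for c, d in subclass_pairs:
                if b == c:
                    derived.add((a, SUBCLASS_OF, d))
        # Rule 2: (x type A), (A subClassOf B) => (x type B)
        for s, p, o in closure:
            if p == RDF_TYPE:
                for a, b in subclass_pairs:
                    if o == a:
                        derived.add((s, RDF_TYPE, b))
        if derived <= closure:
            # Fixpoint reached: this pass produced nothing new
            return closure
        closure |= derived
```

For example, from "ex:rex rdf:type ex:Dog", "ex:Dog rdfs:subClassOf ex:Mammal" and "ex:Mammal rdfs:subClassOf ex:Animal", the sketch also derives that ex:rex is an ex:Mammal and an ex:Animal. A distributed reasoner would express the same rules as joins over partitioned data rather than nested loops.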

SANSA integrates easily with well-known open source systems for data input and output (HDFS) and is built on top of Spark and Flink.

Tool type

Analytics Software / System / Platform