Distributed Semantic Analytics

Posted on: Fri, 06/18/2021 - 14:21 By: valentina.janev

This module will cover the needs and challenges of distributed analytics and then dive into the details of scalable semantic analytics stack (SANSA) used to scalable analytics for knowledge graphs. This module will cover the setup, Apis and different layer of SANSA. At the end of this module, the audience will be able to execute examples and create programs that use SANSA APIs.

 

Semantic Analytics in the Palm of Your Browser

Posted on: Mon, 05/31/2021 - 15:36 By: valentina.janev

Linked open data sources and the semantic web has become a precious data source for data analytics tasks and data integration. The growing data set sizes of RDF Knowledge Graph data need scalable processing and analytics techniques. The processing power of in-memory frameworks which can perform scalable distributed semantic analytics like SANSA, make use of Apache Spark and Apache Jena to provide start-to-end extensive scalable analytics on RDF knowledge graphs.

PLATOON Data Analytics Toolbox

Posted on: Wed, 05/12/2021 - 10:54 By: valentina.janev

The PLATOON Data Analytics toolbox is formed of all the data analytics tools that are developed and validated for the different use cases of the project. These tools will allow the extraction of value from heterogeneous data sources. There will be two main groups of data analytics tools:

Scalable Knowledge Graph Processing using SANSA

Posted on: Fri, 05/29/2020 - 10:55 By: valentina.janev

The size and number of knowledge graphs have increased tremendously in recent years. In the meantime, the distributed data processing technologies have also advanced to deal with big data and large scale knowledge graphs. This lecture introduces Scalable Semantic Analytics Stack (SANSA), which addresses the challenge of dealing with large scale RDF data and provides a uni ed framework for applications like link prediction, knowledge base completion, querying, and reasoning.

SANSA - Scalable Semantic Analytics Stack

Posted on: Wed, 04/01/2020 - 20:59 By: valentina.janev

The size of knowledge graphs has reached the scale where centralised analytical approaches have become infeasible. Recent technological progress has enabled powerful distributed in-memory analytics that have been shown to work well on simple data structures. However, the application of such distributed analytics approaches on semantic knowledge graphs lags significantly behind. To advance both scalability and accuracy of large-scale knowledge graph analytics to a new level, foundational research on methods leveraging distributed in-memory computing and semantic technologies in combination w

Distributed Semantic Analytics II

Posted on: Mon, 12/24/2018 - 16:22 By: valentina.janev

This module will cover the setup, APIs and different layers of SANSA. At the end of this module, the audience will be able to execute examples and create programs that use SANSA APIs. The final part of this lecture is planned to be an interactive session to wrap up the introduced concepts and present attendees some open research questions which are nowadays studied by the community.

Distributed Semantic Analytics I

Posted on: Mon, 12/24/2018 - 16:21 By: valentina.janev

This module will cover the needs and challenges of distributed analytics and then dive into the details of scalable semantic analytics stack (SANSA) used to perform scalable analytics for knowledge graphs. It will cover different SANSA layers and the underlying principles to achieve scalability for knowledge graph processing.

Please, download from the following link.

Distributed Big Data Libraries

Posted on: Mon, 12/24/2018 - 16:21 By: valentina.janev

In the practical level, the Big Data frameworks use different APIs for graph computations and graph processing. In this lecture, the important libraries built on top of Apache Spark will be covered. These include SparkSQL, GraphX and MLlib. The audience will learn to build scalable algorithms in Spark using Scala.

Please, downoloadfrom the following link.

Subscribe to Smart Data Analytics