Reasoning in Knowledge Graphs: An Embeddings Spotlight

Posted on: Fri, 05/29/2020 - 10:38 By: valentina.janev

In this lecture, we introduce the aspect of reasoning in Knowledge Graphs. We give a broad overview focusing on the multitude of reasoning techniques: spanning logic-based reasoning, embedding-based reasoning, neural network-based reasoning, etc. In particular, we discuss three dimensions of reasoning in Knowledge Graphs. Complementing these dimensions, we will structure our exploration based on a pragmatic view of reasoning tasks and families of reasoning tasks: reasoning for knowledge integration, knowledge discovery and application services.

Data Lakes and Federated Query Processing

Posted on: Wed, 04/01/2020 - 20:54 By: valentina.janev

Big data plays a relevant role in promoting both manufacturing and scientific development through industrial digitization and emerging interdisciplinary research. Semantic web technologies have also experienced great progress, and scientific communities and practitioners have contributed to the problem of big data management with ontological models, controlled vocabularies, linked datasets, data models, query languages, as well as tools for transforming big data into knowledge from which decisions can be made.

Reasoning in Knowledge Graphs

Posted on: Mon, 12/24/2018 - 16:28 By: valentina.janev

This lecture discusses reasoning in Knowledge Graphs. Reasoning is essential to gain value from Knowledge Graphs by deriving insights and making available new implicit data from existing data. We will cover the theory and practice of reasoning in Knowledge Graphs, and provide a number of easily accessible examples based on Oxford’s Vadalog system.

 

Distributed Big Data Frameworks

Posted on: Mon, 12/24/2018 - 16:20 By: valentina.janev

The “processing frameworks” are one of the most essential components of a Big Data systems. There are three categories of such frameworks namely: Batch-only frameworks (Hadoop), Stream-only frameworks (Storm, Samza), and Hybrid frameworks (Spark, Hive and Flink). In this lecture, we will introduce them and cover one of the major Big Data frameworks, Apache Spark. We will cover Spark fundamentals and the model of “Resilient Distributed Datasets (RDDs)” that are used in Spark to implement in-memory batch computation.

Big Data Solutions in Practical Use-cases

Posted on: Mon, 12/24/2018 - 16:19 By: valentina.janev

This lecture focuses on architecting Big Data solution. We will discuss the role and importance of the components in realizing system architectures. The participants will be introduced to unique problem characteristics that drive Big Data and the unending technology options to solve them. The application of the introduced concepts and components will be discussed in real-world example of practical use-cases.

Introduction to Big Data Architecture

Posted on: Mon, 12/24/2018 - 16:18 By: valentina.janev

This lecture will cover the existing advanced Big Data architectures following a bottom-up approach. In this lecture, the important knowledge to design and architect scalable solutions for challenging problems will be introduced. The primary components in the architecture of such systems and their architectures will be presented and discussed including “inter alia distributed kernels” and cluster managers, distributed file systems and storage systems.

Subscribe to Semantic Big Data Architectures