Alpaca


Introduction

Introduction

Functional requirements

The following requirements should be fulfilled:

Introduction

Design Principles

To limit the possibilities we also decided on some principles the solution should follow:

Introduction

Basic pipeline architecture

alpaca_highlevel_2024.png

During each (critical) step data will be written to storage (which will be HA/redundant) to ensure no data will be lost when a critical failure occurs.
The amount of data in memory will be limited as much as possible

Introduction

Base components

During extensive research and experiences from the past, the following software-stacks have been selected as the preferred components to build the new solution.

Other packages will be selected depending on need or to handle specific use-cases.

Download

Download

Vector

At the core of ALPACA vector is doing most of the heavy lifting.

It can be downloaded at: www.vector.dev

Download

Apache Kafka

Between each major operation data is buffered.

Alpaca uses apache-kafka as a proven/reliable/scalable solution.

It can be downloaded here: https://kafka.apache.org/

Download

Opensearch

opensearch is used for datalake and dashboarding.

It can be downloaded here: https://opensearch.org/

Download

Installer

The installer is a set of Ansible playbooks

A complete tar-file can be downloaded here and can be placed in a pre-existing ansible-environment.

 

An installer for an "All-in-one" server (single-node) can be found here: <TODO>

Installation

Configuration

Monitoring

Integration