EdgeFirst VideoStream Library

Video utility library for embedded Linux providing inter-process frame sharing, V4L2 camera capture, and hardware-accelerated H.264 encoding.

VideoStream delivers essential video I/O capabilities for embedded vision applications, including camera access, codec integration, and optional inter-process communication.

Overview

VideoStream Library provides three core capabilities for embedded Linux video applications:

1. Video4Linux2 Camera Capture

V4L2 device interface for camera frame acquisition
DmaBuf export for zero-copy capture from camera drivers (required for DMA operation)
Format negotiation and capability detection
Multi-planar format support (YUV420, NV12, YUYV, etc.)
Camera controls (exposure, gain, white balance, etc.)
Hardware compatibility with MIPI CSI-2, USB cameras, and other V4L2 devices

2. Hardware H.264/H.265 Encoding

V4L2 codec integration for hardware-accelerated encoding on NXP platforms
Hantro VPU on i.MX 8M Plus, Wave6 VPU on i.MX 95
Zero-copy input from DmaBuf or camera buffers
H.264/H.265 codec support with configurable bitrate profiles
Low-power encoding minimizes CPU load
Direct DmaBuf encoding for efficient pipeline integration

3. Inter-Process Frame Sharing

Zero-copy buffer sharing via Linux DmaBuf and POSIX shared memory
UNIX domain socket IPC with file descriptor passing (SCM_RIGHTS)
Thread-safe reference counting for multi-consumer frame access
Host/Client architecture - one producer, multiple consumers
GStreamer plugins (vslsink, vslsrc) for multi-process pipeline integration
Configurable frame lifespans with automatic expiry and cleanup

Originally developed as part of the DeepView project, VideoStream is now used by EdgeFirst Perception and available as a standalone library for embedded vision applications.

Features

Inter-Process Frame Sharing

Zero-Copy Transfer - DmaBuf and POSIX shared memory for efficient frame sharing
UNIX Socket IPC - File descriptor passing via SCM_RIGHTS ancillary data
Multi-Consumer Support - Multiple clients can access same frame pool concurrently
Reference Counting - Automatic frame lifecycle management prevents memory leaks
GStreamer Plugins - vslsink (producer) and vslsrc (consumer) for pipeline integration
Timeout Management - Configurable frame lifespans and client timeouts

V4L2 Camera Capture

Standard V4L2 Interface - Compatible with any Video4Linux2 camera driver
DmaBuf Export - Zero-copy frame acquisition via VIDIOC_EXPBUF (required for DMA operation)
Format Support - YUV (420, 422, 444), RGB variants, Bayer patterns
Multi-Planar - Native support for multi-planar formats (NV12, NV21, etc.)
Camera Controls - Exposure, gain, white balance, and other V4L2 controls
DMA-Capable Buffers - Uses DmaBuf for hardware accelerator compatibility (VPU, NPU)

Hardware H.264/H.265 Encoding

V4L2 Codec Backend - Unified interface for Hantro VPU (i.MX 8M Plus) and Wave6 VPU (i.MX 95)
Codec Support - H.264 (AVC) and H.265 (HEVC) with profile/level configuration
Rate Control - CBR, VBR, and fixed QP encoding modes
Zero-Copy Input - Direct encoding from camera DmaBuf or shared memory
Low Latency - Hardware encoding minimizes CPU load and latency
Multi-Stream - Support for concurrent encoding streams

Platform Support

NXP i.MX8M Plus - Full support (G2D, Hantro VPU, V4L2 DmaBuf)
NXP i.MX 95 - V4L2 codec (Wave6 VPU), G2D (DPU), libcamera NEO-ISP camera pipeline
NXP i.MX8M - DmaBuf and basic hardware acceleration
Generic ARM64/ARMv7 - POSIX shared memory fallback
x86_64 - Development and testing (software encoding)

See HARDWARE.md for detailed platform capabilities and tested configurations.

Quick Start

Installation

Prerequisites

# Ubuntu/Debian
sudo apt-get install -y \
    build-essential cmake pkg-config \
    libgstreamer1.0-dev \
    libgstreamer-plugins-base1.0-dev

# Fedora/RHEL
sudo dnf install -y \
    gcc gcc-c++ cmake \
    gstreamer1-devel \
    gstreamer1-plugins-base-devel

Build from Source

git clone https://github.com/EdgeFirstAI/videostream.git
cd videostream
cmake -S . -B build -DCMAKE_BUILD_TYPE=Release
cmake --build build -j$(nproc)
sudo cmake --install build

Cross-Compile for ARM64 (NXP Yocto SDK)

source /opt/fsl-imx-xwayland/5.4-zeus/environment-setup-aarch64-poky-linux
cmake -S . -B build -DCMAKE_BUILD_TYPE=Release
cmake --build build -j$(nproc)

Usage Examples

Example 1: VideoStream CLI (Recommended)

The videostream CLI provides the easiest way to work with cameras and video streaming:

# Display hardware capabilities (camera, VPU encoder/decoder)
videostream info

# Record H.264 video from camera (60 frames)
videostream record output.h264 --frames 60 --device /dev/video3

# Convert H.264 to MP4 container
videostream convert output.h264 output.mp4

# Stream camera frames to VSL socket (with H.264 encoding)
videostream stream /tmp/camera.sock --encode --codec h264 &

# Receive frames and display metrics
videostream receive /tmp/camera.sock --frames 100 --json

For detailed CLI usage, run videostream --help or videostream <command> --help.

Example 2: V4L2 Camera Capture (C API)

#include <videostream.h>
#include <stdio.h>

int main(int argc, char *argv[])
{
    // Open V4L2 camera device
    vsl_camera* camera = vsl_camera_open_device("/dev/video0");
    if (!camera) {
        fprintf(stderr, "Failed to open camera\n");
        return 1;
    }

    // Configure camera for 1920x1080 NV12 capture
    int width = 1920;
    int height = 1080;
    int buf_count = 4;
    uint32_t fourcc = VSL_FOURCC('N', 'V', '1', '2');

    if (vsl_camera_init_device(camera, &width, &height, &buf_count, &fourcc)) {
        fprintf(stderr, "Failed to init camera\n");
        vsl_camera_close_device(camera);
        return 1;
    }

    printf("Camera initialized: %dx%d, buffers=%d\n", width, height, buf_count);

    // Start streaming
    vsl_camera_start_capturing(camera);

    // Capture frames
    for (int i = 0; i < 100; i++) {
        vsl_camera_buffer* buffer = vsl_camera_get_data(camera);
        if (!buffer) {
            continue;
        }

        printf("Captured frame %d, size=%u bytes\n",
               i, vsl_camera_buffer_length(buffer));

        // Process frame data
        void* data = vsl_camera_buffer_mmap(buffer);
        // ... process frame ...

        // Return buffer to queue
        vsl_camera_release_buffer(camera, buffer);
    }

    vsl_camera_stop_capturing(camera);
    vsl_camera_uninit_device(camera);
    vsl_camera_close_device(camera);
    return 0;
}

Example 3: Hardware H.264 Encoding (C API)

#include <videostream.h>
#include <stdio.h>

int main(int argc, char *argv[])
{
    // Create encoder instance for H.264
    uint32_t h264_fourcc = VSL_FOURCC('H', '2', '6', '4');
    VSLEncoder* encoder = vsl_encoder_create(VSL_ENCODE_PROFILE_5000_KBPS,
                                              h264_fourcc,
                                              30);  // 30 fps
    if (!encoder) {
        fprintf(stderr, "Failed to create encoder\n");
        return 1;
    }

    // Create source and destination frames
    VSLFrame* source = vsl_frame_init(1920, 1080, 1920,
                                      VSL_FOURCC('N', 'V', '1', '2'),
                                      NULL, NULL);
    vsl_frame_alloc(source, NULL);  // Auto dmabuf/shm allocation

    // Encode frames
    for (int i = 0; i < 100; i++) {
        // Create output frame for encoded data
        VSLFrame* dest = vsl_encoder_new_output_frame(encoder,
                                                       1920, 1080,
                                                       33333333LL,  // duration (ns)
                                                       i * 33333333LL,  // pts
                                                       i * 33333333LL); // dts

        // Encode frame
        int keyframe = 0;
        int ret = vsl_encode_frame(encoder, source, dest, NULL, &keyframe);

        if (ret == 0) {
            printf("Encoded frame %d: %d bytes%s\n",
                   i, vsl_frame_size(dest),
                   keyframe ? " (keyframe)" : "");

            // Access encoded data
            void* data = vsl_frame_mmap(dest, NULL);
            // ... write to file or network ...
            vsl_frame_munmap(dest);
        }

        vsl_frame_release(dest);
    }

    vsl_frame_release(source);
    vsl_encoder_release(encoder);
    return 0;
}

Example 4: Inter-Process Frame Sharing with GStreamer

This example demonstrates sharing camera frames between processes using vslsink and vslsrc.

Terminal 1 - Producer (Camera Host):

# Capture from V4L2 camera and share frames via VideoStream
# The path can be explicit or auto-generated from element name
gst-launch-1.0 v4l2src device=/dev/video0 ! \
    video/x-raw,width=1920,height=1080,format=NV12,framerate=30/1 ! \
    vslsink path=/tmp/vsl_camera lifespan=100

Socket Path Convention:

Filesystem paths start with / (e.g., /tmp/vsl_camera)
Abstract sockets don't start with / (e.g., vsl_camera)
If no path specified, vslsink auto-generates: /tmp/<element_name>.<thread_id>

Terminal 2 - Consumer 1 (Display):

# Receive frames and display
gst-launch-1.0 vslsrc path=/tmp/vsl_camera ! autovideosink

Terminal 3 - Consumer 2 (Recording):

# Simultaneously record to file (both consumers share same frames)
gst-launch-1.0 vslsrc path=/tmp/vsl_camera ! \
    x264enc ! mp4mux ! filesink location=recording.mp4

Example 5: Frame Sharing with Native C API

#include <videostream.h>
#include <stdio.h>

// Client application accessing shared frames
int main(int argc, char *argv[])
{
    // Connect to frame pool host
    VSLClient* client = vsl_client_init("/tmp/vsl_camera", NULL, true);
    if (!client) {
        fprintf(stderr, "Failed to connect to frame pool\n");
        return 1;
    }

    printf("Connected to %s\n", vsl_client_path(client));

    while (1) {
        // Wait for next frame (blocks until frame available)
        // Pass 0 for "until" to get next available frame
        VSLFrame* frame = vsl_frame_wait(client, 0);
        if (!frame) {
            fprintf(stderr, "Failed to receive frame: %s\n", strerror(errno));
            continue;
        }

        // Lock frame for access
        if (vsl_frame_trylock(frame) < 0) {
            vsl_frame_release(frame);
            continue;  // Frame no longer valid
        }

        // Access frame metadata
        printf("Frame %ld: %dx%d, format=0x%x, pts=%ld\n",
               vsl_frame_serial(frame),
               vsl_frame_width(frame),
               vsl_frame_height(frame),
               vsl_frame_fourcc(frame),
               vsl_frame_pts(frame));

        // Map frame memory
        size_t size;
        void* data = vsl_frame_mmap(frame, &size);
        if (data) {
            // Process frame (e.g., run computer vision algorithm)
            // process_image(data, vsl_frame_width(frame), vsl_frame_height(frame));

            vsl_frame_munmap(frame);
        }

        // Unlock and release frame
        vsl_frame_unlock(frame);
        vsl_frame_release(frame);
    }

    vsl_client_release(client);
    return 0;
}

How VideoStream Fits into EdgeFirst Perception

VideoStream is a utility library used by EdgeFirst Perception's camera service. The edgefirst-camera node leverages VideoStream's V4L2 camera capture and hardware encoding features to acquire and compress video streams before publishing to the Zenoh middleware.

EdgeFirst Perception Architecture

graph TB
    subgraph "Sensor Layer"
        CAM[Camera<br/>ISP + Codec]
        SENS[Radar/LiDAR<br/>NavSat/IMU]
    end

    subgraph "Processing Layer"
        VIS[Vision Model<br/>Inference]
        FUS[Fusion Model<br/>Inference]
    end

    subgraph "Output Layer"
        REC[Recorder<br/>MCAP]
        WEB[Web UI<br/>HTTPS]
    end

    subgraph "Middleware"
        ZEN[ZENOH<br/>Middleware]
    end

    subgraph "User Applications"
        APPS[ROS2, Foxglove]
    end

    CAM --> VIS
    CAM --> FUS
    SENS --> FUS
    VIS --> REC
    FUS --> WEB
    VIS --> ZEN
    FUS --> ZEN
    REC --> ZEN
    WEB --> ZEN
    ZEN --> APPS

    style CAM fill:#e1f5ff
    style VIS fill:#fff4e1
    style FUS fill:#fff4e1
    style REC fill:#e8f5e9
    style WEB fill:#e8f5e9
    style ZEN fill:#f3e5f5
    style APPS fill:#fce4ec

VideoStream's Role in edgefirst-camera:

graph TB
    subgraph EFC["edgefirst-camera Service"]
        CAMDEV["/dev/video0<br/>Camera Device"]

        subgraph VSL["VideoStream Library"]
            V4L2[V4L2 Camera<br/>Capture API]
            ENC[Hardware Encoder<br/>H.264/H.265 API]
        end

        ZENOH_PUB[Zenoh Publisher<br/>not part of VSL]
    end

    ZENOH[Zenoh Middleware]
    PERC[Perception Pipeline]

    CAMDEV --> V4L2
    V4L2 -->|DmaBuf Raw Frames| ZENOH_PUB
    V4L2 -->|DmaBuf| ENC
    ENC -->|H.264 Stream| ZENOH_PUB
    ZENOH_PUB --> ZENOH
    ZENOH --> PERC

    style V4L2 fill:#bbdefb
    style ENC fill:#bbdefb
    style ZENOH_PUB fill:#c5e1a5
    style ZENOH fill:#f3e5f5

VideoStream's Role in EdgeFirst Perception

Primary Use: Camera and Codec Utilities in edgefirst-camera

VideoStream provides low-level camera and codec APIs used by the edgefirst-camera service:

V4L2 Camera Capture (vsl_camera_* API)
- Acquires frames from camera devices (MIPI CSI-2, USB cameras)
- Exports DmaBuf file descriptors for zero-copy operation
- Required for DMA-capable hardware (VPU, NPU)
Hardware H.264/H.265 Encoding (vsl_encoder_* API)
- Compresses frames using VPU hardware acceleration
- Accepts DmaBuf input for zero-copy encoding
- Produces H.264/H.265 streams for network transmission
Data Flow:
- VideoStream captures frames and provides them to edgefirst-camera
- edgefirst-camera publishes raw DMA camera frames directly over Zenoh (using pidfd_getfd)
- Optionally, VideoStream encodes frames to H.264/H.265 for efficient streaming
- Zenoh publisher (part of edgefirst-camera, not VSL) handles network distribution
- Zenoh middleware distributes to perception pipeline

VideoStream Does NOT Handle:

Network transport (handled by Zenoh middleware)
Inter-service messaging (handled by Zenoh)
Frame sharing between perception services (EdgeFirst uses Zenoh + pidfd_getfd)
Publishing frames to Zenoh (handled by edgefirst-camera service)

Optional Feature: Inter-Process Frame Sharing via UNIX Sockets

VideoStream includes an optional IPC mechanism for non-EdgeFirst applications:

UNIX Domain Socket + FD Passing: Share frames between local processes using SCM_RIGHTS
GStreamer Plugins: vslsink (producer) and vslsrc (consumer) for multi-process pipelines
Use Case: Standalone applications requiring local frame sharing WITHOUT network transport
Note: EdgeFirst Perception services use Zenoh for messaging, not VSL's IPC mechanism

Standalone Use Cases

VideoStream can be used independently for:

V4L2 camera capture applications
Hardware-accelerated encoding applications
GStreamer-based vision systems
Multi-process video pipelines
Embedded video I/O without perception middleware

Architecture Highlights

VideoStream is organized into three main functional areas:

1. Frame Sharing Architecture

Host/Client Model: One process hosts frame pool, multiple clients consume
Threading: Multi-threaded host with GStreamer task thread for client servicing
IPC Protocol: UNIX domain sockets with file descriptor passing (SCM_RIGHTS)
Memory Management: DmaBuf zero-copy or POSIX shared memory fallback

2. V4L2 Camera Interface

Direct kernel interface via ioctl() calls
Buffer management: MMAP, USERPTR, or DmaBuf buffer types
Format negotiation: Automatic capability detection and format selection
Camera controls: Full access to V4L2 control interface

3. Hardware Encoding Pipeline

V4L2 codec wrapper: Abstraction layer for Hantro VPU (i.MX 8M Plus) and Wave6 VPU (i.MX 95)
Zero-copy encoding: Direct encoding from DmaBuf or camera buffers
Buffer recycling: Efficient frame buffer reuse for low latency
Bitrate control: Configurable CBR/VBR encoding modes
Device discovery: Automatic V4L2 device probing by capability

See ARCHITECTURE.md for detailed threading diagrams, data flow, and internal design.

Integration with EdgeFirst Perception

For edgefirst-camera Developers

VideoStream provides the camera I/O and encoding foundation:

// Typical edgefirst-camera integration pattern

// 1. Open camera via VideoStream V4L2 interface
int camera = v4l2_open_device("/dev/video0");
v4l2_configure(camera, 1920, 1080, V4L2_PIX_FMT_NV12, 30);

// 2. Initialize hardware encoder
VPUEncoder *encoder = vpu_encoder_create();
vpu_encoder_configure(encoder, &encode_params);

// 3. Capture and encode loop
while (running) {
    // Capture frame from camera
    struct v4l2_buffer buffer;
    v4l2_capture_frame(camera, &buffer);

    // Encode frame using hardware VPU
    uint8_t *h264_data;
    size_t h264_size;
    vpu_encoder_encode_frame(encoder, buffer.data, &h264_data, &h264_size);

    // Publish to Zenoh middleware
    zenoh_publish("/camera/h264", h264_data, h264_size);

    // Release resources
    vpu_encoder_release_output(encoder, h264_data);
    v4l2_release_frame(camera, &buffer);
}

For Application Developers

When to use VideoStream:

Need V4L2 camera capture with DmaBuf export for DMA operation
Require hardware H.264/H.265 encoding (NXP i.MX 8M Plus / i.MX 95 platforms)
Building standalone applications needing camera + codec APIs
Building multi-process vision pipelines with GStreamer
Need local inter-process frame sharing without network transport

When to use alternatives:

EdgeFirst Perception middleware messaging: For distributed perception services (uses Zenoh + pidfd_getfd)
GStreamer native elements: For standard video processing pipelines
Direct V4L2 API: For simple camera access without VideoStream abstractions

Documentation

📖 Architecture Guide - Threading model, data flow, internal design
🔧 API Reference - Complete C/C++ API documentation
🏗️ EdgeFirst Perception - Full perception middleware docs
📝 Contributing Guide - Development setup and guidelines
🔒 Security Policy - Vulnerability reporting and security practices

Contributing

We welcome contributions, especially for:

Additional camera driver support and V4L2 controls
Software encoding fallbacks (libx264, openh264)
Platform support (Raspberry Pi, NVIDIA Jetson)
Performance optimizations for 4K/8K capture and encoding

Get Started:

Check CONTRIBUTING.md for development setup
Check GitHub Issues for tasks
Review ARCHITECTURE.md to understand internals
Follow Code of Conduct

Support

Community Resources

📒 Documentation
💬 GitHub Discussions
🐛 Issue Tracker

EdgeFirst Ecosystem

EdgeFirst Perception - Complete perception middleware
edgefirst-camera - Camera service using VideoStream
EdgeFirst Studio - MLOps platform for edge AI deployment
EdgeFirst Modules - Maivin, Raivin hardware platforms

Commercial Support

Au-Zone Technologies offers professional services:

Camera driver development and V4L2 integration
Custom hardware encoding implementation
Performance optimization for specific platforms
Production support with SLAs

📧 Contact: support@au-zone.com

Security

Report vulnerabilities: support@au-zone.com (Subject: "Security Vulnerability - VideoStream")

See SECURITY.md for our security policy and responsible disclosure process.

License

Licensed under the Apache License, Version 2.0. See LICENSE for details.

Part of the Au-Zone EdgeFirst ecosystem | au-zone.com

Name		Name	Last commit message	Last commit date
Latest commit History 283 Commits
.config		.config
.github		.github
cmake		cmake
crates		crates
debian		debian
doc		doc
ext		ext
gst		gst
include		include
lib		lib
scripts		scripts
src		src
tests		tests
tools		tools
videostream		videostream
.clang-format		.clang-format
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.markdownlint.json		.markdownlint.json
ARCHITECTURE.md		ARCHITECTURE.md
CHANGELOG.md		CHANGELOG.md
CMakeLists.txt		CMakeLists.txt
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
HARDWARE.md		HARDWARE.md
LICENSE		LICENSE
Makefile		Makefile
NOTICE		NOTICE
README.md		README.md
SECURITY.md		SECURITY.md
TESTING.md		TESTING.md
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
release.toml		release.toml
requirements.txt		requirements.txt
sonar-project.properties		sonar-project.properties
sonar.py		sonar.py
videostream.pc.in		videostream.pc.in

Folders and files

Latest commit

History

Repository files navigation

EdgeFirst VideoStream Library

Overview

1. Video4Linux2 Camera Capture

2. Hardware H.264/H.265 Encoding

3. Inter-Process Frame Sharing

Features

Inter-Process Frame Sharing

V4L2 Camera Capture

Hardware H.264/H.265 Encoding

Platform Support

Quick Start

Installation

Prerequisites

Build from Source

Cross-Compile for ARM64 (NXP Yocto SDK)

Usage Examples

Example 1: VideoStream CLI (Recommended)

Example 2: V4L2 Camera Capture (C API)

Example 3: Hardware H.264 Encoding (C API)

Example 4: Inter-Process Frame Sharing with GStreamer

Example 5: Frame Sharing with Native C API

How VideoStream Fits into EdgeFirst Perception

EdgeFirst Perception Architecture

VideoStream's Role in EdgeFirst Perception

Architecture Highlights

1. Frame Sharing Architecture

2. V4L2 Camera Interface

3. Hardware Encoding Pipeline

Integration with EdgeFirst Perception

For edgefirst-camera Developers

For Application Developers

Documentation

Contributing

Support

Community Resources

EdgeFirst Ecosystem

Commercial Support

Security

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 19

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages