Skip to content

roc-streaming/roc-pulse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PulseAudio modules for Roc Toolkit

Build GitHub release Matrix chat

What is this?

This repo provides a set of PulseAudio modules enabling it to use Roc Toolkit as a network transport and improve its service quality over an unreliable network such as Wi-Fi.

Advantages over Roc command-line tools:

  • Seamless integration into the PulseAudio workflow. The user can connect a local audio stream to a remote audio device using common PulseAudio tools like pavucontrol.

  • A bit lower latency. Since Roc is integrated into the PulseAudio server, there is no additional communication step between Roc and PulseAudio server.

Advantages over PulseAudio "native" protocol:

  • Better service quality when the latency is low and the network is unreliable. PulseAudio "native" protocol uses TCP, while Roc uses RTP, which is better suited for real-time communication than TCP-based protocols.

  • Compatibility with standard protocols. PulseAudio "native" protocol is PulseAudio-specific, while Roc implements a set of standardized RTP-based protocols.

Advantages over PulseAudio built-in RTP support:

  • Better service quality when the latency is low and the network is unreliable. PulseAudio uses bare RTP, while Roc also employs Forward Erasure Correction extensions.

Interoperability

These modules can communicate with all other Roc Toolkit senders and receivers:

Build instructions

Simple build

When using this method, CMake will automatically download and build dependencies (Roc Toolkit, PulseAudio, libtool). Roc will be statically linked into the modules and there is no need to install it into the system.

First install build tools and PulseAudio:

sudo apt install -y \
    gcc g++ \
    make \
    libtool intltool m4 autoconf automake \
    meson libsndfile-dev \
    cmake \
    scons \
    git \
    wget \
    python3 \
    pulseaudio

Then build PulseAudio modules:

make

And then install modules into the system:

sudo make install

Advanced build

You can disable automatic downloading of dependencies and build them manually.

First, download, build and install Roc Toolkit into the system as described on this page.

Then download and unpack PulseAudio source code from here. There is no need to configure and build it, only source code is needed.

Note: PulseAudio doesn't provide official API for out-of-tree modules. This is the reason why we need full PulseAudio source code to build modules.

Note: PulseAudio source code should have exactly the same version as the actual PulseAudio daemon which will be used to load modules.

Optionally, download and unpack libtool. As with PulseAudio, there is no need to build it. Alternatively, install it into the system using your package manager.

Then you can build and install modules:

mkdir build/x86_64-linux-gnu
cd build/x86_64-linux-gnu
cmake ../.. \
  -DDOWNLOAD_ROC=OFF \
  -DDOWNLOAD_PULSEAUDIO=OFF \
  -DDOWNLOAD_LIBTOOL=OFF \
  -DPULSEAUDIO_DIR=<...> \
  -DPULSEAUDIO_VERSION=<...>
make VERBOSE=1
sudo make install

Don't forget to specify correct PULSEAUDIO_VERSION!

If you've installed Roc Toolkit to non-standard directory, you can use -DROC_INCLUDE_DIR=<...> and -DROC_LIB_DIR=<...>. If you want to use custom libtool instead of the one from system, you can use -DLIBTOOL_DIR=<...>.

Cross-compilation

For simple cases, you can do everything automatically by specifying just two environment variables:

TOOLCHAIN_PREFIX=<...> PULSEAUDIO_VERSION=<...> make

For more granular configuration, you can invoke CMake directly:

mkdir build/<...>
cd build/<...>
cmake ../.. -DTOOLCHAIN_PREFIX=<...> -DPULSEAUDIO_VERSION=<...>
make VERBOSE=1

Commands above will cross-compile PulseAudio modules, as well as download and cross-compile their dependencies. Dependencies will be statically linked into modules. Modules will be installed into ./bin.

In these commands, TOOLCHAIN_PREFIX defines toolchain triple of the target system, e.g. aarch64-linux-gnu. In this case aarch64-linux-gnu-gcc and other tools should be available in PATH.

For more complicated cases, refer to standard instructions for cross-compiling using CMake.

Pre-built toolchains

You can use one of our docker images with prebuilt cross-compilation toolchains. The commands below will cross-compile PulseAudio modules and install them to ./bin.

Raspberry Pi (64-bit):

docker run -t --rm -u "${UID}" -v "${PWD}:${PWD}" -w "${PWD}" \
  rocstreaming/toolchain-aarch64-linux-gnu \
    env TOOLCHAIN_PREFIX=aarch64-linux-gnu PULSEAUDIO_VERSION=<...> make

Raspberry Pi 2 and later (32-bit):

docker run -t --rm -u "${UID}" -v "${PWD}:${PWD}" -w "${PWD}" \
  rocstreaming/toolchain-arm-linux-gnueabihf \
    env TOOLCHAIN_PREFIX=arm-linux-gnueabihf PULSEAUDIO_VERSION=<...> make

Raspberry Pi 1 and Zero (32-bit):

docker run -t --rm -u "${UID}" -v "${PWD}:${PWD}" -w "${PWD}" \
  rocstreaming/toolchain-arm-bcm2708hardfp-linux-gnueabi \
    env TOOLCHAIN_PREFIX=arm-bcm2708hardfp-linux-gnueabi PULSEAUDIO_VERSION=<...> make

Running receiver

For the receiving side, use module-roc-sink-input PulseAudio module. It creates a PulseAudio sink input that receives samples from Roc sender and passes them to the sink it is connected to. You can then connect it to any audio device.

Roc sink input supports several options:

option default description note
local_ip 0.0.0.0 local address to bind to
local_source_port 10001 local port for source (RTP) packets
local_repair_port 10002 local port for repair (FEC) packets
local_control_port 10003 local port for control (RTCP) packets
sink <default sink> the name of the sink to connect the new sink input to
sink_input_properties empty additional sink input properties
sink_input_rate 44100 local sink input sample rate
sink_input_format f32 local sink input sample format (f32)
sink_input_chans stereo local sink input channel layout (mono, stereo)
packet_encoding_id 10 encoding id for audio packets (any number, but same on sender and receiver) for custom network encoding
packet_encoding_rate 44100 sample rate for audio packets for custom network encoding
packet_encoding_format s16 sample format for audio packets (s16) for custom network encoding
packet_encoding_chans stereo channel layout for audio packets (mono, stereo) for custom network encoding
fec_encoding rs8m encoding for FEC packets (default, disable, rs8m, ldpc)
resampler_backend selected automatically resampler backend (default, builtin, speex, speexdec)
resampler_profile medium resampler profile (default, high, medium, low)
latency_backend selected automatically latency tuner backend (default, niq)
latency_profile selected automatically latency tuner profile (default, intact, responsive, gradual)
target_latency_msec 200 target latency in milliseconds
min_latency_msec selected automatically minimum latency in milliseconds
max_latency_msec selected automatically maximum latency in milliseconds
io_latency_msec 40 playback latency in milliseconds
no_play_timeout_msec selected automatically no playback timeout in milliseconds
choppy_play_timeout_msec selected automatically choppy playback timeout in milliseconds

Here is how you can create a Roc sink input from command line:

pactl load-module module-roc-sink-input

Alternatively, you can add this line to /etc/pulse/default.pa to create a Roc sink input automatically at PulseAudio start:

load-module module-roc-sink-input

You can then connect the Roc sink input to an audio device (i.e. a sink) via command line:

# determine Roc sink-input number
pactl list sink-inputs

# connect Roc sink-input to a sink
pactl move-sink-input <roc_sink_input_number> <sink>

Or via the pavucontrol graphical tool:

image

Running sender

For the sending side, use module-roc-sink PulseAudio module. It creates a PulseAudio sink that sends samples written to it to a preconfigured receiver address. You can then connect an audio stream of any running application to that sink, or make it the default sink.

Roc sink supports several options:

option default description note
remote_ip required parameter remote receiver address
remote_source_port 10001 remote receiver port for source (audio) packets
remote_repair_port 10002 remote receiver port for repair (FEC) packets
remote_control_port 10003 remote receiver port for control (RTCP) packets
sink_name roc_sender the name of the new sink
sink_properties empty additional sink properties
sink_rate 44100 local sink sample rate
sink_format f32 local sink sample format (f32)
sink_chans stereo local sink channel layout (mono, stereo)
packet_encoding_id 10 encoding id for audio packets (any number, but same on sender and receiver) for custom network encoding
packet_encoding_rate 44100 sample rate for audio packets for custom network encoding
packet_encoding_format s16 sample format for audio packets (s16) for custom network encoding
packet_encoding_chans stereo channel layout for audio packets (mono, stereo) for custom network encoding
packet_length_msec 5 audio packet length in milliseconds
fec_encoding rs8m encoding for FEC packets (default, disable, rs8m, ldpc)
fec_block_nbsrc 18 number of source packets in FEC block
fec_block_nbrpr 10 number of repair packets in FEC block
resampler_backend selected automatically resampler backend (default, builtin, speex, speexdec)
resampler_profile medium resampler profile (default, high, medium, low)
latency_backend disabled latency tuner backend (default, niq) for sender-side latency tuner
latency_profile disabled latency tuner profile (default, intact, responsive, gradual) for sender-side latency tuner
target_latency_msec disabled target latency in milliseconds for sender-side latency tuner
min_latency_msec disabled minimum latency in milliseconds for sender-side latency tuner
max_latency_msec disabled maximum latency in milliseconds for sender-side latency tuner

Here is how you can create a Roc sink from command line:

pactl load-module module-roc-sink remote_ip=<receiver_ip>

Alternatively, you can add this line to /etc/pulse/default.pa to create a Roc sink automatically at PulseAudio start:

load-module module-roc-sink remote_ip=<receiver_ip>

You can then connect an audio stream (i.e. a sink input) to the Roc sink via command line:

pactl move-sink-input <sink_input_number> roc_sender

Or via the pavucontrol graphical tool:

image

Advanced configuration

Custom local encoding

You can use these options to control how Roc sink or sink input presents itself to PulseAudio:

  • sink_rate, sink_format, sink_channels
  • sink_input_rate, sink_input_format, sink_input_channels

These options may be useful if you want to avoid automatic conversions (such as resampling and channel mapping) performed by PulseAudio when device and application encodings don't match.

Note that if sink/sink input encoding doesn't match packet encoding (see below), Roc will perform conversion by itself, so you probably want to configure that too.

Also note that Roc receiver usually performs resampling even when there is no sample rate mismatch, because it is used to adjust clock speed. Hence it makes sense to avoid resampling in PulseAudio and keep resampling only in Roc.

Custom packet encoding

By default, module-roc-sink-input and module-roc-sink code audio as 44100Hz with 16-bit stereo.

To employ alternative encoding, you need to provide the following options:

  • packet_encoding_id - arbitrary encoding identifier, any number in range [1; 255]
  • packet_encoding_rate - sampling rate (e.g. 44100)
  • packet_encoding_format - sample format (e.g. s16 for 16-bit signed integers)
  • packet_encoding_chans - channel layout (e.g. stereo)

All four parameters should be provided on both sender and receiver and have exact same values.

Custom FEC encoding

By default, module-roc-sink-input and module-roc-sink use Reed-Solomon (rs8m) FEC encoding for loss repair.

This can be configured via fec_encoding parameter. Available options are:

  • disable - disable FEC
  • rs8m - use Reed-Solomon
  • ldpc - use LDPC-Staircase

This parameter should be provided on both sender and receiver and have exact same value.

Configuring latency

Essential receiver-side (module-roc-sink-input) parameters are:

  • target_latency - which latency should be maintained
  • min_latency - which minimum latency causes session restart
  • max_latency - which maximum latency causes session restart

You can also force specific latency tuner settings (by default they're auto-selected based on target latency):

  • latency_backend:
    • niq - tune latency based on Network Incoming Queue size
  • latency_profile:
    • intact - disable latency tuner
    • responsive - quickly react to changes; good for low latency, but also requires low network jitter
    • gradual - smoothly react to changes; can handle high network jitter, but can't handle very low latency

Note: By default, latency tuner happens on receiver side. It is also possible (though rarely needed) to do it on sender side. For that reason, all parameters above are also available on sender, but are disabled by default. If you enabled them on sender, you likely want to disable them on receiver by using intact latency profile.

There are also sender-side (module-roc-sink) parameters that affect latency:

  • packet_length_msec - audio packet length
  • fec_block_nbsrc - number of audio packets in a FEC block (if FEC is enabled)
  • fec_block_nbrpr - number of redundancy packets in a FEC block (if FEC is enabled)

For lower latency, you may need lower packet length and FEC block size. And vice versa, for higher latency and network jitter, you may need to increase both packet length (for less overhead) and FEC block size (for better repair).

Configuring source or sink name

PulseAudio sinks and sink inputs have name and description. Name is usually used when the sink or sink input is referenced from command-line tools or configuration files, and description is shown in the GUI.

Sink name and description can be configured via sink_name module argument and device.description sink property set by sink_properties module argument:

pactl load-module module-roc-sink remote_ip=192.168.1.38 \
  sink_name=my_name sink_properties=device.description=My-Description

Sink input name and description can be configured via sink_input_name module argument and media.name sink input property set by sink_input_properties module argument:

pactl load-module module-roc-sink-input \
  sink_input_name=my_name sink_input_properties=media.name=My-Description

Troubleshooting

First, run PulseAudio server in verbose mode, both on sending and receiving sides:

pulseaudio -vvv

Among other things, you should find some messages from Roc sink and sink-input there, which may give some idea about what's going wrong.

Second, you can try to replace sender, receiver, or both with Roc command line tools to determine whether the issue is specific to PulseAudio modules or not.

Authors

See here.

License

PulseAudio modules are licensed under LGPL 2.1.

For details on Roc Toolkit licensing, see here.