Linux Audio Development: A Report from Karlsruhe

April 1st, 2003 by Dave Phillips in

Updates on the latest plans for Linux audio functionality from the first developers' conference in Germany.
Your rating: None

From March 14-16, the Zentrum für Kunst und Medientechnologie (ZKM) in Karlsruhe, Germany, hosted the first conference of Linux audio developers. Developers from more than a dozen countries attended this successful conference, representing organizations such as SuSE, Linux Audio Systems, Stanford University, IRCAM and Centro Tempo Reale. Topic discussions included in-depth presentations of the rapidly evolving Linux sound system, a look at the details of programming for professional audio standards and a survey of recent applications and audio-centric Linux distributions.

The first session took place on Friday evening, March 14, with two more sessions on Saturday and a final open session on Sunday, the 16th. All sessions were well-attended by developers and the public, and it is hoped that a second annual meeting will be planned for next year.

The New Linux Sound Architecture

Historically, the basic Linux sound system has been built on an API known most recently as OSS/Free. As of the 2.5 development series, the Linux kernel now officially deprecates the aging OSS/Free API and has replaced it with the Advanced Linux Sound Architecture (ALSA). ALSA provides a backwards-compatible layer for accommodating older applications designed for OSS/Free. But, its advanced API includes more interesting features, such as support for sound hardware from consumer-level sound cards to professional-grade digital audio boards, a fully modularized driver design, safe operation in SMP and threaded programming environments, and a user-space library (libasound) to simplify applications programming.

Conference participants included ALSA luminaries Jaroslav Kysela (founder and team leader), Takashi Iwai, Paul Davis, Frank van de Pol and Matthias Nagorni. Topic presentations included an in-depth analysis of the ALSA driver design, ALSA development in historical and technical perspective, an exposition of the JACK low-latency audio server/connector, details of the ALSA sequencer API and a demonstration of various ALSA-aware applications and utilities. As might be expected in this session, the level of discussion was quite technical at times, but all presentations were well-prepared and clearly delivered.

Some special mention must be made regarding JACK. This unique software provides a professional-grade audio server in a low-latency environment, making arbitrary audio signal routing possible, without dropouts or distortion. In a short period of time, JACK has been embraced enthusiastically by Linux audio applications developers, and the list of JACK-aware programs grows almost daily. JACK is not actually a part of ALSA, but its chief designer (Paul Davis) also has written some of ALSA's most significant drivers. Although it does not require ALSO specifically, JACK development closely follows ALSA's progress and, at this time, is best employed in an ALSA-based environment.

Takashi Iwai's presentation of the ALSA bake mix was another highlight, not soon to be forgotten by those who witnessed it. If his presentation was an indicator of new packaging trends, I believe we can expect some colorful innovations from Takashi and crew.

Programming the Ardour Hard-Disk Recorder and the LADSPA Effects Processors

Paul Davis's second presentation was titled "Some Things You Might Not Have Thought about When Writing an Audio Application", and it was a distillation of his experience programming the Ardour hard-disk recorder/digital audio workstation (DAW). Ardour has become a center of attention for those of us who wish to use Linux in a professional audio setting. As its development nears a first public release, anticipation and interest has become quite intense. Happily, Paul's presentation of the program's current status was most exciting, demonstrating features such as control automation, advanced editing procedures and effects processing using the LADSPA plugins. Ardour still has a way to go before reaching its 1.0 release, but it is already being tested in real-world scenarios with excellent results.

Ardour's development history has been of great importance to a new generation of Linux audio software programmers. As Paul points out, no road map exists for programming software like Ardour; that is, there is no existing code base for writing a professional-quality hard-disk recording system. Successful commercial DAW software manufacturers, such as Steinberg or Emagic, leave no open sources that could function as a guide to the construction of such a complex application. Thus, every important lesson learned by programmers in those commercial houses must be learned anew by the open-source developers. That Ardour has come so far and evolved so well is a testament to the talents and dedication of its programming crew. Thanks to Paul's efforts, along with those of many other Linux audio developers, the next generation of Linux audio software coders will find their way made easier and their labor less, well, arduous.

LADSPA stands for the Linux audio developers simple plugin architecture. The API was first proposed and designed by Richard W. E. Furse with the collaboration of other members of the Linux audio development community, but Steve Harris is probably the name most often associated with LADSPA plugins. Steve has authored the most popular collection of plugins and has consistently worked to improve not only his plugins but the LADSPA API as well. In his presentation at ZKM Steve demonstrated the ease with which LADSPA plugins can be written. He even managed to convince most of his audience that they too could become accomplished DSP programmers after only a few easy lessons, particularly if they employ Steve's XML-based plugin programming framework.

Applications and Distributions

The last session focused on the actual programs LAD people have written and new means of packaging and distribution. As the possible interactions between applications become more complex, some development efforts have taken on the task of ensuring that the new user finds a ready-made integrated environment that includes a system prepared for low-latency, high-priority performance. The environment needs a selection of applications that can take advantage of the new Linux sound system.

Francois Dechelle demonstrated recent free software developments taking place at IRCAM in Paris. The jMax software is already quite well-known and has evolved into one of the most sophisticated audio production and processing environments available for Linux. jMax can be thought of as a synthesizer, a sample playback machine, a DSP engine, an audio/video composition and processing environment or even a LADSPA plugin. Although Francois's demonstration was plagued by technical difficulties, it was enough to be able to glimpse jMax's enormous flexibility. Francois also presented news regarding the OpenMusic project, another free software development sponsored by IRCAM. OpenMusic is targeted for composers; that is, it is essentially music composition software, very advanced in its features and capabilities. Alas, the port to Linux is incomplete, but a dedicated team at IRCAM continues the work. We can expect wonderful things when we finally see OpenMusic 1.0 for Linux.

Andrea Glorioso reported on the efforts of the AGNULA team, of which I am proud to be a member. AGNULA stands for "a GNU/Linux audio" distribution, and it has been designed to provide turnkey systems for new Linux users who especially want to work with audio and video software. AGNULA plans to release two complete distributions, one based on Debian and another based on Red Hat. These distributions are complete systems, not crippled in any way, with enhancements such as a kernel patched for low-latency and a collection of entirely free software (free in the sense described by the FSF). A 0.9 release of the Debian distribution (Demudi) should be ready this summer.

The PlanetCCRMA suite is another response to the need for a system targeted at new users interested in the possibilities of Linux audio and video software. Developer Fernando Pablo Lopez-Lezcano described PlanetCCRMA's evolution in historical and technical perspective. Fernando also described the system's use of the apt-get utility so familiar to Debian users. With apt-get and a fast network connection, a user can download, install and update the entire PlanetCCRMA system over the Internet. Alternatively, she can download CD ISO images and install the system off-line. Unlike AGNULA, PlanetCCRMA is not an actual distribution. Instead, it depends on an existing Red Hat installation (7.x, 8.x) and replaces the default kernel with a kernel optimized for multimedia performance (low-latency, capabilities-enabled, high-priority scheduling). It also adds the ALSA drivers to your system and, of course, provides an excellent bundle of selected audio/video applications for Linux.

My own presentation was a rather rambling account of issues I've encountered while documenting Linux audio software. The lack of release dates, the constant evolution of the software described and the rapid pace of system development all conspire to make the doc writer's work difficult, particularly if he is trying to write tutorial documentation for normal users. Other documentation issues include standardized bug report and test forms, the need to distinguish between reference and instructional documentation and the predictable difficulty of writing simple and clear user-level introductions and tutorials.

The Wrap

The open session on Sunday was quite exciting; applications were demonstrated, code was viewed and shared, conversation and discussion thrived and many pictures were taken. Memorable moments included Torben Hohn's demonstration of the gAlan synthesis/processing network environment, Stefen Westerfeld's presentation of the BEAST synthesis/composition software, Fernando Lopez-Lezcano's composition for quad-speaker playback and Frank Barknecht's awesome demo of Pd-as-techno-machine.

Informal discussions took place throughout the conference, over breakfast and dinner, while sampling the excellent beer and in apartments and hotel rooms until the wee hours. I think I averaged about four hours of sleep per night, and by Monday morning I was exhausted and exhilarated. Nevertheless, on my flight back to the US I found myself wishing for another few days in Karlsruhe. I was told that the city and its environs include many lovely sights, but involvement with the conference was so intense that none of us got to be tourists. Perhaps next year's meeting will be a little longer, and we'll be able to see more of Karlsruhe, visit Heidelberg or take a bicycle ride along the Rhein and into the Schwarzwald.

Acknowledgements

First thanks must go to Frank Neumann, Matthias Nagorni and Götz Dipper. They managed a large and diverse group of developers arriving from various points at various times, ensuring everyone's comfort and well-being. For a first-time effort organizing a conference of this size, they all deserve the highest praise and thanks. The directorate and staff at ZKM also must receive great gratitude for their hospitality and financial assistance. The Zentrum is an awesome place to hold such a conference, and I recommend it to anyone visiting Karlsruhe. Thanks also to Jörn Nettingsmeier for managing the live internet feed and for helping in so many other ways.

Finally, vast thanks to the developers themselves who have done so much and given so freely. They are true heroes, persevering to create a unique phenomenon: powerful software freely available to musicians and sound people the world over, running on the world's most free and powerful operating system. What an amazing group, what remarkable accomplishments!

Resources

Pictures and MP3s from the conference are available on-line.

More information about ZKM may be found at www.zkm.de.

Information regarding the Linux Audio Developers group is at www.linuxdj.com/audio/lad/.

More general information about Linux sound and music software can be found at linux-sound.org.

Dave Phillips is a musician, teacher and writer living in Findlay, Ohio. He has been an active member of the Linux audio community since his first contact with Linux in 1995. He is the author of The Linux Book of Music & Sound.

__________________________


Special Magazine Offer -- 2 Free Trial Issues!
Receive 2 free trial issues of Linux Journal as well as instant online access to current and past issues. There's NO RISK and NO OBLIGATION to buy. CLICK HERE for offer

Linux Journal: delivering readers the advice and inspiration they need to get the most out of their Linux systems since 1994.

Sorry, offer available in the US only. International orders, click here.

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.

Re: Linux Audio Development: A Report from Karlsruhe

On April 4th, 2003 Anonymous says:

thats all good n a bag of beans. Now write a VST wrapper for the plugins and drivers... maybee then linux will be actually useful instead of lost in its own little crappy audio world like it is now.... Hell figure out a way to load windoze VST plugs via wine or sumptin.. Until then no one that produces will give a rats ass.

captchas suck. this implementation sucks more than most.

On March 17th, 2008 nonymous (not verified) says:

I had some valid points for both sides of the argument, but now that I am so irritated by the @*$&^#! captcha, I am just going to rant about that.

It took me like 10 tries to get the first one right. Now I have to enter another one to actually post?

An indication of whether it's case-sensitive would probably have eliminated the first 3-4 fails. Then I just got unreadable ones. Yes I can read, but the display is excessively ambiguous at 1600x1200 on my 21" CRT.

A dhtml function to just regenerate the image would be slightly less annoying than having to resubmit the whole page.

I have wasted valuable time which would have been better spent helping create/improve software for the benefit of all humanity, and I daresay that is a higher negative cost than suffering the theoretical spam it is intended to prevent. (Yes, I could have cut those losses by just bailing out immediately, but I believe crusading for usability is every software designer's duty.)

Please improve on this situation!

> Now write a VST wrapper

On September 7th, 2006 Anonymous (not verified) says:

> Now write a VST wrapper for the plugins and drivers...
> maybee then linux will be actually useful instead of lost
> in its own little crappy audio world like it is now

For those unable to parse this, let me translate it for you. It actually means:

"I am a thoroughly brainwashed Microsoftie unable to do anything without my decorative little WinXP desktop in the background, and so totally tied to my VST plugins that I can't even make a cup of coffee because my kettle has no VST interface. So, because you Linux lot cannot accomodate my myopic view of the world, you are all obviously irrelevant and unprofessional. Oh, and Leenucks sux and XP r0x0rs!!"

There, that should be a bit clearer.

Re: Linux Audio Development: A Report from Karlsruhe

On November 28th, 2003 Anonymous says:

Possibly windows-vst-plugins (and macintosh-plugins on PPC-linux) can be embedded to a linux-host-application natively. AFAIK, over 90% of all plugins are not linked against any operating system library. They came with own widgetsets and all using api functions come from the host application.

Not that I want a rat's ass, but FYI Kjetil Matheussen's VSTserver provides VST support for LADSPA plugins and Pd. It's very cool, I've experimented with up to a dozen or more VST plugins activated in a Pd patch. Btw, Kjetil did indeed utilize some WINE capabilities to achieve his software.

Also, Paul Davis recently experimented with JACK + VST. If all goes well I think we might expect VST support from JACK. More cool...

Regarding drivers: ALSA has evolved into a splendid audio system, it's doubtful we'd get much of a technical "win" by attempting ASIO support (if that's what you're talking about).

== dp

Jack supports ASIO under Linux, just start it with the -a option. It's just a way of defining the data size sent to the PCI card, not a way of getting low latency etc. Some cards support this transfer mode, some don't.

All these things, ASIO, VST, Direct Monitoring, Rewire etc etc are just brand names. They don't guarantee good performance or sound, just a particular standard and way of achieving a goal.
We would have VST native in Ardour now if the header files were re-distributable, the technical side is not hard. (Emulating windows gui is another matter...)

You can do the same things under Linux, but they are called JACK, ALSA, LADSPA etc etc.

you're no producer, just a kid who uses pirated copies of windows audio software he found on kazzaa and dreams of being the next drum&bass "star"... get off the crystal, stop snorting ketamine, take a bath and maybe we'll take you seriously.

Re: Linux Audio Development: A Report from Karlsruhe

On April 5th, 2003 Anonymous says:

Open your eyes and get informed. Most studios don't tell you which

operating system they use. Upon close inspection, you may find

out some interesting facts. Or the, maybe not. You see what you want to see.

Re: Linux Audio Development: A Report from Karlsruhe

On April 4th, 2003 Anonymous says:

I just wish and hope that the ALSA drivers will start working better than the OSS ones soon. I have tried them with three different sound boards, and had to revert to the OSS drivers on the grounds that they seem to behave better.

It's not MP3 just because its compressed audio

On April 4th, 2003 Anonymous says:

> Pictures and MP3s from the conference are available on-line.

Anyone else upset that about MP3 being the Kleenex of audio? Those are ogg/vorbis, not MP3.

Another oops. My bad, sorry about that.

== dp

Oops, I forgot to mention SuSE

On April 2nd, 2003 Anonymous says:

I should have mentioned that thanks also go to SuSE for their support. The general Linux community owes SuSE big thanks for directly supporting the ALSA project, it's the kind of project that benefits everyone.

== dp

yep.

On April 1st, 2003 Anonymous says:

Hey Dave.
great summary. That'll show my M$-loving friends how far we are and what is to expect.

Maybe Linuxjournal should always append a picture like
that one of yours to show what shiny happy people we are.

All I long for now is a Linux-Audio-User-and-Docu-hacking-Convention.
Thanks to Lukas, Francois, Julien and Joern for the little jamming you/we did! Had a great time!

Cheers, tobias.

Re: yep.

On April 3rd, 2003 Anonymous says:

yes:)

Re: Linux Audio Development: A Report from Karlsruhe

On April 1st, 2003 Anonymous says:

Wonderful summary, Dave, that captures especially well the nice spirit and friendly vibe surrounding the LAD conference. Maybe next year, I have enough Pd patches ready to host a whole dance night ;)

Regards,

--

Frank Barknecht

Post new comment

Please note that comments may not appear immediately, so there is no need to repost your comment.
The content of this field is kept private and will not be shown publicly.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <pre> <ul> <ol> <li> <dl> <dt> <dd> <i> <b>
  • Lines and paragraphs break automatically.

More information about formatting options

Featured Videos

The October 9, 2008 edition of Linux Journal Live! Associate Editor, Shawn Powers, and Kyle Rankin, "Hack and /" columnist and author of Knoppix Hacks, Linux Multimedia Hacks, Knoppix Pocket Reference and others, discuss Linux distributions.

The October 2, 2008 edition of Linux Journal Live! Associate Editor, Shawn Powers, and Steven Evatt, Online Development manager for The Houston Chronicle discuss surviving disaster with Linux.

From the Magazine

November 2008, #175

There aren't many numbers that put the US national debt to shame, but here's one: 1,100,000,000,000,000. What's that? That's how many floating-point operations per second the Roadrunner supercomputer at Las Alamos can perform. That's about 100 FLOPS per dollar of US debt (unfortunately, the debt is winning the second derivative race). Read the article about Roadrunner in this month's High Performance Computing issue of LJ.

Along with that, find out how to program the Cell processor and how to use CUDA with your NVIDIA GPU. Also in this issue: Mr HandS (aka Kyle Rankin) gives us a few tips on using Compiz, Chef Marcel shows you how to get blogging off your plate quicker, Mick Bauer talks about Samba security, Dan Sawyer interviews Cory Doctrow and Doc talks about how information technology can affect democracy and fix the national debt (just kidding about that last part). That and more for your reading pleasure in this month's Linux Journal.

Read this issue