Automatically ripping CDs on linux

Part 2 of building a music server from scratch. See here for part 1.

  1. This is a work in progress. Use at your own peril.

  2. To auto-rip the cds inserted into my headless music server I’m going to use abcde and cdparanoia. Install them with
    sudo apt-get install abcde lame eyed3 glyrc imagemagick cdparanoia1

  3. MusicBrainz is missing some dependencies so run sudo apt-get install libmusicbrainz-discid-perl libwebservice-musicbrainz-perl

  4. After running abcde-musicbrainz-tool from the command line I got an error about missing Mojo::DOM. Fix this with sudo cpan install Mojo::DOM

  5. We need to a config file for abcde. This is abcde.conf. There are multiple places that this could go (see the man page), but since I have a single user setup for now I am placing this in my users home directory /home/phil/.abcde.conf, but I could place it anywhere else and get the program to read it with the -c [filename] option. Placing it here means if I run abcde from the command line, this will be the default configuration.

  6. Edit the your .abcde.conf file to use the correct ripper and write to files to the correct place.

    CDROMREADERSYNTAX=cdparanoia
    CDPARANOIAOPTS="--never-skip=40"
    OUTPUTTYPE=flac
    #edit OUTPUTDIR. Its the location of the output flacs to write
    OUTPUTDIR=/home/phil/storage/music/flac/abcde
    #you might need to change CDROM
    CDROM=/dev/sr0
    EJECTCD=y
    CDDBMETHOD=musicbrainz
    GLYRC=glyrc
    CONVERT=convert
    ALBUMRARTFILE="cover.jpg"
    ALBUMARTTYPE="JPEG"
    # Make a local cache of cddb entries and then volunteer to use
    # these entries when and if they match the cd:
    CDDBCOPYLOCAL="y"
    CDDBLOCALDIR="$HOME/.cddb"
    CDDBLOCALRECURSIVE="y"
    CDDBUSELOCAL="y"
    FLAC=flac                                 # Path to FLAC encoder
    # Specify the encoder to use for FLAC. In this case
    # flac is the only choice.
    FLACENCODERSYNTAX=flac
    
    # Specify the path to the selected encoder. In most cases the encoder
    # should be in your $PATH as I illustrate below, otherwise you will
    # need to specify the full path. For example: /usr/bin/flac
    FLAC=flac
    
    # Specify your required encoding options here. Multiple options can
    # be selected as '--best --another-option' etc.
    # Overall bitrate is about 880 kbs/s with level 8.
    FLACOPTS='-s -e -V -8'
    # Give the location of the CD identification program:
    CDDISCID=cd-discid
    # The default actions that abcde will take.
    ACTIONS=cddb,playlist,read,getalbumart,encode,tag,move,clean
    
    # If NOSUBMIT is set to y, then abcde will never prompt asking if you
    # wish to submit your edited cddb file.
    NOSUBMIT=n
    # Decide here how you want the tracks labelled for a standard 'single-artist',
    # multi-track encode and also for a multi-track, 'various-artist' encode:
    OUTPUTFORMAT='${OUTPUT}/${ARTISTFILE}-${ALBUMFILE}/${TRACKNUM}.${TRACKFILE}'
    VAOUTPUTFORMAT='${OUTPUT}/Various-${ALBUMFILE}/${TRACKNUM}.${ARTISTFILE}-${TRACKFILE}'
    # This function takes out dots preceding the album name, and removes a grab
    # bag of illegal characters. It allows spaces, if you do not wish spaces add
    # in -e 's/ /_/g' after the first sed command.
    mungefilename ()
    {
    
        CDDISKID=$(cd-discid "$CDROM")
        FIRSTPART=$(echo $CDDISKID | cut -d' ' -f1)
        echo $FIRSTPART "$@" | /home/phil/dev/munger.py
    }
    
    MAXPROCS=2                                # Run a few encoders simultaneously
    PADTRACKS=y                               # Makes tracks 01 02 not 1 2
    EXTRAVERBOSE=2                            # Useful for debugging
    INTERACTIVE=n
    post_read()
    {
    
    }
    #ok, now we need to set the correct ownership for the flac files.
    post_encode ()
    {
        ARTISTFILE="$(mungefilename "$TRACKARTIST")"
        ALBUMFILE="$(mungefilename "$DALBUM")"
    
        if [ "$VARIOUSARTISTS" = "y" ] ; then
            FINDPATH="$(eval echo "$VAOUTPUTFORMAT")"
        else
            FINDPATH="$(eval echo "$OUTPUTFORMAT")"
        fi
    
    FINALDIR="$(dirname "$OUTPUTDIR/$FINDPATH")"
    C_CMD=(chown -R phil:lms "$FINALDIR")
    #echo "${C_CMD[@]}" >>tmp2.log
    "${C_CMD[@]}"
    }
    

    munger.py strips any odd unicode out of the string to make the filename safe. It can be donwnloaded from here
    Most of the above is straight forwards abcde configuration. It uses cdparanoia to rip the cd, musicbrainz to find the track information and then it uses flac encoding. The mungefilename() function removes any unwanted characters from the file name and also appends the discID number onto the directory name for any unknown album – otherwise abcde will overwrite it. The post_encode() function changes the ownership of the files to phil:lms. You will need to edit this. TODO: Currently this is placing all the temp files in / which might be a problem.

  7. The ownship of the directory /home/phil/storage/music/flac/abcde and /home/phil/storage/music/flac/abcde/flac will need correcting with sudo chmod -R /home/phil/storage/music/flac/abcde

  8. Test the configuration by inserting a CD and ripping it from the command line. abcde -c /home/phil/.abcde.conf

  9. Ok, so how to get it to autorip when a CD is inserted. For this the udev is needed which is the Linux device manager. We need to get to trigger a command when it detects a CD is inserted. However, udev does not allow long running programmes to be run. If you try to call abcde directly you will find udev kills the process before its finished ripping. To get around this we create a new service and get udev to start the service.

  10. Create a new file /etc/systemd/system/ripper.service with the following

    [Unit]
    Description=Auto CD ripper
    
    [Service]
    Type=oneshot
    ExecStart=/home/phil/dev/rip.sh
    
    [Install]
    WantedBy=multi-user.target
    
  11. And create the /home/phil/dev/rip.sh file which gets run

    #!/bin/bash
    LOCKDIR=rip.lock
    LOGFILE=/home/phil/dev/log.txt
    
    function cleanup {
    
     if rmdir $LOCKDIR; then
        echo "Finished $(date)" >> $LOGFILE
    
    else
        echo "Failed to remove lock dir '$LOCKDIR'" >> $LOGFILE
        exit 1
    
    fi
    
    }
    
    if mkdir $LOCKDIR; then
        trap "cleanup" EXIT
        echo "Acquired Lock, running" >> $LOGFILE
        echo "Started $(date) $$" >> $LOGFILE
        abcde -c /home/phil/.abcde.conf
        #trigger LMS rescan
        wget -q --spider http://localhost:9000/settings/index.html?p0=rescan
        echo "Stopped $(date) $$" >> $LOGFILE
        # optional notifications, uncomment if needed
        # echo "Sending IFFT webhook:">>$LOGFILE
        # /home/phil/dev/ifttt_send.sh "Ripper done. Please eject CD" >> $LOGFILE
        # send the logfile by email, you need to configure mail  first for this to work
        # cat $LOGFILE | mail -s "Ripper log" your_email@example.com
    else
        echo "Could not create lock, already running? '$LOCKDIR'"
        exit 1
    
    fi
    

    This file basically only allows abcde to run as a single instance. It needs to be made executable with chmod u+x /home/phil/dev/rip.sh
    The wget command triggers the LMS rescan which looks for new media. The LOGFILE is really only useful to see if the script is working, you might want to set it to /dev/null eventually.
    The ifttt_send.sh used IFTT to send a notification that the process has finished to my phone. You need an IFTT account. I created a Maker Event called ripper_done that connect to a rich text notification. The ifttt_send.sh contains:

    #!/bin/bash
    source /home/phil/dev/ifttt_key.sh
    CMDSTRING=("curl -H \"Content-Type: application/json\" -X POST -d '{\"value1\":\" $1\"}' https://maker.ifttt.com/trigger/ripper_done/with/key/$IFTTT_KEY")
    eval $CMDSTRING
    

    ifttt_key.sh defines IFTTT_KEY which you need. It a single line export IFTTT_KEY="YOUR KEY VALUE"

  12. Start the service

    sudo systemctl daemon-reload
    sudo systemctl enable ripper.service
    

    Now if you place a CD in the drive you should be able to rip it by running sudo systemctl start ripper. The status of the ripping can be obtained with systemctl status ripper

  13. We now make a udev rule to trigger the service start when a cd is inserted. Create a file /etc/udev/rules.d/99-cd-audio-rip.rules with the following

    SUBSYSTEM=="block",KERNEL=="sr0",ACTION=="change",RUN+="/bin/systemctl start ripper"
    

    sr0 is the name of my CD-ROM drive.
    Now add the rule with sudo udevadm control --reload

  14. Now when you insert a CD it should autorip. You can always test the progress of the ripping with systemctl status ripper

1: My apt-get version of abcde was 2.9 which has a bug and crashes if Musicbrainz returns an unknown disc. To fix it we need to update it. Remove abcde with sudo apt-get remove abcde, download the latest version from here 2.9.2 at the time of writing. Untar it, enter the directory and run sudo make install. Since we removed the managed version of abcde the dependencies will be marked for removal. To stop this install them sudo apt-get install vorbis-tools cd-discid fdkaac. You might want to check for other dependencies if you have a problems with apt-cache depends abcde.

view raw

cd_ripper.md

hosted with ❤ by GitHub

Setting up a home music server/player on linux

For years I’ve been using Vortexbox as my music server at home. It’s basically a headless linux server configured with Logitech Music Server, an automatic CD ripper, a software player (for local sound playback) and a reasonable web interface. Vortexbox however hasn’t been updated for years. The maintainer is promising an update but I didn’t want to wait and instead set out to build my own. This is running on a HP micro-server (1.2GHz Atom) and I have Naim DAC plugged into it via USB for music.

It should be possible for anyone to install this but it is complicated and I’m not sure I’d recomend this to anyone with either no Linux experience or no desire to learn Linux.

  1. Install Lubunutu – This version comes with a GUI. I could have used one of the server versions, but the GUI is handy for the intial setup. My thought process on the distro didn’t going beyond this! I’m not going to go into detail here on the install becuase there plenty of tutorials online.

  2. Setup a user, my is phil

  3. We will also need ssh to remote login sudo apt-get install openssh-server. The rest of the set up can be done locally or now remotely via SSH.

  4. My server has 3 hard drives. One contains the OS, one is the music, one is for backup. The music files are going to be stored in the user home space rather than the root. So I edited fstab to mount my music drive in /home/phil/storage/ . Your setup may well be different. My music is held in /home/phil/storage/music/flac

  5. Install LMS (Logitech Media Server). It has a dependancy so run sudo apt-get install libio-socket-ssl-perl to install it. Download the latest version of LMS from http://www.mysqueezebox.com/. Once downloaded sudo dpkg -i logitechmediaserver_7.9.2~1530185314_amd64.deb. You will need to change the name of the .deb file for the version you downloaded. Then run sudo apt-get -f install and start the service sudo service logitechmediaserver start. You should then configure the users and group settings.

    sudo usermod -aG audio squeezeboxserver
    sudo usermod -aG lms squeezeboxserver
    sudo usermod -aG audio phil
    
  6. Create a Playlist folder somewhere and set the file group ownership to lms with sudo chown phil:lms -R Playlistand maybe sudo chmod -R 744 Playlist. The music files also need to belong to the lms group so sudo chown -R phil:lms /home/phil/storage/music/flac

  7. Open a web browser and the address will be http://[ip address of your machine]:9000. Configure LMS so that it knows the location of the music and playlist folders. Rescan the LMS music library. LMS should now be running. It’s probably not a bad idea to fix the IP address of your machine. You can normally do this with your routers web interface, or by setting the network configuration of your server (the router method is normally the easiest.)

  8. To add music to our folder over the home network we need samba to share a folder. sudo apt-get install samba, and create a samba user sudo smbpasswd -a phil. Edit the samba config sudo vi /etc/samba/smb.conf and add to the end

    [music]
    path = /home/phil/storage/music
    valid users = phil
    read only = no
    

    Restart the service with sudo service smbd restart and test the smb.conf config is valid by running testparm

  9. Install squeezelite player via the systems package manager(sudo apt-get intsall squeezelite). [Update: the repo version is rather old and seems unstable on my system. I recommend instead building from the source code. see here ].Now run squeezelite -l which should list the available audio devices. Running squeezelite from the console will start the program. If you are like me running squeezelite from the console works – it should appear as a player under the LMS webpage. ^C kills the player.

    The problem here is I would need to log in and run the command every time. It’s far better to run this as a service that starts automatically on boot. The service should be installed via the package mangager but the default configuration probably does not work. I think this is becuase the default sound card option is via pulse audio and this cannot be run as root. In the output of squeezelite -l there is (hopefully) one option starting with hw:, in my case it is hw:CARD=Audiophilleocom,DEV=0. This is the direct hardware option and it is the one we want. In /etc/default/squeezelite add the line SL_SOUNDCARD="hw:CARD=Audiophilleocom,DEV=0" or whatever your equivalent is. Restart the service with sudo service squeezelite restart and hopefully it is all working. 1

  10. Since the server is headless we don’t need to waste resources running the GUI. Switch to console login via sudo systemctl set-default multi-user.target

  11. Vortexbox automatically ripped CDs when inserted. See here for my version that uses abcde.

  12. Since I don’t log in to the machine very often via SSH, the updates can get very far behind. I’ve therefore enabled auto-update. Install with sudo apt-get install unattended-upgrades -y and sudo apt-get install update-notifier-common -y. You can then edit /etc/apt/apt.conf.d/50unattended-upgrades as you see fit. I’d recommend enabling the email notifications.

  13. On site and off site backups

1: I have had a few problems with Squeezelite crashing and I am currenlty looking into this. I suspect the pulseaudio might be the problem and it might need to be disabled. Since we aren’t using the desktop, this should be ok. Do not uninstall pulseaudio as some forums suggest. It can be disabled gobally by editting /etc/pulse/client.conf and changing the line ; autospawn = on to autospawn = off. Note the removal of the ;. My system now seems stable, however, its hard to say if it was pulseaudio or rebuilding the squeezelite from source that solved the problem. If you still have problems, it might be an idea to increase the ASLA buffersize

 

Piping numpy arrays to other processes in python

To pipe data from one process to another as a stream in python we need to pickle the object and pass it to the pipe stream.
In this example I’ve used Numpy arrays but this could be applied to any object that can be pickled in Python.
This took far too long to get working and I could find little information online on how to put it all together so here it is.
This code is Python 3 only, I’ve only run this on a Mac.

I’ve used binary as the stream rather than text purley becuase of effiencies. Numpy arrays can get huge! This means readline() is not
going to work. Instead, I send a single control byte , 1, for data and 0 for stop. This could be extended to include other control operations.
I then send the length of the data as a 8 byte int, followed by the data itself.

simpleSend.py

import numpy as np
import pickle
import sys
import io
import time

#define some control bytes
control_data=bytes([1])
control_stop=bytes([0])

def send_data(arr):
    dataStr=pickle.dumps(arr)  #pickle the data array into a byte array
    dlen=len(dataStr).to_bytes(8, byteorder='big') #find the length of the array and
    print(control_data.decode('latin-1'),end='',flush=True)  #convert this to a byte array
    print(dlen.decode('latin-1'), end='', flush=True)   #encode the data and write it
    print(dataStr.decode('latin-1'), end='', flush=True)  # end='' will remove that extra \r\n

def send_stop():
    print(control_stop.decode('latin-1'), end='', flush=True) 

#set the stdout such that it prints in latin-1,   sys.stdout.detach() is a binary stream
sys.stdout = io.TextIOWrapper(sys.stdout.detach(), encoding='latin-1')

for p in range(10):
    arr=np.ones((5000,500))*p  #generate some data
    send_data(arr)
    #the sleep is purely for testing and can be removed, ie does the reader fall over after a long delay
    time.sleep(.1)
send_stop()        

simpleReceiver.py

import numpy as np
import sys
import pickle

#define some control bytes
control_data=bytes([1])
control_stop=bytes([0])

while True:
    data=sys.stdin.buffer.read(1)   #read the control byte
    if data==control_data:
        data=sys.stdin.buffer.read(8)  #read the data length
        dlen=int.from_bytes(data, byteorder='big')
        print('data lenght %d'%dlen)        
        data=sys.stdin.buffer.read(dlen) #read the data        
        npd=pickle.loads(data)  #unpickle
        print(npd.shape)
        print(npd.max())
    elif data==control_stop:
        print('stopped')
        break
    else:
        print('Oh no')

to run this
python simpleSend.py | python simpleReceiver.py

If we want to use Python’s subprocess module to start simpleReceiver.py we basically need to write to the STDIN instead of print

import numpy as np
import pickle
import sys
import subprocess as sp

#define some control bytes
control_data=bytes([1])
control_stop=bytes([0])

def send_data(arr,buff):
    dataStr=pickle.dumps(arr)  #pickle the data array into a byte array
    dlen=len(dataStr).to_bytes(8, byteorder='big') #find the length of the array and
    mp.stdin.write(control_data)
    mp.stdin.write(dlen)
    mp.stdin.write(dataStr)
    mp.stdin.flush() #not sure this needed
     
def send_stop(mp):
    mp.stdin.write(control_stop)
    mp.stdin.flush()
     
try:
    mp = sp.Popen("python3 simpleReceiver.py",  shell = True,stdin=sp.PIPE)   
except sp.CalledProcessError as err:
    print('ERROR:', err)
    sys.exit(-1)

for p in range(10):
    arr=np.ones((5000,5000))*p  #generate some data
    send_data(arr,mp)
send_stop(mp)        

With such a large array 5000×5000 this takes sometime. Running it through the python profiler indicates about 75% of the time is taken by pickle.dumps and most of the rest of the remaining 25% is taken by the write operation. Numpy’s own method gives a speed increase. Replacing dataStr=pickle.dumps(arr) with dataStr=arr.tobytes() and npd=pickle.loads(data) with npd=np.frombuffer(data) more than halves the time taken but lose the shape and dtype information. This would have to be sent along with the data.

view raw

numpyPipe.md

hosted with ❤ by GitHub

Finding webcams

List all devices on windows using ffmpeg

ffmpeg -list_devices true -f dshow -i dummy

and on Linux

v4l2-ctl –list-devices

To get the device capabilites

ffmpeg -f dshow -list_options true -i video="Mobius"

where “Mobius” is the name of the camera.

On the Mac use

ffmpeg -f avfoundation -list_devices true -i ""

How to write lossless video in Python

OpenCV does a reasonable job of reading videos from file or webcams. It’s simple and mostly works. When it comes to writing videos,
it however leaves a lot to be desired. There is little control over the codecs and it is almost impossible to know which codecs are installed. It also wants to know things like the frame size at intailisation. This isn’t always a problem, but if you don’t know it yet it means you have to set up the video writer inside your main processing loop.

To make something as cross-platform compatible as possible it would be nice to use FFmpeg. There are a few python wrappers around, but as far as I can tell they are mainly used for transcoding type applications. One solution is run FFmpeg as a subprocess and set its input to accept a pipe. Then every video frame is passed through the pipe. You write this yourself, in fact it’s only a few lines of code. However, the scikit-video package will do this for us, with some nice boilerplate to make life easier.

The steps are:

  1. install FFmpeg — if you running on Linux use your system’s package manager if it’s not already installed. If you’re unlucky enough to be using Windows you need to download the zip file from here, and add the bin directory to your system’s path.
  2. install scikit-video –I tried installing scikit-video via pip on my Anaconda distro but the version was too old. Instead, I cloned the github version and installed that. Instructions are provided on github.

Below is a simple example the grabs from your webcam and records lossless video.

#test recording of video
import cv2
import skvideo.io


capture=cv2.VideoCapture(0) #open the default webcam
outputfile = "test.mp4"   #our output filename
writer = skvideo.io.FFmpegWriter(outputfile, outputdict={
  '-vcodec': 'libx264',  #use the h.264 codec
  '-crf': '0',           #set the constant rate factor to 0, which is lossless
  '-preset':'veryslow'   #the slower the better compression, in princple, try 
                         #other options see https://trac.ffmpeg.org/wiki/Encode/H.264
}) 
while True:
    ret,frame=capture.read()
    if ret==False:
        print("Bad frame")
        break
    cv2.imshow('display',frame)
    writer.writeFrame(frame[:,:,::-1])  #write the frame as RGB not BGR
    ret=cv2.waitKey(10)
    if ret==27: #esc
        break

writer.close() #close the writer
capture.release()
cv2.destroyAllWindows()