After observing PLeX streams churning up some CPU time, I decided to get a GPU to offload transcoding tasks. I went with a GTX 1050 Ti 4GB. This is more or less the same chip as the Quadro P2000, which is the current ~budget~, uh, low-powered darling. With a sneaky workaround, the artificial two-transcode limit is easily circumvented. For 5 times less than a P2000, I’ll take that deal!
To pass your GPU through to your PLeX docker, there are preparatory steps needed.
From Community Applications install unRAID-nVidia. Go to Settings → unRAID-nVidia, Select the nVidia build for your version of unRAID, and install.
su
cd /boot
wget https://raw.githubusercontent.com/keylase/nvidia-patch/master/patch.sh
chmod +x patch.sh
mv patch.sh nvidia-patch.sh
cat /boot/nvidia-patch.sh >> config.go
Installing the card is straightforward. Remove the PCIe riser, insert the card in the 16x wide slot, reinsert, reboot.
Do note that the 16x slot is 8x electrical; but this does not matter for our purposes. My particular card did not need an extra GPU power cable. If yours does, you need the 10 pin to GPU power adapter from HP, or get this one from moddiy.com
…but first, go to Settings → unRAID-nVidia; and copy your GPU GUID somewhere convenient.
In the unRAID web UI navigate to Docker, and reconfigure the PLeX docker.
Switch to advanced view, and under “Extra Parameters” add --runtime=nvidia
.
Under NVIDIA_VISIBLE_DEVICES add that GUID.
Save, restarting the PLeX docker.
In the PLeX webui, go to settings for your server. Under “Transcoding”, select “Use hardware transcoding when available”
And there you go!
Remember the fan control rain dance from the last entry in this series? And HPE’s agressive stance towards fan control? Well, HPE’s not gonna let you forget. After installing the GPU, my fans were running at an… excessive > 60 %. And my previous fan hack - just setting every fan baseline to 1 - didn’t work anymore. This reddit post could point me in the right direction, though.
This process is also a little involved, so buckle up.
There’s a bug in the fan-control hacked firmware that makes it not display command outputs in SSH sessions beyond the first after a reset. And this output is important for the next step.
iLO is, as my son so eloquently put it, crying “stranger danger” on account of not recognizing the GPU.
This can be illustrated by SSHing into iLO, and running the command fan info g
.
A nice table like the following should be presented:
GROUPINGS
0: FASTEST Output: 63 [02*07 ...
1: FASTEST Output: 63 [02*07 ...
2: FASTEST Output: 35 [01 02*...
3: FASTEST Output: 36 [01 02 ...
4: FASTEST Output: 60 [01 03 ...
5: FASTEST Output: 60 [01 05 ...
(Example borrowed from the linked Reddit post, since I forgot to save my actual output)
Note that some numbers are marked with an *
.
This indicates that that is the sensor iLO is reading as the hottest - in my case, sensor 52.
To quiet down just that sensor, run fan pid 52 hi 300
or some other low number.
And enjoy immediate relief, as your fans settle down somewhere around 10-15 %.
Quick testing yielded 2 4K → 1080p transcodes, at ~1500 megabytes of GPU ram each; alongside one 1080p → 720p transcode. Realistically, I wont have much more than one 4K transcode at any given moment, if at all. Very nearly 0 CPU usage though, which was nice.
Feels good when a plan comes together.