Hands on with Opendedup

      by Wyatt Walter

After reading about Opendedup on Slashdot this weekend, I decided to try it out to see how well it all really worked. My test server was an install of Ubuntu 9.10 x64. If you happen to be using that stack, the installation isn’t too difficult:

Download required files (adding links to the most recent versions of each, check for newer versions as necessary):
cd /usr/local/src
wget http://download.java.net/jdk7/binaries/
wget http://opendedup.googlecode.com/files/debian-fuse.tar.gz
wget http://opendedup.googlecode.com/files/sdfs-latest.tar.gz

And install:
chmod +x jdk-7-ea-bin-b87-linux-x64-25_mar_2010.bin
./jdk-7-ea-bin-b87-linux-x64-25_mar_2010.bin

(follow instructions – afterwards, but sure to set the JAVA_HOME variable)
export JAVA_HOME=/usr/local/src/jdk1.7.0
tar zxf debian-fuse.tar.gz
cd debian-fuse
dpkg --install *.deb

Next, just extract the SDFS packages and use:
tar zxf sdfs-latest.tar.gz
cd sdfs-bin

Now, we make our filesystem and mount it:
./mkfs.sdfs --volume-name=deduped --volume-capacity=5000MB
./mount.sdfs -m /srv -v deduped

Assuming all goes well, you should have a newly mounted deduped mount.

Great results from testing in the small
As a test, I copied over a sample song from my music collection (what nerd doesn’t enjoy a little Weird Al?). Copying to /root, the file size was 2.9MB. Once I copied it to my deduped /srv directory, the file size took just 46K on disk! Not too shabby. Just as a sanity check, I copied the file back off the deduped filesystem and the file size grew back to normal.

Things not all rosy in Opendedup-land
I decided to try throwing a little more data at it as a test and copied over the Documents directory from my desktop. The folder that I copied was slightly over 600MB of docs, text files, images, and a few other file types. During the file copy, Opendedup took a significant amount of memory (it hung around the 90% mark). My test machine was a small virtual machine (1 CPU, 2GB of RAM) and the file transfer slowed it down significantly. Eventually, I got curious as to how much had been transferred. I cd’d to the test dir and did an ‘ls’ which never completed and I could no longer open a new shell via SSH to the vm either. I’m sure this would be much better if I had the resources to throw a little more RAM and CPU at it (since I’m running the minimum), but I don’t have time the resources to try at the moment.

Conclusion
Overall, the technology seems really promising and pretty straightforward to use. If my compression rates hold true, this could dramatically cut down on the amount of disk space needed to store my backups and virtual machine templates. Judging by the performance I’ve seen thus far, I don’t think I’d want to run this in production, but it looks promising, nonetheless.

Tags: ,
Filed under Tech Trends : Comments (4) : Mar 30th, 2010

  • http://www.opendedup.org Sam Silverberg

    Wyatt,

    Thanks for taking a look at our software. I really appreciate the feedback and review. Deduplication can be memory and cpu intensive but we actively working on enhancing performance of the filesystem.

    Thanks again!

  • Paul Victor

    But I’ve seen the entire data go to /opt/sdfs folder and that folder growing as huge as the size of the copied files.Is this normal?

  • Alan V

    /opt/sdfs is the default folder. You’ll need to modify the config before the first mount to change that location. You can find this on the Quick install guide on OpenDedup’s site.

    Also, I wanted to correct the Java download path..

    wget http://www.java.net/download/jdk7/binaries/jdk-7-ea-bin-b96-linux-x64-03_jun_2010.bin

  • Matt

    I installed everything correctly but I can’t seem to figure out the commands to get a drive mounted with sfds. I have /dev/sde1 with a JFS file system. How do I get that whole drive up and running as SDFS?