Sparse File Creation

For some reason, this one is hard to keep in my head.  Sometimes it is necessary to create a “large file” to act as a disk while monkeying with virtualization.  If I need, say, 10GB of disk space, do I really want to allocate all 10GB, or just what is needed – up to 10GB?  Sparse files let you do this.

$ pwd
/home/training
$ dd if=/dev/zero of=sparse-file bs=1 count=0 seek=10G
$ ls -al sparse-file
-rw-r--r-- 1 training admin 10737418240 Apr 11 08:00 sparse-file

Whoa, you say. It does take up 10 GB! Appearances can be deceiving, especially in tech work. To see how big a file really is, we’ll need to take a look at the size of the file, in blocks, not bytes. This is accomplished through the “-s” option.

$ ls -als sparse-file 
0 -rw-r--r-- 1 training admin 10737418240 Apr 11 08:00 sparse-file

Zero blocks. That’s better. Just to show that this is real, let’s switch to root and format the sparse file with an ext3 filesystem:

$ su -
Password:
# mke2fs -j $PWD/sparse-file
mke2fs 1.39 (29-May-2006)
/home/training/sparse-file is not a block special device.
Proceed anyway? (y,n) y
Filesystem label=
OS type: Linux
Block size=4096 (log=2)
Fragment size=4096 (log=2)
1310720 inodes, 2621440 blocks
131072 blocks (5.00%) reserved for the super user
First data block=0
Maximum filesystem blocks=2684354560
80 block groups
32768 blocks per group, 32768 fragments per group
16384 inodes per group
Superblock backups stored on blocks: 
        32768, 98304, 163840, 229376, 294912, 819200, 884736, 1605632

Writing inode tables: done                            
Creating journal (32768 blocks): done
Writing superblocks and filesystem accounting information: done

This filesystem will be automatically checked every 31 mounts or
180 days, whichever comes first.  Use tune2fs -c or -i to override.
# ls -als /home/training/sparse-file
299216 -rw-r--r-- 1 training-admin 10737418240 Apr 11 08:06 /home/lukeh/T/sparse-file

So now we’re up to almost 300MB of usage. Just for fun, let’s mount the file system and add 1 GB of kernel randomness, and check out usage one more time.

# mount -o loop /home/training/sparse-file /mnt
# time dd if=/dev/urandom of=/mnt/random.bin bs=1G count=1
1+0 records in
1+0 records out
1073741824 bytes (1.1 GB) copied, 205.06 seconds, 5.2 MB/s

real    3m25.359s
user    0m0.001s
sys     3m5.679s
# umount /mnt
# exit
$ ls -als sparse-file
1349792 -rw-r--r-- 1 training admin 10737418240 Apr 11 08:22 sparse-file

Well, I am a little embarrassed here – 3 minutes 25 seconds to create 1 GB of randomness from the kernel? It shows what kind of hardware I am using to author this.

Finally, throw the -h flag to ls to see “human readable” output:

$ ls -alsh sparse-file 
1.3G -rw-r--r-- 1 training admin 10G Apr 11 08:22 sparse-file

Cheers!

Comments are closed.