5 Ideas for Optimizing Audio File Sizes


In some unspecified time in the future or one other, everybody ought to find out how audio information work. This data could seem trivial or unimportant, however it will probably turn out to be useful when recording music, making a podcast or optimizing your music library.


This publish will discover the varied elements affecting audio high quality and audio file measurement. Hanging the right steadiness between the 2 is not simple, however you need to know sufficient to really feel snug and experiment for your self by the top.

Word: To place this data into apply, you will need to seize a free audio editor like Audacity or any options. Studying these instruments is past the scope of this piece.


1. Pattern Charge

In actual life, sound is a wave. When somebody speaks or claps their palms, what you are really listening to is a change in strain that travels by the air and finally hits your eardrums.

However how will we seize that sound and convert it into digital information? We will not simply report the complete sound wave as it’s; as a substitute, we’ve got to take periodic “snapshots” of the sound over time. Once you play all of it again in sequence, you get an approximate recreation of the unique sound.

Every snapshot is known as a pattern, and the interval used between every snapshot is known as the pattern price. To outline them, it’s the variety of digital snapshots taken per second in an audio file by an analog to digital converter. The sampling price is measured in Hertz, so it may be expressed as a frequency.

The shorter the interval, the quicker the frequency. Sooner frequencies produce extra correct recordings but in addition require extra information to retailer every second of recorded sound.

For instance, CD-quality audio makes use of a pattern frequency of 44.1 kHz (or 44,100 samples per second), whereas TV and DVD high quality audio makes use of a pattern frequency of 48 kHz. Given a 10-minute uncompressed mono audio recording, the previous is likely to be 51.7 MB whereas the latter could be 56.3 MB.

You’ll be able to drop to 32 kHz for speech-only recordings and never expertise a lot loss in high quality, however stick with 44.1 kHz if music is concerned or when you want utmost high quality. Dropping to 22.05 kHz will sound nearer to AM radio.

2. Bitrate

Bitrate just isn’t the identical factor as pattern price. Lots of people are likely to conflate the 2, nevertheless it’s essential that you do not. To begin with, if the pattern price is how typically the snapshots of sound are taken, then bit depth is how a lot information is recorded throughout every snapshot.

As an instance, think about a sound wave as a stream of water, and also you’re attempting to seize (i.e., report) that water with a bucket. Pattern price could be how typically you dip your bucket into the stream, whereas bit depth could be the scale of your bucket. The measurement for bit depth is bits. For every one bit improve, the accuracy of the recording doubles.

The upper the bit depth, the extra information is captured per pattern. This results in a extra correct recording on the expense of extra space required to retailer that information.

However when you cut back the bit depth an excessive amount of, sound information will get misplaced. Audio CDs use 16 bits per pattern, whereas DVD and Blu-ray discs use 24 bits for every pattern.

Bitrate is how a lot precise sound information is processed (expressed in kilobits per second). To get the bitrate, you multiply the pattern price by the bit depth. A CD audio file with a 44.1 kHz pattern price and a 16-bit depth would have an uncompressed bitrate of 44100*16, i.e., 705.6 kbps.

To present you an thought of the distinction in file measurement, let’s contemplate a five-minute uncompressed track recorded in a two-channel stereo audio

  1. 44.1kHz/16-bit: 44100*16*2 = 1411200 bits per second (1.4 Mbps)
  2. 192kHz/24-bit: 192000*24*2 = 9216000 bits per second (9.2Mbps)

Utilizing the bitrate calculated, multiply it by the size of the recording

  1. 1.4*300 = 420Mb or 52.5 MB
  2. 9.2*300 = 2760Mb or 345 MB

So audio recorded in 192kHz/24-bit will take six instances extra space, nevertheless it all boils all the way down to what you need to do with the audio recording. Typically the complete bitrate is not wanted in a given snapshot, similar to when there’s silence.

In that case, you should utilize variable bitrate (VBR) supported by MP3, OGG, AAC, and WMA. Prior to now, VBR wasn’t extensively supported, however these days is not a lot of a problem.

3. Stereo vs. Mono

This level is fairly easy, so I will maintain it temporary. Mono means one channel, whereas Stereo means two channels. The 2 channels in a stereo audio file will be known as the “left” and “proper” channels.

With a pair of headphones, you’ll hear one of many stereo channels in a single ear and the opposite stereo channel within the different ear. When listening to a mono audio file, you will hear the identical actual channel in each ears.

In a way, stereo audio information are basically two mono audio information in a single, which signifies that a stereo audio file is all the time twice as huge as a mono audio file, assuming the pattern price, bit depth, supply sound, and so on. are the identical between the 2. So the best method to immediately minimize an audio file measurement in half is to transform it from stereo to mono.

For voice-only recordings, mono is nearly all the time most popular because it makes the sound highly effective, clear, and upfront. However if you wish to report two or extra vocalists in a room with distinctive acoustics, the vocals ought to be stereo.

Equally, podcast recording will be mono as effectively. Nevertheless, in music recordings, a stereo is what makes numerous music sound extra three-dimensional, as if the music is taking part in round you slightly than at you (i.e., mono sounds flatter).

4. Compression

Should you’re working with WAV information, the one method to cut back file measurement is by tinkering with one of many above settings (pattern price, bit depth, or variety of channels). For all the things else, compression is the largest think about audio file measurement. There are two sorts of compression:

  • Lossy compression removes “pointless” information from the audio, similar to sounds which might be past the listening to vary of most individuals. As soon as compressed, this discarded information cannot be recovered.
  • Lossless compression takes an audio file and packs it down as a lot as attainable utilizing mathematical algorithms. Nevertheless, it have to be decompressed on the time of playback, which requires extra processing energy. No precise information is misplaced.

The compression mode you need to use relies upon upon the supposed use of the audio file. Usually, you need to go together with lossless compression whenever you need to retailer a virtually good copy of the supply materials and lossy compression when the imperfect copy is sweet sufficient for day-to-day utilization.

For instance, you may need to protect your ripped CD assortment in FLAC (if cupboard space is not a problem) and use MP3 to retailer them on the telephone. If you do not know a lot about compression, here is our full information on how file compression works and an inventory of instruments to compress massive audio information successfully.

5. File Format

As soon as you’ve got determined to go together with lossy compression, it’s a must to determine which file format is greatest for you. As of this writing, the three hottest choices are MP3, OGG, and AAC. To know extra, learn our information on the comparability of varied audio file codecs.

MP3 is the most well-liked by far, primarily as a result of it was the primary of the three to reach on the scene. AAC is technically higher than MP3 however would not have the identical utilization price. OGG is sweet too, however not many units help it, so keep on with MP3 or AAC.

No matter which one you employ, you will find yourself compressing to a goal bitrate. If we assume you are going to use the MP3 format, then these are the 5 commonest bitrates at the moment used:

  • 64 kbps is AM radio high quality. Excellent for talk-only podcasts as a result of voices aren’t as complicated as music.
  • 96 kbps is FM radio high quality. Music will sound nice, however you’ll inform that it is not full-bodied, primarily as a result of sure hearable frequencies had been eliminated.
  • 128 kbps is CD audio high quality. That is as normal because it will get. Music sounds “ok” for most folk at this bitrate.
  • 256 kbps is excessive audio high quality. You could discover sure sounds and devices that weren’t detectable at decrease bitrates.
  • 320 kbps is the perfect audio high quality. You’ll be able to go greater, however you most likely will not have the ability to inform the distinction, even when you contemplate your self to be an audiophile.

When it comes to file measurement discount, an MP3 compressed to 128 kbps loses roughly 90% of the unique sound information, whereas an MP3 compressed to 320 kbps solely loses about 60%.

Additionally, when you’ve got an MP3 and an AAC each compressed to the identical bitrate, the AAC will typically sound higher as a result of it makes use of a extra superior compression algorithm. This implies you will get extra “high quality per megabyte” with AAC than MP3.

Optimize Your Audio Information Sizes

Understanding these 5 elements will aid you determine one of the best ways to report and compress music and/or podcasts that you have created and aid you determine what sort of music codecs to buy or which streaming companies to make use of.



Supply hyperlink