shithub: sox

Download patch

ref: d4a0c0caf9a9eefd44e70e1f37672669a4c166fe
parent: a6226206180e467cc43e291fb23078d37c2ea7a3
author: robs <robs>
date: Mon Dec 18 14:09:27 EST 2006

Retire -v for output file; clean-ups

--- a/sox.1
+++ b/sox.1
@@ -40,16 +40,15 @@
 optionally apply effects to them; it includes a simple audio synthesiser,
 and on unix-like systems, can also play and record sound files.
 .P
-If more than one input file is specified then they are concatenated into the
-output file.  In this case, there is the restriction that all input files
-must be of the same data type and sample rates.
+Multiple input files can be combined to form the output file using one
+of three methods: `concatenate', `mix', or `merge'.  There is currently
+the restriction that multiple input files must have the same number of
+channels and the same sample rate (though not necessarily the same file
+format).
 .P
-.I soxmix
-is functionally the same as the command line program
-.I sox
-except that it takes two or more files as input and mixes the audio together
-to produce a single file as output.  It has a restriction that all
-input files must be of the same data type and sample rates.
+The default combining method for \fBsox\fR is `concatenate';
+\fBsoxmix\fR is an alias for \fBsox\fR for which the the default
+combining method is `mix'.
 .P
 Exit status is 0 for no error, 1 if there is a problem with the command-line arguments, and 2 if an error occurs during file processing.
 .P
@@ -64,7 +63,7 @@
 be used to describe the audio in some way, e.g. for music, the title,
 the author, etc.
 .P
-The second type is header-less data, often called raw data.
+The second type is headerless data, often called raw data.
 For this type, a
 user must pass enough information to
 .I SoX
@@ -88,7 +87,7 @@
 The number of audio channels contained in the file.  1 ("mono") and 2
 ("stereo") are widely used.
 .P
-Please refer to the
+Refer to the
 .B soxexam(1)
 manual page for a long description with examples on how to use SoX with
 various file formats.
@@ -101,7 +100,7 @@
 (i.e. converting back again would yield an exact copy of the original
 audio signal)
 where it
-can be, i.e. when not using "lossy" compression (e.g. A-law, mp3, etc.)
+can be, i.e. when not using "lossy" compression (e.g. A-law, MP3, etc.)
 and the number of bits used in the destination format is not less than
 in the source format.
 
@@ -116,7 +115,7 @@
 .br
 Clipping is distortion that occurs when an audio signal
 level exceeds the range of the chosen representation.
-Clipping is rarely desirable and so should usually be corrected by
+Clipping is nearly always undesirable and so should usually be corrected by
 adjusting the audio volume prior to the point at which clipping occurs.
 
 In \fISoX\fR, clipping could occur, as you might expect, when using the
@@ -137,7 +136,7 @@
 .I stat
 effect can assist here by displaying the signal level in an audio file.
 
-If clipping occurrs at any point during processing, then
+If clipping occurs at any point during processing, then
 .I SoX
 will display a warning message to that effect.
 
@@ -152,21 +151,28 @@
 into a Microsoft .WAV file, while
 .P
 .br
-	sox -v 0.5 file.au -r 12000 file.wav dither
+	sox file.au -r 12000 -1 file.wav vol 0.5 dither
 .P
 .br
 does the same format translation but also 
-lowers the amplitude by 1/2, changes
-the sampling rate to 12000 Hz, and applies the \fBdither\fR sound effect
-to the audio data.
+changes the sampling rate to 12000 Hz, 
+the sample size to 1 byte (8 bits),
+and applies the \fBvol\fR and \fBdither\fR sound effects
+to the audio data;
 .P
-The following will mix two sound files together to produce a single sound
-file.
+.br
+	sox short.au long.au longer.au
 .P
 .br
-	soxmix music.wav voice.wav mixed.wav
+concatenates two sound files to produce a single file, whilst
+.P
+.br
+	sox -m music.mp3 voice.wav mixed.flac
+.P
+.br
+mixes together two sound files.
 .PP
-\fBSpecial Filenames:\fR
+\fBSpecial Filenames\fR
 .TP 10
 \fB-\fR
 SoX can be used in pipeline operations by using the special
@@ -190,25 +196,29 @@
 .I -n
 but is left here for historical reasons.
 .PP
-\fBGlobal options:\fR
+\fBGlobal Options\fR
 .TP 10
-\fB-h\fR
+\fB\-h\fR, \fB\-\-help\fR
 Print version number and usage information.
 .TP 10
-\fB--help\fR
-Same as \fB-h\fR
-.TP 10
 \fB--help-effect=name\fR
-Print usage information on the specified effect.  The name
-\fBall\fR can be used to disable usage on all effects.
+Display usage information on the specified effect.  The name
+\fBall\fR can be used to display usage on all effects.
 .TP 10
 \fB\-m\fR, \fB\-\-mix\fR
-Behave as \fBsoxmix\fR, i.e. mix multiple input files instead of
-concatenating them.
+Set the input file combining method to `mix'.
+Two or more input files must be given,
+and will be mixed together (instead of concatenated)
+to form the output file.
+
+See also \fBInput File Balancing\fR below.
 .TP 10
 \fB\-M\fR, \fB\-\-merge\fR
-Merge multiple input files instead of concatenating them.
-Input files must have the same data format characteristics.
+Set the input file combining method to `merge'.
+Two or more input files must be given,
+and will be merged together (instead of concatenated)
+to form the output file.
+
 This can be used for example to merge two mono files into one
 stereo file; the first and second mono files become
 the left and right channels of the stereo file.
@@ -230,7 +240,7 @@
 option.
 .TP
 \fB-S\fR
-Print status while processing audio data.  Tells how much of audio data has been
+Display status while processing audio data.  Shows how much of audio data has been
 processed in terms of audio running time instead of samples.
 .TP 10
 \fB--version\fR
@@ -278,9 +288,45 @@
 sets it to 0.
 .IP
 .PP
-\fBInput And Output File Format Options:\fR
+\fBInput File Balancing\fR
+.br
+When multiple input files are given, \fISoX\fR applies any specified
+effects (including, for example, volume adjustment) after the audio
+has been combined.  However, as with a traditional audio mixer, it is
+useful to be able to set the volume of (i.e. `balance') the inputs
+individually, before combining takes place.
+
+If the selected combining method is `mix' then, to guarantee that
+clipping does not occur at the mixing stage, \fISoX\fR defaults to
+adjusting the amplitude of each input signal by a factor of 1/n, where n
+is the number of input files; if this results in audio that is perceived
+to be too quiet, then the volume adjustments can be set manually
+instead.  For the other combining methods, the default behaviour is for no
+input volume adjustments.
+
+Manual input file volume adjustment is achieved using the following
+option which, as with format options, can be given for one or more input
+files; if it is given for only some of the input files then the others
+receive no volume adjustment (regardless of combining method):
+
+.TP 10
+\fB-v \fIvolume\fR
+Adjust volume by a factor of \fIvolume\fR.
+This is a linear (amplitude) adjustment, so a number less than 1
+decreases the volume; greater than 1 increases it.  If a negative number
+is given, then in addition to the volume adjustment, the audio signal
+will be inverted.
+
+See the \fBstat\fR effect for information on how to find
+the maximum volume of an audio file to help with setting
+suitable values for this option.
+.P
+The \fB-V\fR option will show what input file volume adjustments
+have been selected (either manually or automatically).
 .PP
-Format options affect the input or output file that they immediately precede.
+\fBInput And Output File Format Options\fR
+.br
+These options apply to the input or output file that they immediately precede.
 .PP
 Self describing input files can contain all the format information in the header and so don't generally need format options.  Headerless input files lack this information and so format options must be used to inform SoX of the file's data type, sample rate, and number of channels.
 .PP
@@ -310,7 +356,7 @@
 \fB-t \fIfiletype\fR
 gives the file type of the sound sample file.  Useful when file extension 
 is not standard or can not be determined by looking at the header of the file.
-.ti +3
+
 The 
 .I -t
 option can also be used to override the type implied by an input filename
@@ -318,35 +364,9 @@
 .I SoX
 will exit with an appropriate error message if such a header is not
 actually present.
-.ti +3
+
 See the section \fRFILE TYPES\fR for a list of supported file types.
 .TP 10
-\fB-v \fIvolume\fR
-Change amplitude (floating point); 
-less than 1.0 decreases, greater than 1.0 increases.  May use a negative
-number to invert the phase of the audio data.  It is interesting to note
-that we perceive volume
-logarithmically but this adjusts the amplitude linearly.
-
-As with other format options, the volume option effects the
-file it's specified with.  This is useful when processing multiple
-input files as the volume adjustment can be specified for each
-input file or just once to adjust the output file.  This can be
-compared to an audio mixer were you can control the volume of
-each input as well as a master volume (output side).
-
-\fIsoxmix\fR defaults the value of the -v option for each input
-file to 1/input_file_count.  This means if you're mixing two
-input files together then each input file's volume is adjusted
-by 0.5.  This is done to prevent clipping of audio data during
-the mixing operation. 
-Users will most likely not be happy with this large of a volume adjustment
-and can specify the -v option to override this default value.
-
-Note: For the non-mixing case, see the \fBstat\fR effect for information on 
-finding the maximum volume adjustment that can be done with this option 
-without causing audio data to be clipped.
-.TP 10
 \fB-x\fR
 The sample data comes from a machine with the opposite word order 
 than yours and must
@@ -394,9 +414,9 @@
 Aliases for -1/-2/-4/-8.
 Abbreviations of: byte, word, long word, double long (long long) word.
 .PP
-\fBOutput File Format Options:\fR
-.PP
-These options may precede only the output file.
+\fBOutput File Format Options\fR
+.br
+These options apply to and may precede only the output file.
 .TP 10
 \fB--comment \fItext\fR
 Specify the comment text to store in the output file header (where applicable).
@@ -416,7 +436,7 @@
 .I SoX
 uses the following method to determine the type of audio to use for
 each input file and the output file:
-.ti +3
+
 If
 .I -n
 or
@@ -529,7 +549,7 @@
 music. It is similar to MP3 and Ogg Vorbis, but lossless,
 meaning that audio is compressed in FLAC without any loss in
 quality. 
-.ti +3
+
 .B SoX
 can decode native FLAC files (.flac) but not Ogg FLAC files (.ogg).
 [But see 
@@ -536,7 +556,7 @@
 .B .ogg
 below for information relating to support for Ogg
 Vorbis files.]
-.ti +3
+
 .B SoX
 has rudimentary support for writing FLAC files: it can encode to
 native FLAC using compression levels 0 to 8. 8 is the default
@@ -545,7 +565,7 @@
 level can be selected using the
 .B -C
 option (see above) with a whole number from 0 to 8.
-.ti +3
+
 Note that Replay Gain information is not used by
 .B SoX
 if present in FLAC input files and is not generated by
@@ -562,7 +582,7 @@
 .B SoX
 ).
 .br
-.ti +3
+
 FLAC support in
 .B SoX
 is optional and requires optional FLAC libraries.  To
@@ -605,7 +625,7 @@
 MPEG standards for audio and video compression. It is a lossy
 compression format that achieves good compression rates with little
 quality loss. Also see Ogg Vorbis for a similar format.
-.ti +3
+
 MP3 support in
 .B SoX
 is optional and requires access to either or both the external 
@@ -622,22 +642,22 @@
 special filename
 .I -n
 in place of an input or output filename.
-.ti +3
+
 Using this file type to input audio is equivalent to
 using a normal audio file that contains an infinite amount
 of silence, and as such is not generally useful unless used
 with an effect that specifies a finite time length
 (such as \fBtrim\fR or \fBsynth\fR).
-.ti +3
+
 Using this type to output audio amounts to discarding the audio
 and is useful mainly with effects that produce information about the
 audio instead of affecting it
 (such as \fBnoiseprof\fR or \fBstat\fR).
-.ti +3
+
 The number of channels and the sampling rate associated with a null file
 are by default 2 and 44.1kHz respectively, but these can be overriden
 if necessary by using appropriate \fBFormat Options\fR.
-.ti +3
+
 One other use of the null file type is to use it in conjunction
 with
 .I -V
@@ -653,7 +673,7 @@
 and streaming audio.  It is a lossy compression format (similar to MP3,
 VQF & AAC) that achieves good compression rates with a minimum amount of
 quality loss.  Also see MP3 for a similar format.
-.ti +3  
+
 .B SoX
 can decode all types of Ogg Vorbis files, and can encode at different
 compression levels/qualities given as a number from -1 (highest
@@ -663,9 +683,9 @@
 .B -C
 option (see above) with a number from -1 to 10; fractional numbers (e.g.
 3.6) are also allowed.
-.ti +3  
+
 Decoding is somewhat CPU intensive and encoding is very CPU intensive.
-.ti +3  
+
 Ogg Vorbis in
 .B SoX
 is optional and requires access to external Ogg Vorbis libraries.  To
@@ -764,12 +784,12 @@
 format.
 .TP 10
 .B .vox
-A header-less file of Dialogic/OKI ADPCM audio data commonly comes with the
+A headerless file of Dialogic/OKI ADPCM audio data commonly comes with the
 extension .vox.  This ADPCM data has 12-bit precision packed into only 4-bits.
 .TP 10
 .B .wav
 Microsoft .WAV RIFF files.
-The are the native sound file format of Windows, and widely used for uncompressed sound.
+This is the native sound file format of Windows, and widely used for uncompressed sound.
 
 Normally \fB.wav\fR files have all formatting information
 in their headers, and so do not need any format options
@@ -1100,7 +1120,7 @@
 Apply a flanging effect to the signal.
 All parameters are optional (right to left).
 
-       RANGE DEFAULT DESCRIPTION
+PARAM  RANGE DEFAULT DESCRIPTION
 .RS
 .TP 21
 \fIdelay\fR   0 10    0
@@ -1253,7 +1273,7 @@
 
 .br
 -cutoff # : specify the filter cutoff frequency in terms of fraction of
-frequency bandwidth, also know as the Nyquist frequency.  Please see 
+frequency bandwidth, also know as the Nyquist frequency.  See 
 the \fIresample\fR effect for
 further information on Nyquist frequency.  If up-sampling, then this is the 
 fraction of the original signal
@@ -1469,7 +1489,7 @@
 appended with `\fIc\fR', the number of cents (i.e. 100ths of a
 semitone) by which the pitch (and tempo) should be adjusted:
 > 0 increases, < 0 decreases.
-.ti +3
+
 By default, the speed change is performed by the \fBresample\fR
 effect with its default parameters.  For higher quality
 resampling, in addition to the \fBspeedr\fR effect, specify
@@ -1557,7 +1577,7 @@
 the output from the previous stage.
 Audio for each channel in a multi-channel sound file can be synthesised
 independently.
-.ti +3
+
 Though this effect is used to generate audio data, an input file must
 still be specified.  This can be used to set the synthesised audio
 length, the number of channels, and the sampling rate, however since the
@@ -1565,7 +1585,7 @@
 .I null
 file "\fI-n\fR" is usually used instead (and the length specified
 as a parameter to \fIsynth\fR).
-.ti +3
+
 For example, the following produces a 3 second, 44.1kHz,
 stereo audio file containing a sine-wave swept from 300 to 3300 Hz.
 
@@ -1606,7 +1626,7 @@
 \fIlen\fR is the length of audio to synthesise expressed as a time
 or as a number of samples;
 0=inputlength, default=0.
-.ti +3
+
 The format for specifying lengths in time is hh:mm:ss.frac.  The format
 for specifying sample counts is the number of samples with the letter
 's' appended to it.
@@ -1701,11 +1721,12 @@
 mode, this effect will display the percentage of audio data that needed to be
 limited.
 .SH BUGS
-Please report any bugs found in this version of SoX mailing list (sox-users@lists.sourceforge.net)
+Please report any bugs found in this version of SoX to the mailing list
+(sox-users@lists.sourceforge.net).
 .SH SEE ALSO
 .BR play (1),
 .BR rec (1),
-.BR soxexam(1)
+.BR soxexam (1)
 .LP
 The SoX web page at http://sox.sourceforge.net/
 .SH LICENSE