Mike Andrews

Tubes Versus Transistors
Is There An Audible Difference?
Russell O. Hamm

Journal of The Audio Engineering Society
Presented September 14, 1972,
at the 43rd Convention of the
Audio Engineering Society, New York

Sear-Matthews Audio Systems, 212-874-0137
20 W 84th St., New York, New York 10024

ABSTRACT

Engineers and musicians have long debated the question of tube sound versus transistor sound. Previous attempts to measure this difference have always assumed linear operation of the test amplifier. This conventional method of frequency response, distortion and noise measurement has shown that no significant difference exists. This paper, however, points out that amplifiers are often severely overloaded by signal transients (THD 30% ). Under this condition there is a major difference in the harmonic distortion components of the amplified signal, with tubes, transistors, and operational amplifiers separating into distinct groups.

INTRODUCTION

As a recording engineer we became directly involved with the tube sound versus transistor sound controversy as it related to pop recording. The difference became markedly noticeable as more solid-state consoles made their appearance. Of course there are so many sound problems related to studio acoustics that electronic problems are generally considered the least of one's worries. After acoustically rebuilding several studios, however, we began to question just how much of a role acoustics played.

During one session in a studio notorious for bad sound we plugged the microphones into Ampex portable mixers instead of the regular console. The change in sound quality was nothing short of incredible. All the acoustic changes we had made in that studio never had brought about the vast improvement in the sound that a single change in electronics had. Over a period of several years we continued this rather informal investigation of the electronic sound problem. In the past, we have heard many widely varied theories that explain the problem, but no one, however, could actually measure it in meaningful terms.

PSYCHOACOUSTICS

Anyone who listens to phonograph records closely can tell that tubes sound different from transistors. Defining what this difference is, however, is a complex psychoacoustical problem. Any investigation of this admittedly subtle phenomenon must really begin with a few human observations. Some people try to point out and describe valid differences. Others just object to the entire thesis and resort to spouting opinions. It is the listener's job to sort out the facts from the fiction. Psychoacoustically, musicians make more objective subjects than engineers. While their terms may not be expressed in standard units, the musician's "by ear" measuring technique seems quite valid. Consider the possibility that the ear's response may be quite different than an oscilloscope's.

"Tube records have more bass....The bass actually sounds an octave lower," says one rock guitarist. A couple of professional studio players have pointed out on numerous occasions that the middle range of tube recordings is very clear, each instrument has presence, even at very low playback levels. Transistor recordings tend to emphasize the sibilants and cymbals, especially at low levels. "Transistor recordings are very clean but they lack the 'air' of a good tube recording." "With tubes there is a space between the instruments even when they play loud...transistors make a lot of buzzing." Two people commented that transistors added a lot of musically unrelated harmonics or white noise. especially on attack transients. This same phenomenon was expressed by another person as a "shattered glass" sound that restricted the dynamics. 1t was generally agreed that tubes did not have this problem because they overload gently. Finally, according to one record producer, "Transistor records sound restricted like they're under a blanket. Tube records jump out of the speaker at you....Transistors have highs and lows but there is no punch to the sound."

When we heard an unusually loud and clear popular-music studio recording, we tried to trace its origin. In almost every case we found that the recording console had vacuum-tube preamplifiers. We are specific in mentioning preamplifiers because in many cases we found hybrid systems. Typically this is a three- or four-track console that is modified with solid-state line amplifiers to feed a solid-state eight- or sixteen-track tape machine. Our extensive checking has indicated only two areas where vacuum-tube circuitry makes a definite audible difference in the sound quality: microphone preamplifiers and power amplifiers driving speakers or disc cutters. Both are applications where there is a mechanical-electrical interface.

As the preliminary basis for our further investigation we decided to look into microphone and preamplifier signal levels under actual studio operating conditions. Hoping to find some clues here we would then try to carry this work further and relate electrical operating conditions to acoustically subjective sound colorations. Our search through published literature showed that little work bas been undertaken in this area. Most microphone manufacturers publish extensive data on output levels under standard test conditions [1], but this is rather hard to convert to terms of microphone distances and playing volumes. Preamplifier circuit design is well covered for noise considerations [2], but not from the standpoint of actual microphone operating levels. Distortion has been treated in numerous ways [3-5], but with very few references to musical sound quality [10].

MICROPHONE OUTPUT LEVELS

To get a rough idea of the voltage output from different types of microphones, an oscilloscope was paralleled across inputs of a console. During the normal popular-music type sessions, peak readings of 1 volt or more were common, especially from close-up microphones on voice and drums. Due to the linear voltage scale, oscilloscope measurements over more than a 10-dB range are difficult. By building a simple bipolar logarithmic amplifier, the useful measuring range was extended to about four decades (Fig. 1). Considerable studio observation finally led to the construction of a peak holding type decibel meter. This circuit retained transient peaks of more than 50 microseconds within 2-dB accuracy for about 10 seconds; long enough to write them town. Using the logarithmic oscilloscope display and the peak meter together proved very useful in gathering a wealth of data about real-life microphone signals.

Table 1 shows the normal peak outputs from several popular types of studio microphones. All the readings are taken with the microphone operating into the primary of an unloaded transformer. Pickup distances re indicated for each instrument and were determined by normal studio practice. Table II is an abridgment of a similar study done by Fine Recording. Inc., several years ago. Details of this test setup are not available but the readings are probably taken without the 6dB pad commonly used on the U-47 microphone today. Some calculations based on the manufacturer's published sensitivity for these microphones indicates that acoustic sound-pressure levels in excess of 130-dB are common. While the latest console preamplifiers have less noise, less distortion, and more knobs than ever before, they are not designed to handle this kind of input level. In most commercially available preamplifiers, head room runs on the order of +20-dBm (1.0-dBm is 1 mw into 600 ohms), and gain is commonly set at 40-dB. With these basic parameters it is clear from the data shown in Tables I and II that severe overloads can occur on peaks from almost all instruments. For example, a U-87 microphone gives a peak output of -1 dBm from a large floor tom. Amplification by 40 dB in the microphone preamplifier results in an output swing of +39 dBm, or almost 20 dB above the overload point. Logically a peak of this magnitude should be severely distorted.

Most recording consoles today have variable resistive pads on the microphone inputs to attenuate signal levels which are beyond the capabilities of the preamplifier. The common use of these input pads supposedly came about with the advent of loud rock music; however, this is not true in fact. For some 20 years it has been common to use a Neumann U-47 microphone for close microphone recording of brass and voice. Table II shows output levels requiring 10-20 dB of padding under these conditions, and this does agree with recording practice today where solid-state amplifiers are used. But most tube consoles did not have input pads and yet the same microphone performed with little noticeable distortion. Certainly brass players and singers are not that much louder today than they were yesterday. The microphone distance is about the same. The preamplifier specifications have not changed that much.

Yet transistors require pads and tubes do not.

Here then is the hypothesis for further investigation. In the usual evaluation of audio preamplifiers it is assumed that they are operated in their linear range, i.e., harmonic distortion less than 10%. In this range tubes and transistors do have very similar performance characteristics. But the preceding section points out that amplifiers are often operated far out of their linear range at signal levels which would cause severe distortion. Under these conditions, tubes and transistors appear to behave quite differently from a sound viewpoint.

DISTORTION CHARACTERISTICS OF PREAMPLIFIERS

Three commercially available microphone preamplifiers of different designs were set up in the recording studio. Each amplifier was adjusted for a gain of 40 dB and an overload point of 3% total harmonic distortion (THD) at + 18-dBm. Preamplifier 1 was a transistor design, preamplifier 2 was a hybrid operational amplifier, and preamplifier 3 was a vacuum-tube triode design. The amplifier outputs were terminated in 600 ohm loads and bridged by the monitoring system. The test signal, U-87 microphone. and large floor tom were switchable to each preamplifier input.

An informal group of studio personnel listened to the outputs of the three amplifiers on the normal control room monitor speakers. As the test signal was switched from one amplifier to another, the listeners were asked to judge the sound quality. The output of amplifiers 1 and 2 was unanimously judged to be severely distorted. Amplifier 3, however, sounded clean. The test was repeated several times inserting attenuating pads in the microphone line until each amplifier sounded undistorted. Amplifier 1 could stand overloads of 5-10 dB without noticeable distortion. Amplifier 2 showed noticeable distortion at about 5-dB overload. Further listening revealed that it was only in the range of early overload where the amplifiers differed appreciably in sound quality. Once the amplifiers were well into the distortion region, they all sounded alike-distorted. In their normal nonoverload range all three amplifiers sounded very clean.

The listening tests clearly indicate that the overload margin varies widely between different types of amplifiers. Engineering studies show that any amplifier adds distortion as soon as the overload point is reached. The tests show that all amplifiers could be overloaded to a certain degree without this distortion becoming noticeable. It may be concluded that these inaudible harmonics in the early overload condition might very well be causing the difference in sound coloration between tubes and transistors.

To get a general representation of the character of harmonic distortion in audio amplifiers, overload curves were plotted for about fifty different circuits. The tube circuits used the popular 12AY7 and 12AX7 triodes, the 8628 and 7586 triode nuvistors, and the 5879 pentode. These tubes have all been extensively used in recording console preamplifiers. The 2N3391A, 2N5089. and 2N3117 silicon NPN transistors were also chosen because of their extensive use in console and tape recorder circuitry. For comparison purposes tests were also run on the 2N5087 which is the PNP sister of the 2N5089. Operational amplifiers included the popular 709 and LM301 monolithic units and two commercially available hybrid designs used in recording consoles.

The curves shown in Fig. 2 are representative of the general distortion characteristics of single-stage class Audio amplifiers. The devices are al1 operating open loop (no feedback) with a bias point which allows for maximum undistorted output swing. The curves are referenced to a common point of 3% (THD), regardless of actual input or output levels. Since the objective of these comparisons is to detect variations in the slopes of the distortion characteristics, the X axis is a scale of relative level, independent of circuit impedance considerations. These particular curves were chosen from the many plotted as representative of different families: silicon transistors, triodes, and pentodes. A quick look shows that the often versed opinion that tubes overload more gently than transistors is obviously a myth.

Fig. 3 shows the distortion characteristics for four different commercially available preamplifiers, using two or more stages of amplification. All the circuits use feedback, a couple are push-pull. Each amplifier is operating into 600 ohms at a gain of 40 dB. As in the previous curves, there is a common reference point of 1% THD. While these curves how a marked difference from the single-stage amplifiers, a review of the many different amplifiers tested shows that the slopes of all THD curves run about the same. The lack of a wide variation between the curves indicates that THD plots are not very relevant to what the ear hears in the listening tests.

Another series of tests were made on the same group of preamplifiers Using a spectrum analyzer to measure the amplitude of individual harmonics. Each amplifier was driven 12-dB into overload, starting from a reference point of 1% third harmonic distortion. Every harmonic to the seventh was plotted. Since it is not possible to measure the relative phase of the harmonics on the spectrum analyzer, the overload waveforms were recorded for Fourier analysis on the digital computer. The resulting plots divided amplifiers into three distinct categories.

1. Tube Characteristics

Fig. 4 shows the distortion components for a typical two-stage 12AY7 amplifier. This particular design is quite representative of several single-ended, multistage triode tube amplifiers tested. The outstanding characteristic is the dominance of the second harmonic followed closely by the third. The fourth harmonic rises 3-4-dB later, running parallel to the third. The fifth, sixth, and seventh remain below 5% out to the 12-dB overload point. These curves seem to be a general characteristic of all the triode amplifiers tested whether octal, miniature, nuvistor, single-ended. or push-pull. Fig. 5 is the waveform at 12-dB of overload. The clipping is unsymmetrical with a shifted duty cycle. Again this is characteristic of all the triode amplifiers tested. Fig. 6 shows the distortion components for a two-stage single ended pentode amplifier. Here the third harmonic is dominant and the second rises about 3-dB later with the same slope. Both the fourth and the fifth are prominent while the sixth and seventh remain under 5%. The waveform at 12-dB overload (Fig. 7), is similar to the triode, but its duty cycle is not shifted as much. It is not reasonable to assume that virtually all tube amplifiers can be represented by these two examples. However, the major characteristic of the tube amplifier is the presence of strong second and third harmonics, sometimes in concert with the fourth and fifth, but always much greater in amplitude. Harmonics higher than the fifth are not significant until the overload is beyond 12 dB. These characteristics seem to hold true for wide variations in circuit design parameters. The extreme difference in the tube amplifiers is the interchanging of the position of the second and third harmonics. This effect is not just a characteristic of the pentode, it is common to triodes too.

2. Transistor Characteristics

Figs. 8 and 10 show the characteristics of two transistor amplifiers. Like the previous figures the curves are representative of all the transistor amplifiers tested. The distinguishing feature is the strong third harmonic component All other harmonics are present, but at a much lower amplitude than the third. When the overload reaches a break point, all the higher harmonics begin to rise simultaneously. This point is generally within 3-6 dB of the 1% third harmonic point. The waveforms of these amplifiers (Figs. 9 and 11) are distinctly square wave in form with symmetrical clipping and an almost perfect duty cycle. Both amplifiers shown have single ended inputs and push-pull outputs. However, the circuit designs are radically different.

3. Operational-amplifier Characteristics

Fig. 12 is a hybrid operational amplifier. The third harmonic rises steeply as the dominant distortion component in a characteristic similar to the transistor. Also rising very strongly from the same point are the fifth and seventh harmonics. All even harmonics are suppressed completely. The waveform of Fig. 13 is a perfect square wave. As a classification group, operational amplifiers have the most uniform characteristics with almost no deviation from the curves shown in this example.

In view of the transient nature of audio signals, steady state single-frequency distortion analysis could yield questionable results. Indeed, the arguments for and against sine-wave and pulse test signals for audio system testing have been the subject for a number of engineering papers [4] [7]. For our purposes, however, a few minutes toying with an electronic synthesizer quickly proved that musical instruments do not produce fast pulses. For example, a good simulation of the large floor tom used in the amplifier listening tests is a l00-Hz tone modulated with an envelope rise time of 5-ms and a decay time of 300-ms. Also an extensive study of trumpet tones [6] measured the rise time of the fastest staccato notes at 12-ms Certainly, rise times of these orders can not be considered pulses for audio amplifiers with passbands extending to 20-kHz or better. Just to further prove the correctness of the preceding steady-state results. the synthesized floor tom signal was used to test the same amplifiers at the same level as the microphone signal.

Careful observation of the amplified signal showed that envelope clipping was identical to the steady-state clipping level (Fig. 14). There were no glitches or other fast transient phenomena in the output signal.

SIGNIFICANCE OF MUSICAL HARMONICS

Having divided amplifiers into three groups of distortion characteristics, the next step is to determine how the harmonics relate to hearing. There is a close parallel here between electronic distortion and musical tone coloration that is the real key to why tubes and transistors sound different. Perhaps the most knowledgeable authorities in this area are the craftsmen who build organs and musical instruments [8] [9]. Through many years of careful experimentation these artisans have determined how various harmonics relate to the coloration of an instrument's tonal quality.

The primary color characteristic of an instrument is determined by the strength of the first few harmonics. Each of the lower harmonics produces its own characteristic effect when it is dominant or it can modify the effect of another dominant harmonic if it is prominent. In the simplest classification, the lower harmonics are divided into two tonal groups. The odd harmonics (third and fifth) produce a "stopped" or "covered" sound. The even harmonics (second, fourth, and sixth) produce "choral" or "singing" sounds.

The second and third harmonics are the most important from the viewpoint of the electronic distortion graphs in the previous section. Musically the second is an octave above the fundamental and is almost inaudible; yet it adds body to the sound, making it fuller. The third is termed a quint or musical twelfth. It produces a sound many musicians refer to as "blanketed." Instead of making the tone fuller, a strong third actually makes the tone softer. Adding a fifth to a strong third gives the sound a metallic quality that gets annoying in character as its amplitude increases. A strong second with a strong third tends to open the "covered" effect. Adding the fourth and the fifth to this changes the sound to an "open horn" like character.

The higher harmonics, above the seventh, give the tone "edge" or "bite." Provided the edge is balanced to the basic musical tone, it tends to reinforce the fundamental, giving the sound a sharp attack quality. Many of the edge harmonics are musically unrelated pitches such as the seventh, ninth, and eleventh. Therefore. too much edge can produce a raspy dissonant quality. Since the ear seems very sensitive to the edge harmonics, controlling their amplitude is of paramount importance. The previously mentioned study of the trumpet tone [6] shows that the edge effect is directly related to the loudness of the tone. Playing the same trumpet note loud or soft makes little difference in the amplitude of the fundamental and the lower harmonics. However. harmonics above the sixth increase and decrease in amplitude in a1most direct proportion to the loudness. This edge balance is a critically important loudness signal for the human ear.

RELATIONSHIP OF FACTORS AND FINDINGS

The basic cause of the difference in tube and transistor sound is the weighting of harmonic distortion components in the amplifier's overload region. Transistor amplifiers exhibit a strong component of third harmonic distortion when driven into overload. This harmonic produces a "covered" sound, giving the recording a restricted quality. Alternatively a tube amplifier when overloaded generates a whole spectrum of harmonics. Particularly strong are the second. third, fourth, and fifth overtones which give a full-bodied "brassy" quality to the sound. The further any amplifier is driven into saturation, the greater the amplitude of the higher harmonics like the seventh, eighth. ninth, etc. These add edge to the sound which the ear translates to loudness information. Overloading an operational amplifier produces such steeply rising edge harmonics that they become objectionable within a 5-dB range. Transistors extend this overload range to about 10-dB and tubes widen it 20-dB or more. Using this basic analysis, the psychoacoustic characteristics stated in the beginning of this paper can be related to the electrical harmonic properties of each type of amplifier.

It was not part of the original intent of this paper to analyze operational amplifiers. However, the tests show that they fall into a distinct class of their own. Basically, operational amplifiers produce strong third, fifth, and seventh harmonics when driven only a few dB into overload. The resultant sound is metallic with a very harsh edge which the ear hears as strong distortion. Since this sound is so objectionable, it acts as a clearly audible overload warning signal. Consequently. operational amplifiers are rarely operated in their saturated region. This results in a very cleanly amplified sound with little coloration and true dynamic range within the limitations of the amplifier. True dynamic range is not necessarily the determinant of good sound reproduction, however, since it is much greater than any disc or tape system presently available. Because of their characteristics operational amplifiers produce only the top end of the dynamic range which contains all the transients but lacks the solid pitch information which the ear hears as music. When records of true dynamic range are played on a limited-range system, they sound very thin. This relates directly to the originally cited listener's comment that transistor records were very clean but sounded sibilant and cymbally.

The transistor characteristics which our subjects noted were the buzzing or white-noise sound and the lack of "punch." The buzz is of course directly related to the edge produced by overloading on transients. The guess that this is white noise is due to the fact that many of the edge harmonics like the seventh and ninth are not musically related to the fundamental. The ear hears these dissonant tones as a kind of noise accompanying every attack. The lack of punch is due to the strong third harmonic which is inaudibly "blanketing" the sound. This is correctable by using a large enough pad to prevent all peaks from reaching the amplifier's saturated region. But from a practical standpoint, there is no way of determining this on most consoles. Adding auxiliary peak indicators on the input preamplifiers could alleviate both these problems. and the sound would be very close to that of the operational amplifier in its linear region.

Vacuum-tube amplifiers differ from transistor and operational amplifiers because they can be operated in the overload region without adding objectionable distortion. The combination of the slow rising edge and the open harmonic structure of the overload characteristics form an almost ideal sound-recording compressor. Within the 15-20-dB "safe" overload range, the electrical output of the tube amplifier increases by only 2-4 dB, acting like a limiter. However, since the edge is increasing within this range. the subjective loudness remains uncompressed to the ear. This effect causes tube-amplified signals to have a high apparent level which is not indicated on a volume indicator (VU meter). Tubes sound louder and have a better signal-to-noise ratio because of this extra subjective head room that transistor amplifiers do not have. Tubes get punch from their naturally brassy overload characteristics. Since the loud signals can be recorded at higher levels, the softer signals are also louder, so they are not lost in tape hiss and they effectively give the tube sound greater clarity. The feeling of more bass response is directly related to the strong second and third harmonic components which reinforce the "natural"' bass with "synthetic" bass [5]. In the context of a limited dynamic range system like the phonograph, recordings made with vacuum tube preamplifiers will have more apparent level and a greater signal to system noise ratio than recordings made with transistors or operational amplifiers.

ACKNOWLEDGMENT

The author wishes to thank Walter Sear and Peter Scheiber for innumerable helpful discussions on the musician's viewpoint of sound. He also wishes to thank John Olson of RCA and Steve Temmer of Gotham Audio for the loaning of amplifiers.

REFERENCES

[I] "Neumann Transistor Condenser Microphones," Gotham Audio Corp.. Sales Bull. 1971.

[2] A. D. Smith and P. H. Whittman, "Design Considerations of Low Noise Audio Input Circuitry for a Professional Microphone Mixer." J. Audio Eng. Soc.. vol. 1S. pp. 140-156 (Apr. 1970).

[3] A. Schaumberger, "The Application of Impulse Measurement Techniques to the Detection of Linear Distortion." J. Audio Eng. Soc.. vol. 19, W 664 668 (Sept 1971).

[4] M. Otala, "Circuit Design Modifications for Minimizing Transient Intermodulation Distortion in Audio Amplifiers." J. Audio Eng. Soc.. vol. 20. pp. 396-399 (June 1972).

[5] F. Langford-Smith, Radiotron Designer's Handbook (RCA. 1953), chap. 14.

[6] J. C. Risset, "Computer Study of Trumpet Tones." Bell Telephone Laboratories. File MM-66-122-2.

[7] J. R. Ashley, T. A. Saponas, and R. C. Matson, "Test Signals for Music Reproduction Systems." IEEE Spectrum, vol. 8, pp. 53-61 (July 1971).

[8] A. H. Benade, Horns, Strings and Harmony (Doubleday, New York, 1960).

[9] R. A. Schaefer, "New Techniques for Organ Tone Generation," J. Audio Eng. Soc. vol. 19. pp. 570-575 (July/Aug. 1971).

[10] R. Langevin, "lntermodulation Distortion in Tape Recording," J. Audio Eng. Soc., vol. 11, pp. 270-278 (July 1963).

THE AUTHOR

Milbert Amplifiers, Inc. -- Vacuum Tube Car Audio Equipment Since 1986 Box 1027, Germantown, Maryland 20874 USA -- 301-963-9355 -- 301-840-0511 fax email:mmilbert@access.digex.net -- http://www.access.digex.net/~mmilbert/