About Lessons Handouts Practice AP Questions Contact M&EMs |
Lesson 2.4: The Chain Rule (by Carlo Angiuli)
We already know how to differentiate many functions, but there are some which still escape our grasp: among them are composite functions. Composite functions are those formed by plugging a function into another function. Let’s look at a simple example: . The graph of is shown. We can logically deduce the derivative by comparing to , whose derivative you will recall is . Well, is the same as , except that it is compressed horizontally by a factor of 3. Notice that at we are at the bottom of the wave, not the top, as you would be in at . This makes sense, of course, since you’re multiplying by 3 before plugging it into the sine function. So places at on correspond to places at on , so we have to plug into the derivative, too, so it corresponds to the right location. Therefore our notion so far of the derivative of is . But we’re not done yet. Since is compressed by a factor of three, notice that it’s three times as steep at every point, since it’s going through the same sequence of points three times as quickly. Or, if you’d prefer, is the same for a that’s a third as large (because will go through the same points in that span of values), and . So since it’s three times as steep, we have to multiply its derivative by three. Now we think the derivative of is . This happens to be exactly correct. Now, let’s generalize and figure out the derivative of , which you will sometimes see written as . (Note that in the previous example, and .) Think of as another variable; let’s call it . Therefore we’re looking at the function . behaves similarly to , except that the value of doesn’t determine where you are on the graph; rather, the value of does. Since corresponds to values on , likewise corresponds to the correct place on . The difference is that, since you’re still modifying directly rather than , might be getting compressed or expanded at certain points, depending on how acts at that point. ( is linear, so it provides a consistent compression for the entire graph.) The amount by which is getting compressed or expanded at a given is proportional to how is changing at that point—in other words, it is proportional to . If , then is being compressed, because is going through values faster than is, just as is going through values faster than is. That means that the derivative of is larger than the derivative of . By a factor of , to be precise, as that is the ratio of how much faster goes through values than does. Likewise, if , then is being expanded since is going through values more slowly than is, so the , again by a factor of . And if is negative, then is going backwards as compared to , so we have to take the opposite of the derivative. In all of these cases, differs from by a factor of , which is the same as . Therefore, just as in the first example, the derivative of is . Or, if you prefer, we can write . This latter equation for what we will now call the Chain Rule has the advantage of providing a convenient (albeit somewhat incorrect) mnemonic that you can simply “cancel” . Some of you may be interested in a rigorous proof, but that is really a topic more suited to the study of analysis than that of calculus. |