The chain rule may also be expressed in Leibniz's notation. If a variable z depends on the variable y, which itself depends on the variable x (that is, y and z are dependent variables), then z depends on x as well, via the intermediate variable y. In this case, the chain rule is expressed as
$$\frac{dz}{dx} = \frac{dz}{dy} \cdot \frac{dy}{dx},$$
and
$$\left.\frac{dz}{dx}\right|_{x} = \left.\frac{dz}{dy}\right|_{y(x)} \cdot \left.\frac{dy}{dx}\right|_{x},$$
to indicate the points at which the derivatives have to be evaluated.
Intuitively, the chain rule states that knowing the instantaneous rate of change of z relative to y and that of y relative to x allows one to calculate the instantaneous rate of change of z relative to x as the product of the two rates of change.
As put by George F. Simmons: "If a car travels twice as fast as a bicycle and the bicycle is four times as fast as a walking man, then the car travels 2 × 4 = 8 times as fast as the man."[1]
The relationship between this example and the chain rule is as follows. Let z, y and x be the (variable) positions of the car, the bicycle, and the walking man, respectively. The rate of change of the relative positions of the car and the bicycle is $\frac{dz}{dy} = 2$. Similarly, $\frac{dy}{dx} = 4$. So, the rate of change of the relative positions of the car and the walking man is
$$\frac{dz}{dx} = \frac{dz}{dy} \cdot \frac{dy}{dx} = 2 \cdot 4 = 8.$$
The rate of change of positions is the ratio of the speeds, and the speed is the derivative of the position with respect to the time; that is,
$$\frac{dz}{dx} = \frac{dz/dt}{dx/dt},$$
or, equivalently,
$$\frac{dz}{dt} = \frac{dz}{dx} \cdot \frac{dx}{dt},$$
which is also an application of the chain rule.
History
The chain rule seems to have first been used by Gottfried Wilhelm Leibniz. He used it to calculate the derivative of $\sqrt{a + bz + cz^2}$ as the composite of the square root function and the function $a + bz + cz^2$. He first mentioned it in a 1676 memoir (with a sign error in the calculation).[2] The common notation of the chain rule is due to Leibniz.[3] Guillaume de l'Hôpital used the chain rule implicitly in his Analyse des infiniment petits. The chain rule does not appear in any of Leonhard Euler's analysis books, even though they were written over a hundred years after Leibniz's discovery.[citation needed] It is believed that the first "modern" version of the chain rule appears in Lagrange's 1797 Théorie des fonctions analytiques; it also appears in Cauchy's 1823 Résumé des Leçons données a L’École Royale Polytechnique sur Le Calcul Infinitesimal.[3]
Statement
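The simplest form of the chain rule is for real-valued functions of one real variable. It states that if g is a function that is differentiable at a point c (i.e. the derivative g′(c) exists) and f is a function that is differentiable at g(c), then the composite function f ∘ g is differentiable at c, and the derivative is[4]
$$(f \circ g)'(c) = f'(g(c)) \cdot g'(c).$$
The rule is sometimes abbreviated as
$$(f \circ g)' = (f' \circ g) \cdot g'.$$
If y = f(u) and u = g(x), then this abbreviated form is written in Leibniz notation as:
$$\frac{dy}{dx} = \frac{dy}{du} \cdot \frac{du}{dx}.$$
The points where the derivatives are evaluated may also be stated explicitly:
$$\left.\frac{dy}{dx}\right|_{x=c} = \left.\frac{dy}{du}\right|_{u=g(c)} \cdot \left.\frac{du}{dx}\right|_{x=c}.$$
Carrying the same reasoning further, given n functions $f_1, \ldots, f_n$ with the composite function $f_1 \circ (f_2 \circ \cdots (f_{n-1} \circ f_n))$, if each function $f_i$ is differentiable at its immediate input, then the composite function is also differentiable by repeated application of the chain rule, where the derivative is (in Leibniz's notation):
$$\frac{df_1}{dx} = \frac{df_1}{df_2} \, \frac{df_2}{df_3} \cdots \frac{df_n}{dx}.$$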
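The basic statement (f ∘ g)′(c) = f′(g(c))·g′(c) can be checked numerically. The following is a minimal sketch, not part of the original text: the functions f(u) = sin u and g(x) = x² + 1, the point c = 0.7, and the helper name `derivative` are all illustrative assumptions.

```python
import math

def derivative(func, x, h=1e-6):
    """Central-difference estimate of func'(x) (an approximation, not exact)."""
    return (func(x + h) - func(x - h)) / (2 * h)

def g(x):
    return x ** 2 + 1.0      # inner function, g'(x) = 2x

def f(u):
    return math.sin(u)       # outer function, f'(u) = cos(u)

def composite(x):
    return f(g(x))           # (f o g)(x)

c = 0.7
numeric = derivative(composite, c)            # finite-difference slope of f o g at c
by_chain_rule = math.cos(g(c)) * (2 * c)      # f'(g(c)) * g'(c)
print(numeric, by_chain_rule)
```

The two printed values should agree to within the accuracy of the finite-difference estimate.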
Applications
Composites of more than two functions
The chain rule can be applied to composites of more than two functions. To take the derivative of a composite of more than two functions, notice that the composite of f, g, and h (in that order) is the composite of f with g ∘ h. The chain rule states that to compute the derivative of f ∘ g ∘ h, it is sufficient to compute the derivative of f and the derivative of g ∘ h. The derivative of f can be calculated directly, and the derivative of g ∘ h can be calculated by applying the chain rule again.[citation needed]
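For concreteness, consider the function
$$y = e^{\sin(x^2)}.$$
This can be decomposed as the composite of three functions:
$$y = f(u) = e^u, \qquad u = g(v) = \sin v, \qquad v = h(x) = x^2,$$
so that y = f(g(h(x))).
Their derivatives are:
$$\frac{dy}{du} = f'(u) = e^u, \qquad \frac{du}{dv} = g'(v) = \cos v, \qquad \frac{dv}{dx} = h'(x) = 2x.$$
The chain rule states that the derivative of their composite at the point x = a is:
$$(f \circ g \circ h)'(a) = f'((g \circ h)(a)) \cdot (g \circ h)'(a) = f'((g \circ h)(a)) \cdot g'(h(a)) \cdot h'(a).$$
In Leibniz's notation, this is:
$$\frac{dy}{dx} = \left.\frac{dy}{du}\right|_{u=g(h(a))} \cdot \left.\frac{du}{dv}\right|_{v=h(a)} \cdot \left.\frac{dv}{dx}\right|_{x=a},$$
or for short,
$$\frac{dy}{dx} = \frac{dy}{du} \cdot \frac{du}{dv} \cdot \frac{dv}{dx}.$$
The derivative function is therefore:
$$\frac{dy}{dx} = e^{\sin(x^2)} \cdot \cos(x^2) \cdot 2x.$$
Another way of computing this derivative is to view the composite function f ∘ g ∘ h as the composite of f ∘ g and h. Applying the chain rule in this manner would yield:
$$(f \circ g \circ h)'(a) = (f \circ g)'(h(a)) \cdot h'(a) = f'(g(h(a))) \cdot g'(h(a)) \cdot h'(a).$$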
This is the same as what was computed above. This should be expected because (f ∘ g) ∘ h = f ∘ (g ∘ h).
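The fact that both groupings give the same derivative can be illustrated with a short sketch. It uses the three-fold composite e^{sin(x²)} with f(u) = e^u, g(v) = sin v and h(x) = x²; the function names `outer_first`, `inner_first` and `closed_form` are hypothetical helpers introduced only for this illustration.

```python
import math

def h(x): return x * x          # innermost function
def g(v): return math.sin(v)    # middle function
def f(u): return math.exp(u)    # outermost function

def dh(x): return 2 * x
def dg(v): return math.cos(v)
def df(u): return math.exp(u)

def outer_first(a):
    """Treat the composite as f o (g o h): f'((g o h)(a)) * (g o h)'(a)."""
    d_gh = dg(h(a)) * dh(a)             # chain rule applied to g o h
    return df(g(h(a))) * d_gh

def inner_first(a):
    """Treat the composite as (f o g) o h: (f o g)'(h(a)) * h'(a)."""
    d_fg = df(g(h(a))) * dg(h(a))       # chain rule applied to f o g, at h(a)
    return d_fg * dh(a)

def closed_form(a):
    """e^{sin(a^2)} * cos(a^2) * 2a."""
    return math.exp(math.sin(a * a)) * math.cos(a * a) * 2 * a

a = 1.3
print(outer_first(a), inner_first(a), closed_form(a))
```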
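Sometimes, it is necessary to differentiate an arbitrarily long composition of the form $f_1 \circ f_2 \circ \cdots \circ f_{n-1} \circ f_n$. In this case, define
$$f_{a\,.\,.\,b} = f_a \circ f_{a+1} \circ \cdots \circ f_{b-1} \circ f_b,$$
where $f_{a\,.\,.\,a} = f_a$ and $f_{a\,.\,.\,b}(x) = x$ when $b < a$. Then the chain rule takes the form
$$D f_{1\,.\,.\,n} = (D f_1 \circ f_{2\,.\,.\,n}) (D f_2 \circ f_{3\,.\,.\,n}) \cdots (D f_{n-1} \circ f_{n\,.\,.\,n}) \, D f_n = \prod_{k=1}^{n} \left[ D f_k \circ f_{(k+1)\,.\,.\,n} \right],$$
or, in the Lagrange notation,
$$f_{1\,.\,.\,n}'(x) = f_1'\left(f_{2\,.\,.\,n}(x)\right) \, f_2'\left(f_{3\,.\,.\,n}(x)\right) \cdots f_{n-1}'\left(f_{n\,.\,.\,n}(x)\right) \, f_n'(x) = \prod_{k=1}^{n} f_k'\left(f_{(k+1)\,.\,.\,n}(x)\right).$$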
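The product form for an arbitrarily long composition translates directly into a loop: evaluate the functions from the innermost outward, remember the input each one receives, and multiply the derivatives evaluated at those inputs. The sketch below assumes the derivative of each function is supplied by the caller; `compose_derivative` is a hypothetical name used only here.

```python
import math

def compose_derivative(funcs, derivs, x):
    """Derivative at x of funcs[0] o funcs[1] o ... o funcs[-1].

    derivs[k] must be the derivative of funcs[k]. An empty list is treated
    as the identity function, whose derivative is 1.
    """
    inputs = []
    value = x
    for func in reversed(funcs):     # innermost function first
        inputs.append(value)         # the argument this function receives
        value = func(value)
    inputs.reverse()                 # inputs[k] is now the argument of funcs[k]
    product = 1.0
    for d, arg in zip(derivs, inputs):
        product *= d(arg)            # factor f_k'(f_{(k+1)..n}(x))
    return product

# Illustrative use: exp o sin o square, evaluated at 1.3
funcs = [math.exp, math.sin, lambda t: t * t]
derivs = [math.exp, math.cos, lambda t: 2 * t]
print(compose_derivative(funcs, derivs, 1.3))
```

Storing the intermediate inputs avoids re-evaluating the inner composites for every factor.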
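Quotient rule
The chain rule can be used to derive some well-known differentiation rules. For example, the quotient rule is a consequence of the chain rule and the product rule. To see this, write the function f(x)/g(x) as the product f(x) · 1/g(x). First apply the product rule:
$$\frac{d}{dx}\left(\frac{f(x)}{g(x)}\right) = \frac{d}{dx}\left(f(x) \cdot \frac{1}{g(x)}\right) = f'(x) \cdot \frac{1}{g(x)} + f(x) \cdot \frac{d}{dx}\left(\frac{1}{g(x)}\right).$$
To compute the derivative of 1/g(x), notice that it is the composite of g with the reciprocal function, that is, the function that sends x to 1/x. The derivative of the reciprocal function is $-1/x^2$. By applying the chain rule, the last expression becomes:
$$f'(x) \cdot \frac{1}{g(x)} + f(x) \cdot \left(-\frac{1}{g(x)^2} \cdot g'(x)\right) = \frac{f'(x) g(x) - f(x) g'(x)}{g(x)^2},$$
which is the usual formula for the quotient rule.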
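As a worked instance of this derivation (an illustrative choice, not taken from the text), let the numerator be sin x and the denominator cos x, so that the quotient is tan x. The product rule followed by the chain rule on 1/cos x gives:

```latex
\begin{align*}
\frac{d}{dx}\tan x
  &= \frac{d}{dx}\left(\sin x \cdot \frac{1}{\cos x}\right) \\
  &= \cos x \cdot \frac{1}{\cos x}
     + \sin x \cdot \left(-\frac{1}{\cos^2 x}\right)(-\sin x) \\
  &= 1 + \frac{\sin^2 x}{\cos^2 x}
   = \frac{1}{\cos^2 x}.
\end{align*}
```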
Derivatives of inverse functions
Suppose that y = g(x) has an inverse function. Call its inverse function f so that we have x = f(y). There is a formula for the derivative of f in terms of the derivative of g. To see this, note that f and g satisfy the formula
$$f(g(x)) = x.$$
Because the functions f(g(x)) and x are equal, their derivatives must be equal. The derivative of x is the constant function with value 1, and the derivative of f(g(x)) is determined by the chain rule. Therefore, we have that:
$$f'(g(x)) \, g'(x) = 1.$$
To express f′ as a function of an independent variable y, we substitute f(y) for x wherever it appears. Then we can solve for f′:
$$f'(g(f(y))) \, g'(f(y)) = 1, \qquad f'(y) \, g'(f(y)) = 1, \qquad f'(y) = \frac{1}{g'(f(y))}.$$
For example, consider the function g(x) = e^x. It has an inverse f(y) = ln y. Because g′(x) = e^x, the above formula says that
$$\frac{d}{dy} \ln y = \frac{1}{e^{\ln y}} = \frac{1}{y}.$$
This formula is true whenever g is differentiable and its inverse f is also differentiable. This formula can fail when one of these conditions is not true. For example, consider g(x) = x^3. Its inverse is f(y) = y^{1/3}, which is not differentiable at zero. If we attempt to use the above formula to compute the derivative of f at zero, then we must evaluate 1/g′(f(0)). Since f(0) = 0 and g′(0) = 0, we must evaluate 1/0, which is undefined. Therefore, the formula fails in this case. This is not surprising because f is not differentiable at zero.
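The inverse-function formula f′(y) = 1/g′(f(y)), including its failure for the cube root at zero, can be sketched as follows. The helper names `inverse_derivative` and `cube_root` are assumptions made for this illustration.

```python
import math

def inverse_derivative(g_prime, f, y):
    """Evaluate f'(y) = 1 / g'(f(y)), where f is the inverse of g."""
    slope = g_prime(f(y))
    if slope == 0:
        # The formula breaks down exactly where g'(f(y)) vanishes.
        raise ZeroDivisionError("g'(f(y)) = 0: the formula does not apply here")
    return 1.0 / slope

# g(x) = e^x has g'(x) = e^x and inverse f(y) = ln y.
d_ln_at_2 = inverse_derivative(math.exp, math.log, 2.0)

# g(x) = x^3 has g'(x) = 3x^2 and inverse f(y) = y^(1/3).
def cube_root(y):
    return math.copysign(abs(y) ** (1.0 / 3.0), y)

def cube_prime(x):
    return 3.0 * x * x

d_cbrt_at_8 = inverse_derivative(cube_prime, cube_root, 8.0)

try:
    inverse_derivative(cube_prime, cube_root, 0.0)
    failed_at_zero = False
except ZeroDivisionError:
    failed_at_zero = True

print(d_ln_at_2, d_cbrt_at_8, failed_at_zero)
```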
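Higher derivatives
Faà di Bruno's formula generalizes the chain rule to higher derivatives. Assuming that y = f(u) and u = g(x), then the first few derivatives are:
$$\frac{dy}{dx} = \frac{dy}{du} \frac{du}{dx},$$
$$\frac{d^2 y}{dx^2} = \frac{d^2 y}{du^2} \left(\frac{du}{dx}\right)^2 + \frac{dy}{du} \frac{d^2 u}{dx^2},$$
$$\frac{d^3 y}{dx^3} = \frac{d^3 y}{du^3} \left(\frac{du}{dx}\right)^3 + 3 \, \frac{d^2 y}{du^2} \frac{du}{dx} \frac{d^2 u}{dx^2} + \frac{dy}{du} \frac{d^3 u}{dx^3},$$
$$\frac{d^4 y}{dx^4} = \frac{d^4 y}{du^4} \left(\frac{du}{dx}\right)^4 + 6 \, \frac{d^3 y}{du^3} \left(\frac{du}{dx}\right)^2 \frac{d^2 u}{dx^2} + \frac{d^2 y}{du^2} \left(4 \, \frac{du}{dx} \frac{d^3 u}{dx^3} + 3 \left(\frac{d^2 u}{dx^2}\right)^2\right) + \frac{dy}{du} \frac{d^4 u}{dx^4}.$$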
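The second-order case, (f ∘ g)″(x) = f″(g(x))·g′(x)² + f′(g(x))·g″(x), can be compared against a finite-difference estimate. This is a rough sketch under assumed choices: f(u) = sin u, g(x) = x³, the point 0.9, and the step size are illustrative, and `second_derivative_composite` is a hypothetical helper.

```python
import math

def second_derivative_composite(df, d2f, g, dg, d2g, x):
    """(f o g)''(x) = f''(g(x)) * g'(x)**2 + f'(g(x)) * g''(x)."""
    u = g(x)
    return d2f(u) * dg(x) ** 2 + df(u) * d2g(x)

# Illustrative choice: f(u) = sin(u), g(x) = x**3
x0 = 0.9
formula = second_derivative_composite(
    math.cos,                    # f'
    lambda u: -math.sin(u),      # f''
    lambda x: x ** 3,            # g
    lambda x: 3 * x ** 2,        # g'
    lambda x: 6 * x,             # g''
    x0,
)

def F(x):
    return math.sin(x ** 3)      # the composite f o g

step = 1e-4
# Standard three-point estimate of the second derivative.
numeric = (F(x0 + step) - 2 * F(x0) + F(x0 - step)) / step ** 2
print(formula, numeric)
```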
Proofs
First proof
One proof of the chain rule begins by defining the derivative of the composite function f ∘ g, where we take the limit of the difference quotient for f ∘ g as x approaches a:
$$(f \circ g)'(a) = \lim_{x \to a} \frac{f(g(x)) - f(g(a))}{x - a}.$$
Assume for the moment that g(x) does not equal g(a) for any x near a. Then the previous expression is equal to the product of two factors:
$$\lim_{x \to a} \frac{f(g(x)) - f(g(a))}{g(x) - g(a)} \cdot \frac{g(x) - g(a)}{x - a}.$$
If g oscillates near a, then it might happen that no matter how close one gets to a, there is always an even closer x such that g(x) = g(a). For example, this happens near a = 0 for the continuous function g defined by g(x) = 0 for x = 0 and g(x) = x² sin(1/x) otherwise. Whenever this happens, the above expression is undefined because it involves division by zero. To work around this, introduce a function Q as follows:
$$Q(y) = \begin{cases} \dfrac{f(y) - f(g(a))}{y - g(a)}, & y \neq g(a), \\[2ex] f'(g(a)), & y = g(a). \end{cases}$$
We will show that the difference quotient for f ∘ g is always equal to:
$$Q(g(x)) \cdot \frac{g(x) - g(a)}{x - a}.$$
Whenever g(x) is not equal to g(a), this is clear because the factors of g(x) − g(a) cancel. When g(x) equals g(a), then the difference quotient for f ∘ g is zero because f(g(x)) equals f(g(a)), and the above product is zero because it equals f′(g(a)) times zero. So the above product is always equal to the difference quotient, and to show that the derivative of f ∘ g at a exists and to determine its value, we need only show that the limit as x goes to a of the above product exists and determine its value.
To do this, recall that the limit of a product exists if the limits of its factors exist. When this happens, the limit of the product of these two factors will equal the product of the limits of the factors. The two factors are Q(g(x)) and (g(x) − g(a)) / (x − a). The latter is the difference quotient for g at a, and because g is differentiable at a by assumption, its limit as x tends to a exists and equals g′(a).
As for Q(g(x)), notice that Q is defined wherever f is. Furthermore, f is differentiable at g(a) by assumption, so Q is continuous at g(a), by definition of the derivative. The function g is continuous at a because it is differentiable at a, and therefore Q ∘ g is continuous at a. So its limit as x goes to a exists and equals Q(g(a)), which is f′(g(a)).
This shows that the limits of both factors exist and that they equal f′(g(a)) and g′(a), respectively. Therefore, the derivative of f ∘ g at a exists and equals f′(g(a))g′(a).
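The role of the auxiliary function Q in this proof can be made concrete. The sketch below builds Q for an assumed outer function f(y) = y² + 3y and checks that Q(g(x))·(g(x) − g(a))/(x − a) reproduces the difference quotient of f ∘ g, both for the oscillating g(x) = x² sin(1/x) and for a constant g, where g(x) = g(a) for every x. The names `make_Q`, `difference_quotient` and `factored` are hypothetical.

```python
import math

def make_Q(f, f_prime, g_a):
    """Build Q: the difference quotient of f at g(a), patched with f'(g(a)) at y = g(a)."""
    def Q(y):
        if y == g_a:
            return f_prime(g_a)
        return (f(y) - f(g_a)) / (y - g_a)
    return Q

def difference_quotient(func, a, x):
    return (func(x) - func(a)) / (x - a)

def f(y): return y * y + 3.0 * y        # assumed outer function
def f_prime(y): return 2.0 * y + 3.0

def g_osc(x):
    """g(0) = 0 and g(x) = x^2 sin(1/x) otherwise."""
    return 0.0 if x == 0 else x * x * math.sin(1.0 / x)

def g_const(x):
    return 2.0                           # g(x) == g(a) everywhere

def factored(g, a, x):
    """Q(g(x)) * (g(x) - g(a)) / (x - a); never divides by g(x) - g(a) when it is zero."""
    Q = make_Q(f, f_prime, g(a))
    return Q(g(x)) * difference_quotient(g, a, x)

a, x = 0.0, 0.01
lhs_osc = difference_quotient(lambda t: f(g_osc(t)), a, x)
rhs_osc = factored(g_osc, a, x)
lhs_const = difference_quotient(lambda t: f(g_const(t)), a, x)
rhs_const = factored(g_const, a, x)
print(lhs_osc, rhs_osc, lhs_const, rhs_const)
```

For the constant g, the naive factorisation would divide by zero, while the Q-based product is simply f′(g(a)) times zero.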
Second proof
Another way of proving the chain rule is to measure the error in the linear approximation determined by the derivative. This proof has the advantage that it generalizes to several variables. It relies on the following equivalent definition of differentiability at a point: A function g is differentiable at a if there exists a real number g′(a) and a function ε(h) that tends to zero as h tends to zero, and furthermore
$$g(a + h) - g(a) = g'(a) h + \varepsilon(h) h.$$
Here the left-hand side represents the true difference between the value of g at a and at a + h, whereas the right-hand side represents the approximation determined by the derivative plus an error term.
In the situation of the chain rule, such a function ε exists because g is assumed to be differentiable at a. Again by assumption, a similar function also exists for f at g(a). Calling this function η, we have
$$f(g(a) + k) - f(g(a)) = f'(g(a)) k + \eta(k) k.$$
The above definition imposes no constraints on η(0), even though it is assumed that η(k) tends to zero as k tends to zero. If we set η(0) = 0, then η is continuous at 0.
Proving the theorem requires studying the difference f(g(a + h)) − f(g(a)) as h tends to zero. The first step is to substitute for g(a + h) using the definition of differentiability of g at a:
$$f(g(a + h)) - f(g(a)) = f(g(a) + g'(a) h + \varepsilon(h) h) - f(g(a)).$$
The next step is to use the definition of differentiability of f at g(a). This requires a term of the form f(g(a) + k) for some k. In the above equation, the correct k varies with h. Set $k_h = g'(a) h + \varepsilon(h) h$ and the right-hand side becomes $f(g(a) + k_h) - f(g(a))$. Applying the definition of the derivative gives:
$$f(g(a) + k_h) - f(g(a)) = f'(g(a)) k_h + \eta(k_h) k_h.$$
To study the behavior of this expression as h tends to zero, expand $k_h$. After regrouping the terms, the right-hand side becomes:
$$f'(g(a)) g'(a) h + \left[ f'(g(a)) \varepsilon(h) \right] h + \left[ \eta(k_h) g'(a) \right] h + \left[ \eta(k_h) \varepsilon(h) \right] h.$$
Because ε(h) and $\eta(k_h)$ tend to zero as h tends to zero, the first two bracketed terms tend to zero as h tends to zero. Applying the same theorem on products of limits as in the first proof, the third bracketed term also tends to zero. Because the above expression is equal to the difference f(g(a + h)) − f(g(a)), by the definition of the derivative f ∘ g is differentiable at a and its derivative is f′(g(a)) g′(a).
The role of Q in the first proof is played by η in this proof. They are related by the equation:
$$Q(y) = f'(g(a)) + \eta(y - g(a)).$$
The need to define Q at g(a) is analogous to the need to define η at zero.
Third proof
Constantin Carathéodory's alternative definition of the differentiability of a function can be used to give an elegant proof of the chain rule.[6]
Under this definition, a function f is differentiable at a point a if and only if there is a function q, continuous at a and such that f(x) − f(a) = q(x)(x − a). There is at most one such function, and if f is differentiable at a then f ′(a) = q(a).
Given the assumptions of the chain rule and the fact that differentiable functions and compositions of continuous functions are continuous, there exist functions q, continuous at g(a), and r, continuous at a, such that
$$f(g(x)) - f(g(a)) = q(g(x)) (g(x) - g(a))$$
and
$$g(x) - g(a) = r(x)(x - a).$$
Therefore,
$$f(g(x)) - f(g(a)) = q(g(x)) \, r(x) (x - a),$$
but the function given by h(x) = q(g(x))r(x) is continuous at a, and we get, for this a,
$$(f \circ g)'(a) = q(g(a)) \, r(a) = f'(g(a)) \, g'(a).$$
A similar approach works for continuously differentiable (vector-)functions of many variables. This method of factoring also allows a unified approach to stronger forms of differentiability, when the derivative is required to be Lipschitz continuous, Hölder continuous, etc. Differentiation itself can be viewed as the polynomial remainder theorem (the little Bézout theorem, or factor theorem), generalized to an appropriate class of functions.[citation needed]
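Proof via infinitesimals
If y = f(x) and x = g(t), then choosing an infinitesimal Δt ≠ 0 we compute the corresponding Δx = g(t + Δt) − g(t) and then the corresponding Δy = f(x + Δx) − f(x), so that
$$\frac{\Delta y}{\Delta t} = \frac{\Delta y}{\Delta x} \cdot \frac{\Delta x}{\Delta t},$$
and applying the standard part we obtain
$$\frac{dy}{dt} = \frac{dy}{dx} \cdot \frac{dx}{dt},$$
which is the chain rule.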
Multivariable case
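The full generalization of the chain rule to multi-variable functions (such as $f : \mathbb{R}^m \to \mathbb{R}^n$) is rather technical. However, it is simpler to write in the case of functions of the form
$$f(g_1(x), \dots, g_k(x)),$$
where $f : \mathbb{R}^k \to \mathbb{R}$, and $g_i : \mathbb{R} \to \mathbb{R}$ for each $i = 1, 2, \dots, k$.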
As this case occurs often in the study of functions of a single variable, it is worth describing it separately.
Case of scalar-valued functions with multiple inputs
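Let $f : \mathbb{R}^k \to \mathbb{R}$, and $g_i : \mathbb{R} \to \mathbb{R}$ for each $i = 1, 2, \dots, k$.
To write the chain rule for the composition of functions
$$x \mapsto f(g_1(x), \dots, g_k(x)),$$
one needs the partial derivatives of f with respect to its k arguments. The usual notations for partial derivatives involve names for the arguments of the function. As these arguments are not named in the above formula, it is simpler and clearer to use D-notation, and to denote by
$$D_i f$$
the partial derivative of f with respect to its ith argument, and by
$$D_i f(z)$$
the value of this derivative at z.
With this notation, the chain rule is
$$\frac{d}{dx} f(g_1(x), \dots, g_k(x)) = \sum_{i=1}^{k} \left( \frac{d}{dx} g_i(x) \right) D_i f(g_1(x), \dots, g_k(x)).$$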
Example: arithmetic operations
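If the function f is addition, that is, if
$$f(u, v) = u + v,$$
then $D_1 f = \frac{\partial f}{\partial u} = 1$ and $D_2 f = \frac{\partial f}{\partial v} = 1$. Thus, the chain rule gives
$$\frac{d}{dx}(g(x) + h(x)) = \left(\frac{d}{dx} g(x)\right) D_1 f + \left(\frac{d}{dx} h(x)\right) D_2 f = \frac{d}{dx} g(x) + \frac{d}{dx} h(x).$$
For multiplication
$$f(u, v) = uv,$$
the partials are $D_1 f = v$ and $D_2 f = u$. Thus,
$$\frac{d}{dx}(g(x) h(x)) = h(x) \frac{d}{dx} g(x) + g(x) \frac{d}{dx} h(x).$$
The case of exponentiation
$$f(u, v) = u^v$$
is slightly more complicated, as
$$D_1 f = v u^{v-1},$$
and, as $u^v = e^{v \ln u}$,
$$D_2 f = u^v \ln u.$$
It follows that
$$\frac{d}{dx}\left(g(x)^{h(x)}\right) = h(x) \, g(x)^{h(x)-1} \frac{d}{dx} g(x) + g(x)^{h(x)} \ln g(x) \, \frac{d}{dx} h(x).$$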
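The exponentiation case can be sketched in code using the two partials D₁f = v·u^(v−1) and D₂f = u^v·ln u. The function name `power_rule_derivative` is a hypothetical helper, and the test case x^x is an illustrative choice; the base g(x) must be positive for the logarithm to be defined.

```python
import math

def power_rule_derivative(g, dg, h, dh, x):
    """d/dx g(x)**h(x) = h*g**(h-1)*g' + g**h*ln(g)*h'; requires g(x) > 0."""
    u, v = g(x), h(x)
    d1 = v * u ** (v - 1)          # partial of u**v with respect to u
    d2 = u ** v * math.log(u)      # partial of u**v with respect to v
    return d1 * dg(x) + d2 * dh(x)

# Illustrative use: d/dx x**x, which equals x**x * (1 + ln x)
identity = lambda x: x
one = lambda x: 1.0
value = power_rule_derivative(identity, one, identity, one, 2.0)
print(value, 2.0 ** 2.0 * (1 + math.log(2.0)))
```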
General rule: Vector-valued functions with multiple inputs
The simplest way to write the chain rule in the general case is to use the total derivative, which is a linear transformation that captures all directional derivatives in a single formula. Consider differentiable functions $f : \mathbb{R}^m \to \mathbb{R}^k$ and $g : \mathbb{R}^n \to \mathbb{R}^m$, and a point a in $\mathbb{R}^n$. Let $D_a g$ denote the total derivative of g at a and $D_{g(a)} f$ denote the total derivative of f at g(a). These two derivatives are linear transformations $\mathbb{R}^n \to \mathbb{R}^m$ and $\mathbb{R}^m \to \mathbb{R}^k$, respectively, so they can be composed. The chain rule for total derivatives is that their composite is the total derivative of f ∘ g at a:
$$D_a(f \circ g) = D_{g(a)} f \circ D_a g,$$
or for short,
$$D(f \circ g) = Df \circ Dg.$$
The higher-dimensional chain rule can be proved using a technique similar to the second proof given above.[7]
Because the total derivative is a linear transformation, the functions appearing in the formula can be rewritten as matrices. The matrix corresponding to a total derivative is called a Jacobian matrix, and the composite of two derivatives corresponds to the product of their Jacobian matrices. From this perspective the chain rule therefore says:
$$J_{f \circ g}(a) = J_f(g(a)) \, J_g(a),$$
or for short,
$$J_{f \circ g} = (J_f \circ g) \, J_g.$$
That is, the Jacobian of a composite function is the product of the Jacobians of the composed functions (evaluated at the appropriate points).
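The Jacobian form of the chain rule can be checked on a small example using only the standard library. In this sketch the maps g(x, y) = (xy, x + y) and f(u, v) = (sin u, uv²), the evaluation point, and the helpers `matmul` and `numeric_jacobian` are all assumptions made for illustration.

```python
import math

def g(p):
    x, y = p
    return [x * y, x + y]

def f(q):
    u, v = q
    return [math.sin(u), u * v * v]

def jac_g(p):
    x, y = p
    return [[y, x], [1.0, 1.0]]                       # rows: gradients of g_1, g_2

def jac_f(q):
    u, v = q
    return [[math.cos(u), 0.0], [v * v, 2.0 * u * v]]  # rows: gradients of f_1, f_2

def matmul(A, B):
    """Plain matrix product of two lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def numeric_jacobian(func, p, h=1e-6):
    """Central-difference Jacobian; entry [i][j] estimates d func_i / d p_j."""
    columns = []
    for j in range(len(p)):
        plus, minus = list(p), list(p)
        plus[j] += h
        minus[j] -= h
        fp, fm = func(plus), func(minus)
        columns.append([(a - b) / (2 * h) for a, b in zip(fp, fm)])
    return [[columns[j][i] for j in range(len(p))] for i in range(len(columns[0]))]

a = [0.5, 1.5]
by_chain_rule = matmul(jac_f(g(a)), jac_g(a))          # J_f(g(a)) J_g(a)
numeric = numeric_jacobian(lambda p: f(g(p)), a)       # estimate of J_{f o g}(a)
print(by_chain_rule)
print(numeric)
```

Note that the Jacobian of f is evaluated at g(a), not at a, matching the "appropriate points" remark above.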
The higher-dimensional chain rule is a generalization of the one-dimensional chain rule. If k, m, and n are 1, so that $f : \mathbb{R} \to \mathbb{R}$ and $g : \mathbb{R} \to \mathbb{R}$, then the Jacobian matrices of f and g are 1 × 1. Specifically, they are:
$$J_g(a) = \begin{pmatrix} g'(a) \end{pmatrix}, \qquad J_f(g(a)) = \begin{pmatrix} f'(g(a)) \end{pmatrix}.$$
The Jacobian of f ∘ g is the product of these 1 × 1 matrices, so it is f′(g(a))⋅g′(a), as expected from the one-dimensional chain rule. In the language of linear transformations, $D_a(g)$ is the function which scales a vector by a factor of g′(a) and $D_{g(a)}(f)$ is the function which scales a vector by a factor of f′(g(a)). The chain rule says that the composite of these two linear transformations is the linear transformation $D_a(f \circ g)$, and therefore it is the function that scales a vector by f′(g(a))⋅g′(a).
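Another way of writing the chain rule is used when f and g are expressed in terms of their components as $y = f(u) = (f_1(u), \dots, f_k(u))$ and $u = g(x) = (g_1(x), \dots, g_m(x))$. In this case, the above rule for Jacobian matrices is usually written as:
$$\frac{\partial(y_1, \dots, y_k)}{\partial(x_1, \dots, x_n)} = \frac{\partial(y_1, \dots, y_k)}{\partial(u_1, \dots, u_m)} \, \frac{\partial(u_1, \dots, u_m)}{\partial(x_1, \dots, x_n)}.$$
The chain rule for total derivatives implies a chain rule for partial derivatives. Recall that when the total derivative exists, the partial derivative in the i-th coordinate direction is found by multiplying the Jacobian matrix by the i-th basis vector. By doing this to the formula above, we find:
$$\frac{\partial(y_1, \dots, y_k)}{\partial x_i} = \frac{\partial(y_1, \dots, y_k)}{\partial(u_1, \dots, u_m)} \, \frac{\partial(u_1, \dots, u_m)}{\partial x_i}.$$
Since the entries of the Jacobian matrix are partial derivatives, we may simplify the above formula to get:
$$\frac{\partial(y_1, \dots, y_k)}{\partial x_i} = \sum_{\ell=1}^{m} \frac{\partial(y_1, \dots, y_k)}{\partial u_\ell} \, \frac{\partial u_\ell}{\partial x_i}.$$
More conceptually, this rule expresses the fact that a change in the $x_i$ direction may change all of $g_1$ through $g_m$, and any of these changes may affect f.
In the special case where k = 1, so that f is a real-valued function, this formula simplifies even further:
$$\frac{\partial y}{\partial x_i} = \sum_{\ell=1}^{m} \frac{\partial y}{\partial u_\ell} \, \frac{\partial u_\ell}{\partial x_i}.$$
This can be rewritten as a dot product. Recalling that $u = (g_1, \dots, g_m)$, the partial derivative $\partial u / \partial x_i$ is also a vector, and the chain rule says that:
$$\frac{\partial y}{\partial x_i} = \nabla y \cdot \frac{\partial u}{\partial x_i}.$$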
Example
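Given $u(x, y) = x^2 + 2y$ where $x(r, t) = r \sin(t)$ and $y(r, t) = \sin^2(t)$, determine the value of ∂u / ∂r and ∂u / ∂t using the chain rule.[citation needed]
$$\frac{\partial u}{\partial r} = \frac{\partial u}{\partial x} \frac{\partial x}{\partial r} + \frac{\partial u}{\partial y} \frac{\partial y}{\partial r} = (2x)(\sin t) + (2)(0) = 2 r \sin^2(t),$$
and
$$\frac{\partial u}{\partial t} = \frac{\partial u}{\partial x} \frac{\partial x}{\partial t} + \frac{\partial u}{\partial y} \frac{\partial y}{\partial t} = (2x)(r \cos t) + (2)(2 \sin t \cos t) = (2 r \sin t)(r \cos t) + 4 \sin t \cos t = (r^2 + 2) \sin(2t).$$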
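The two partial derivatives in this example, for u = x² + 2y with x = r sin t and y = sin² t, can be verified numerically. The sketch below is illustrative only; the function names and the sample point (r, t) = (1.5, 0.4) are assumptions.

```python
import math

def x_of(r, t): return r * math.sin(t)
def y_of(r, t): return math.sin(t) ** 2

def u_direct(r, t):
    """u expressed directly in terms of r and t."""
    return x_of(r, t) ** 2 + 2 * y_of(r, t)

def du_dr(r, t):
    # (du/dx)(dx/dr) + (du/dy)(dy/dr), with dy/dr = 0
    return 2 * x_of(r, t) * math.sin(t) + 2 * 0.0

def du_dt(r, t):
    # (du/dx)(dx/dt) + (du/dy)(dy/dt)
    return 2 * x_of(r, t) * (r * math.cos(t)) + 2 * (2 * math.sin(t) * math.cos(t))

r0, t0, step = 1.5, 0.4, 1e-6
numeric_dr = (u_direct(r0 + step, t0) - u_direct(r0 - step, t0)) / (2 * step)
numeric_dt = (u_direct(r0, t0 + step) - u_direct(r0, t0 - step)) / (2 * step)
print(du_dr(r0, t0), numeric_dr)
print(du_dt(r0, t0), numeric_dt)
```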
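Higher derivatives of multivariable functions
Faà di Bruno's formula for higher-order derivatives of single-variable functions generalizes to the multivariable case. If y = f(u) is a function of u = g(x) as above, then the second derivative of f ∘ g is:
$$\frac{\partial^2 y}{\partial x_i \, \partial x_j} = \sum_{k} \left( \frac{\partial y}{\partial u_k} \, \frac{\partial^2 u_k}{\partial x_i \, \partial x_j} \right) + \sum_{k, \ell} \left( \frac{\partial^2 y}{\partial u_k \, \partial u_\ell} \, \frac{\partial u_k}{\partial x_i} \, \frac{\partial u_\ell}{\partial x_j} \right).$$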
Further generalizations
All extensions of calculus have a chain rule. In most of these, the formula remains the same, though the meaning of that formula may be vastly different.
One generalization is to manifolds. In this situation, the chain rule represents the fact that the derivative of f ∘ g is the composite of the derivative of f and the derivative of g. This theorem is an immediate consequence of the higher dimensional chain rule given above, and it has exactly the same formula.
In differential algebra, the derivative is interpreted as a morphism of modules of Kähler differentials. A ring homomorphism of commutative rings f : R → S determines a morphism of Kähler differentials $Df : \Omega_R \to \Omega_S$ which sends an element dr to d(f(r)), the exterior differential of f(r). The formula D(f ∘ g) = Df ∘ Dg holds in this context as well.
The common feature of these examples is that they are expressions of the idea that the derivative is part of a functor. A functor is an operation on spaces and functions between them. It associates to each space a new space and to each function between two spaces a new function between the corresponding new spaces. In each of the above cases, the functor sends each space to its tangent bundle and it sends each function to its derivative. For example, in the manifold case, the derivative sends a $C^r$-manifold to a $C^{r-1}$-manifold (its tangent bundle) and a $C^r$-function to its total derivative. There is one requirement for this to be a functor, namely that the derivative of a composite must be the composite of the derivatives. This is exactly the formula D(f ∘ g) = Df ∘ Dg.
There are also chain rules in stochastic calculus. One of these, Itō's lemma, expresses the composite of an Itō process (or more generally a semimartingale) $dX_t$ with a twice-differentiable function f. In Itō's lemma, the derivative of the composite function depends not only on $dX_t$ and the derivative of f but also on the second derivative of f. The dependence on the second derivative is a consequence of the non-zero quadratic variation of the stochastic process, which broadly speaking means that the process can move up and down in a very rough way. This variant of the chain rule is not an example of a functor because the two functions being composed are of different types.
See also
Automatic differentiation: a computational method that makes heavy use of the chain rule to compute exact numerical derivatives.