How to find Frechet derivative of $f(x)=|Ax-b|^2$ at any $x^*$?












1












$begingroup$


Given a real $m times n$ matrix $A$ and $b in mathbb{R}^m$, let $f(x)=|Ax-b|^2$ for any $x in mathbb{R}^n$.



Find Frechet derivative of $f(x)=|Ax-b|^2$ at any $x^*$?



Actually I am wondering how to use the following to find the Frechet derivative, i.e., $J$:



$$lim_{h rightarrow 0} frac{|f(x+h)-f(x)-Jh|}{|h|} =0$$



Another question is what the difference between what we would get from Frechet derivative and the gradient $nabla f(x)=2A^T(Ax-b)$?



Please explain your reasons in detail, especially, what is the difference between gradient of $f$ and Frechet derivative. Also, explain when they might be identical.










share|cite|improve this question











$endgroup$












  • $begingroup$
    Write $f$ as a composition of an affine function and a bilinear function.
    $endgroup$
    – Will M.
    Dec 10 '18 at 4:05
















1












$begingroup$


Given a real $m times n$ matrix $A$ and $b in mathbb{R}^m$, let $f(x)=|Ax-b|^2$ for any $x in mathbb{R}^n$.



Find Frechet derivative of $f(x)=|Ax-b|^2$ at any $x^*$?



Actually I am wondering how to use the following to find the Frechet derivative, i.e., $J$:



$$lim_{h rightarrow 0} frac{|f(x+h)-f(x)-Jh|}{|h|} =0$$



Another question is what the difference between what we would get from Frechet derivative and the gradient $nabla f(x)=2A^T(Ax-b)$?



Please explain your reasons in detail, especially, what is the difference between gradient of $f$ and Frechet derivative. Also, explain when they might be identical.










share|cite|improve this question











$endgroup$












  • $begingroup$
    Write $f$ as a composition of an affine function and a bilinear function.
    $endgroup$
    – Will M.
    Dec 10 '18 at 4:05














1












1








1





$begingroup$


Given a real $m times n$ matrix $A$ and $b in mathbb{R}^m$, let $f(x)=|Ax-b|^2$ for any $x in mathbb{R}^n$.



Find Frechet derivative of $f(x)=|Ax-b|^2$ at any $x^*$?



Actually I am wondering how to use the following to find the Frechet derivative, i.e., $J$:



$$lim_{h rightarrow 0} frac{|f(x+h)-f(x)-Jh|}{|h|} =0$$



Another question is what the difference between what we would get from Frechet derivative and the gradient $nabla f(x)=2A^T(Ax-b)$?



Please explain your reasons in detail, especially, what is the difference between gradient of $f$ and Frechet derivative. Also, explain when they might be identical.










share|cite|improve this question











$endgroup$




Given a real $m times n$ matrix $A$ and $b in mathbb{R}^m$, let $f(x)=|Ax-b|^2$ for any $x in mathbb{R}^n$.



Find Frechet derivative of $f(x)=|Ax-b|^2$ at any $x^*$?



Actually I am wondering how to use the following to find the Frechet derivative, i.e., $J$:



$$lim_{h rightarrow 0} frac{|f(x+h)-f(x)-Jh|}{|h|} =0$$



Another question is what the difference between what we would get from Frechet derivative and the gradient $nabla f(x)=2A^T(Ax-b)$?



Please explain your reasons in detail, especially, what is the difference between gradient of $f$ and Frechet derivative. Also, explain when they might be identical.







derivatives frechet-derivative






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Dec 9 '18 at 21:02









Saeed

969310




969310










asked Dec 9 '18 at 19:37









SepideSepide

2788




2788












  • $begingroup$
    Write $f$ as a composition of an affine function and a bilinear function.
    $endgroup$
    – Will M.
    Dec 10 '18 at 4:05


















  • $begingroup$
    Write $f$ as a composition of an affine function and a bilinear function.
    $endgroup$
    – Will M.
    Dec 10 '18 at 4:05
















$begingroup$
Write $f$ as a composition of an affine function and a bilinear function.
$endgroup$
– Will M.
Dec 10 '18 at 4:05




$begingroup$
Write $f$ as a composition of an affine function and a bilinear function.
$endgroup$
– Will M.
Dec 10 '18 at 4:05










1 Answer
1






active

oldest

votes


















0












$begingroup$

If $mathrm{H}$ is a Hilbert space, then every continuous linear function $u:mathrm{H} to mathbf{R}$ can be represented by means of scalar product with respect to a unique vector, here denoted as $x_u:$



$$u(y) = (y mid x_u).$$



Hence, if $f:mathrm{H} to mathbf{R}$ is a differentiable function, then its derivative $u = f'(a)$ at $a$ is a continuous linear function. The vector $x_u$ is denoted $nabla f(a)$ in this case. And we have the fundamental relation:



$$f'(a) cdot h = (nabla f(a) mid h).$$



In regards to your particular $f,$ we can write $f(x) = (Ax -b mid Ax - b)$ and by the products and chain rules,
$$f'(x) cdot h = (Ax - b mid Ah) + (Ah mid Ax - b) = 2(Ah mid Ax - b).$$



If $mathrm{H} = mathbf{R}^d,$ and we are dealing with the standard Euclidean inner product, we can write further $f'(x) cdot h = (2A^intercal (Ax - b) mid h),$ this signifies $nabla f(x) = 2A^intercal (Ax - b).$ Q.E.D.






share|cite|improve this answer









$endgroup$













    Your Answer





    StackExchange.ifUsing("editor", function () {
    return StackExchange.using("mathjaxEditing", function () {
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    });
    });
    }, "mathjax-editing");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "69"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    noCode: true, onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3032857%2fhow-to-find-frechet-derivative-of-fx-ax-b-2-at-any-x%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    0












    $begingroup$

    If $mathrm{H}$ is a Hilbert space, then every continuous linear function $u:mathrm{H} to mathbf{R}$ can be represented by means of scalar product with respect to a unique vector, here denoted as $x_u:$



    $$u(y) = (y mid x_u).$$



    Hence, if $f:mathrm{H} to mathbf{R}$ is a differentiable function, then its derivative $u = f'(a)$ at $a$ is a continuous linear function. The vector $x_u$ is denoted $nabla f(a)$ in this case. And we have the fundamental relation:



    $$f'(a) cdot h = (nabla f(a) mid h).$$



    In regards to your particular $f,$ we can write $f(x) = (Ax -b mid Ax - b)$ and by the products and chain rules,
    $$f'(x) cdot h = (Ax - b mid Ah) + (Ah mid Ax - b) = 2(Ah mid Ax - b).$$



    If $mathrm{H} = mathbf{R}^d,$ and we are dealing with the standard Euclidean inner product, we can write further $f'(x) cdot h = (2A^intercal (Ax - b) mid h),$ this signifies $nabla f(x) = 2A^intercal (Ax - b).$ Q.E.D.






    share|cite|improve this answer









    $endgroup$


















      0












      $begingroup$

      If $mathrm{H}$ is a Hilbert space, then every continuous linear function $u:mathrm{H} to mathbf{R}$ can be represented by means of scalar product with respect to a unique vector, here denoted as $x_u:$



      $$u(y) = (y mid x_u).$$



      Hence, if $f:mathrm{H} to mathbf{R}$ is a differentiable function, then its derivative $u = f'(a)$ at $a$ is a continuous linear function. The vector $x_u$ is denoted $nabla f(a)$ in this case. And we have the fundamental relation:



      $$f'(a) cdot h = (nabla f(a) mid h).$$



      In regards to your particular $f,$ we can write $f(x) = (Ax -b mid Ax - b)$ and by the products and chain rules,
      $$f'(x) cdot h = (Ax - b mid Ah) + (Ah mid Ax - b) = 2(Ah mid Ax - b).$$



      If $mathrm{H} = mathbf{R}^d,$ and we are dealing with the standard Euclidean inner product, we can write further $f'(x) cdot h = (2A^intercal (Ax - b) mid h),$ this signifies $nabla f(x) = 2A^intercal (Ax - b).$ Q.E.D.






      share|cite|improve this answer









      $endgroup$
















        0












        0








        0





        $begingroup$

        If $mathrm{H}$ is a Hilbert space, then every continuous linear function $u:mathrm{H} to mathbf{R}$ can be represented by means of scalar product with respect to a unique vector, here denoted as $x_u:$



        $$u(y) = (y mid x_u).$$



        Hence, if $f:mathrm{H} to mathbf{R}$ is a differentiable function, then its derivative $u = f'(a)$ at $a$ is a continuous linear function. The vector $x_u$ is denoted $nabla f(a)$ in this case. And we have the fundamental relation:



        $$f'(a) cdot h = (nabla f(a) mid h).$$



        In regards to your particular $f,$ we can write $f(x) = (Ax -b mid Ax - b)$ and by the products and chain rules,
        $$f'(x) cdot h = (Ax - b mid Ah) + (Ah mid Ax - b) = 2(Ah mid Ax - b).$$



        If $mathrm{H} = mathbf{R}^d,$ and we are dealing with the standard Euclidean inner product, we can write further $f'(x) cdot h = (2A^intercal (Ax - b) mid h),$ this signifies $nabla f(x) = 2A^intercal (Ax - b).$ Q.E.D.






        share|cite|improve this answer









        $endgroup$



        If $mathrm{H}$ is a Hilbert space, then every continuous linear function $u:mathrm{H} to mathbf{R}$ can be represented by means of scalar product with respect to a unique vector, here denoted as $x_u:$



        $$u(y) = (y mid x_u).$$



        Hence, if $f:mathrm{H} to mathbf{R}$ is a differentiable function, then its derivative $u = f'(a)$ at $a$ is a continuous linear function. The vector $x_u$ is denoted $nabla f(a)$ in this case. And we have the fundamental relation:



        $$f'(a) cdot h = (nabla f(a) mid h).$$



        In regards to your particular $f,$ we can write $f(x) = (Ax -b mid Ax - b)$ and by the products and chain rules,
        $$f'(x) cdot h = (Ax - b mid Ah) + (Ah mid Ax - b) = 2(Ah mid Ax - b).$$



        If $mathrm{H} = mathbf{R}^d,$ and we are dealing with the standard Euclidean inner product, we can write further $f'(x) cdot h = (2A^intercal (Ax - b) mid h),$ this signifies $nabla f(x) = 2A^intercal (Ax - b).$ Q.E.D.







        share|cite|improve this answer












        share|cite|improve this answer



        share|cite|improve this answer










        answered Dec 11 '18 at 5:13









        Will M.Will M.

        2,615315




        2,615315






























            draft saved

            draft discarded




















































            Thanks for contributing an answer to Mathematics Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3032857%2fhow-to-find-frechet-derivative-of-fx-ax-b-2-at-any-x%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Quarter-circle Tiles

            build a pushdown automaton that recognizes the reverse language of a given pushdown automaton?

            Mont Emei