What does phi mean in this context?












1












$begingroup$


In an article on Selection Bias in A/B Testing, AirBnB proposes a solution to their own estimated bias. This solution is to subtract from the aggregated effect a bias estimate, captured by this equation:
$$
hat{beta} = sum_{i=1}^n W_i phi left( frac{W_i b_i - X_i}{W_i} right)
$$

where




$X_1, ldots, X_n$ are random variables defined on a same probability space, and each $X_i$ follows a distribution with finite mean $a_i$ and finite variance $sigma_i^2$ (the distributions are not necessarily identical.) We regard $a_i$ as the unknown true effect and usually estimate it by the unbiased estimate $X_i$;



$b_i$ is the cut-off from the reference distribution for significance level $alpha_i$, usually set at $0.05$; and



$W_i$ is the estimated standard deviation of $X_i$, to define the bias estimate




In this context, what does $phi$ stand for? My current understanding (by separating the two terms inside the parenthesis into
$$
b_i - X_i / W_i
$$

is that they're calculating the individual bias estimates as the difference between the cutoff and "how many" standard deviations fit into the estimate. Then they're adding these up, but I don't know what $W_iphi$ is doing inside the sum.










share|cite|improve this question











$endgroup$












  • $begingroup$
    The phi coefficient is a correlation between two variables.
    $endgroup$
    – Phil H
    Oct 2 '18 at 0:38










  • $begingroup$
    So, the equation states that Wi is correlated to the expression in parenthesis, but it does not affect the actual sum?
    $endgroup$
    – Khashir
    Oct 3 '18 at 4:53
















1












$begingroup$


In an article on Selection Bias in A/B Testing, AirBnB proposes a solution to their own estimated bias. This solution is to subtract from the aggregated effect a bias estimate, captured by this equation:
$$
hat{beta} = sum_{i=1}^n W_i phi left( frac{W_i b_i - X_i}{W_i} right)
$$

where




$X_1, ldots, X_n$ are random variables defined on a same probability space, and each $X_i$ follows a distribution with finite mean $a_i$ and finite variance $sigma_i^2$ (the distributions are not necessarily identical.) We regard $a_i$ as the unknown true effect and usually estimate it by the unbiased estimate $X_i$;



$b_i$ is the cut-off from the reference distribution for significance level $alpha_i$, usually set at $0.05$; and



$W_i$ is the estimated standard deviation of $X_i$, to define the bias estimate




In this context, what does $phi$ stand for? My current understanding (by separating the two terms inside the parenthesis into
$$
b_i - X_i / W_i
$$

is that they're calculating the individual bias estimates as the difference between the cutoff and "how many" standard deviations fit into the estimate. Then they're adding these up, but I don't know what $W_iphi$ is doing inside the sum.










share|cite|improve this question











$endgroup$












  • $begingroup$
    The phi coefficient is a correlation between two variables.
    $endgroup$
    – Phil H
    Oct 2 '18 at 0:38










  • $begingroup$
    So, the equation states that Wi is correlated to the expression in parenthesis, but it does not affect the actual sum?
    $endgroup$
    – Khashir
    Oct 3 '18 at 4:53














1












1








1





$begingroup$


In an article on Selection Bias in A/B Testing, AirBnB proposes a solution to their own estimated bias. This solution is to subtract from the aggregated effect a bias estimate, captured by this equation:
$$
hat{beta} = sum_{i=1}^n W_i phi left( frac{W_i b_i - X_i}{W_i} right)
$$

where




$X_1, ldots, X_n$ are random variables defined on a same probability space, and each $X_i$ follows a distribution with finite mean $a_i$ and finite variance $sigma_i^2$ (the distributions are not necessarily identical.) We regard $a_i$ as the unknown true effect and usually estimate it by the unbiased estimate $X_i$;



$b_i$ is the cut-off from the reference distribution for significance level $alpha_i$, usually set at $0.05$; and



$W_i$ is the estimated standard deviation of $X_i$, to define the bias estimate




In this context, what does $phi$ stand for? My current understanding (by separating the two terms inside the parenthesis into
$$
b_i - X_i / W_i
$$

is that they're calculating the individual bias estimates as the difference between the cutoff and "how many" standard deviations fit into the estimate. Then they're adding these up, but I don't know what $W_iphi$ is doing inside the sum.










share|cite|improve this question











$endgroup$




In an article on Selection Bias in A/B Testing, AirBnB proposes a solution to their own estimated bias. This solution is to subtract from the aggregated effect a bias estimate, captured by this equation:
$$
hat{beta} = sum_{i=1}^n W_i phi left( frac{W_i b_i - X_i}{W_i} right)
$$

where




$X_1, ldots, X_n$ are random variables defined on a same probability space, and each $X_i$ follows a distribution with finite mean $a_i$ and finite variance $sigma_i^2$ (the distributions are not necessarily identical.) We regard $a_i$ as the unknown true effect and usually estimate it by the unbiased estimate $X_i$;



$b_i$ is the cut-off from the reference distribution for significance level $alpha_i$, usually set at $0.05$; and



$W_i$ is the estimated standard deviation of $X_i$, to define the bias estimate




In this context, what does $phi$ stand for? My current understanding (by separating the two terms inside the parenthesis into
$$
b_i - X_i / W_i
$$

is that they're calculating the individual bias estimates as the difference between the cutoff and "how many" standard deviations fit into the estimate. Then they're adding these up, but I don't know what $W_iphi$ is doing inside the sum.







probability-theory probability-distributions






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Oct 2 '18 at 21:33









mwt

939416




939416










asked Oct 1 '18 at 23:49









KhashirKhashir

1062




1062












  • $begingroup$
    The phi coefficient is a correlation between two variables.
    $endgroup$
    – Phil H
    Oct 2 '18 at 0:38










  • $begingroup$
    So, the equation states that Wi is correlated to the expression in parenthesis, but it does not affect the actual sum?
    $endgroup$
    – Khashir
    Oct 3 '18 at 4:53


















  • $begingroup$
    The phi coefficient is a correlation between two variables.
    $endgroup$
    – Phil H
    Oct 2 '18 at 0:38










  • $begingroup$
    So, the equation states that Wi is correlated to the expression in parenthesis, but it does not affect the actual sum?
    $endgroup$
    – Khashir
    Oct 3 '18 at 4:53
















$begingroup$
The phi coefficient is a correlation between two variables.
$endgroup$
– Phil H
Oct 2 '18 at 0:38




$begingroup$
The phi coefficient is a correlation between two variables.
$endgroup$
– Phil H
Oct 2 '18 at 0:38












$begingroup$
So, the equation states that Wi is correlated to the expression in parenthesis, but it does not affect the actual sum?
$endgroup$
– Khashir
Oct 3 '18 at 4:53




$begingroup$
So, the equation states that Wi is correlated to the expression in parenthesis, but it does not affect the actual sum?
$endgroup$
– Khashir
Oct 3 '18 at 4:53










1 Answer
1






active

oldest

votes


















0












$begingroup$

According to their paper, $phi$ refers to the density function of standard normal distribution.



Let me include some details:



They use the following result:




Suppose $Z$ follows standard normal distribution, then



$$f_{Z|Zge t}(s) = frac{phi(s)}{1-Phi(t)}$$
and hence
$$E[Z|Z ge t]=frac{int_t^infty sphi(s), ds}{1-Phi(t)}=frac{-int_t^infty phi'(s), ds}{1-Phi(t)}=frac{phi(t)}{1-Phi(t)}$$




Assuming $X_i$ follows normal distribution $N(a_i, sigma_i^2)$.
begin{align}
&beta = E[S_A-T_A] \&=sum_{i=1}^nE[I((X_i-a_i)>(b_isigma_i-a_i))(X_i-a_i)]\
&= sum_{i=1}^n sigma_i Eleft[Ileft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{X_i-a_i}{sigma_i}right]\
&=sum_{i=1}^n sigma_iPleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)Eleft[frac{X_i-a_i}{sigma_i}left|frac{X_i-a_i}{sigma}>frac{b_isigma_i-a_i}{sigma}right.right]\
&=sum_{i=1}^n sigma_i Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{phileft( frac{b_isigma_i-a_i}{sigma_i}right)}{Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)} \
&=sum_{i=1}^n sigma_i phileft( frac{b_isigma_i-a_i}{sigma_i}right)\
end{align}



They then replace some parameters via estimation.






share|cite|improve this answer











$endgroup$













    Your Answer





    StackExchange.ifUsing("editor", function () {
    return StackExchange.using("mathjaxEditing", function () {
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    });
    });
    }, "mathjax-editing");

    StackExchange.ready(function() {
    var channelOptions = {
    tags: "".split(" "),
    id: "69"
    };
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function() {
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled) {
    StackExchange.using("snippets", function() {
    createEditor();
    });
    }
    else {
    createEditor();
    }
    });

    function createEditor() {
    StackExchange.prepareEditor({
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: true,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    imageUploader: {
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    },
    noCode: true, onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    });


    }
    });














    draft saved

    draft discarded


















    StackExchange.ready(
    function () {
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2938671%2fwhat-does-phi-mean-in-this-context%23new-answer', 'question_page');
    }
    );

    Post as a guest















    Required, but never shown

























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    0












    $begingroup$

    According to their paper, $phi$ refers to the density function of standard normal distribution.



    Let me include some details:



    They use the following result:




    Suppose $Z$ follows standard normal distribution, then



    $$f_{Z|Zge t}(s) = frac{phi(s)}{1-Phi(t)}$$
    and hence
    $$E[Z|Z ge t]=frac{int_t^infty sphi(s), ds}{1-Phi(t)}=frac{-int_t^infty phi'(s), ds}{1-Phi(t)}=frac{phi(t)}{1-Phi(t)}$$




    Assuming $X_i$ follows normal distribution $N(a_i, sigma_i^2)$.
    begin{align}
    &beta = E[S_A-T_A] \&=sum_{i=1}^nE[I((X_i-a_i)>(b_isigma_i-a_i))(X_i-a_i)]\
    &= sum_{i=1}^n sigma_i Eleft[Ileft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{X_i-a_i}{sigma_i}right]\
    &=sum_{i=1}^n sigma_iPleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)Eleft[frac{X_i-a_i}{sigma_i}left|frac{X_i-a_i}{sigma}>frac{b_isigma_i-a_i}{sigma}right.right]\
    &=sum_{i=1}^n sigma_i Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{phileft( frac{b_isigma_i-a_i}{sigma_i}right)}{Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)} \
    &=sum_{i=1}^n sigma_i phileft( frac{b_isigma_i-a_i}{sigma_i}right)\
    end{align}



    They then replace some parameters via estimation.






    share|cite|improve this answer











    $endgroup$


















      0












      $begingroup$

      According to their paper, $phi$ refers to the density function of standard normal distribution.



      Let me include some details:



      They use the following result:




      Suppose $Z$ follows standard normal distribution, then



      $$f_{Z|Zge t}(s) = frac{phi(s)}{1-Phi(t)}$$
      and hence
      $$E[Z|Z ge t]=frac{int_t^infty sphi(s), ds}{1-Phi(t)}=frac{-int_t^infty phi'(s), ds}{1-Phi(t)}=frac{phi(t)}{1-Phi(t)}$$




      Assuming $X_i$ follows normal distribution $N(a_i, sigma_i^2)$.
      begin{align}
      &beta = E[S_A-T_A] \&=sum_{i=1}^nE[I((X_i-a_i)>(b_isigma_i-a_i))(X_i-a_i)]\
      &= sum_{i=1}^n sigma_i Eleft[Ileft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{X_i-a_i}{sigma_i}right]\
      &=sum_{i=1}^n sigma_iPleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)Eleft[frac{X_i-a_i}{sigma_i}left|frac{X_i-a_i}{sigma}>frac{b_isigma_i-a_i}{sigma}right.right]\
      &=sum_{i=1}^n sigma_i Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{phileft( frac{b_isigma_i-a_i}{sigma_i}right)}{Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)} \
      &=sum_{i=1}^n sigma_i phileft( frac{b_isigma_i-a_i}{sigma_i}right)\
      end{align}



      They then replace some parameters via estimation.






      share|cite|improve this answer











      $endgroup$
















        0












        0








        0





        $begingroup$

        According to their paper, $phi$ refers to the density function of standard normal distribution.



        Let me include some details:



        They use the following result:




        Suppose $Z$ follows standard normal distribution, then



        $$f_{Z|Zge t}(s) = frac{phi(s)}{1-Phi(t)}$$
        and hence
        $$E[Z|Z ge t]=frac{int_t^infty sphi(s), ds}{1-Phi(t)}=frac{-int_t^infty phi'(s), ds}{1-Phi(t)}=frac{phi(t)}{1-Phi(t)}$$




        Assuming $X_i$ follows normal distribution $N(a_i, sigma_i^2)$.
        begin{align}
        &beta = E[S_A-T_A] \&=sum_{i=1}^nE[I((X_i-a_i)>(b_isigma_i-a_i))(X_i-a_i)]\
        &= sum_{i=1}^n sigma_i Eleft[Ileft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{X_i-a_i}{sigma_i}right]\
        &=sum_{i=1}^n sigma_iPleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)Eleft[frac{X_i-a_i}{sigma_i}left|frac{X_i-a_i}{sigma}>frac{b_isigma_i-a_i}{sigma}right.right]\
        &=sum_{i=1}^n sigma_i Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{phileft( frac{b_isigma_i-a_i}{sigma_i}right)}{Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)} \
        &=sum_{i=1}^n sigma_i phileft( frac{b_isigma_i-a_i}{sigma_i}right)\
        end{align}



        They then replace some parameters via estimation.






        share|cite|improve this answer











        $endgroup$



        According to their paper, $phi$ refers to the density function of standard normal distribution.



        Let me include some details:



        They use the following result:




        Suppose $Z$ follows standard normal distribution, then



        $$f_{Z|Zge t}(s) = frac{phi(s)}{1-Phi(t)}$$
        and hence
        $$E[Z|Z ge t]=frac{int_t^infty sphi(s), ds}{1-Phi(t)}=frac{-int_t^infty phi'(s), ds}{1-Phi(t)}=frac{phi(t)}{1-Phi(t)}$$




        Assuming $X_i$ follows normal distribution $N(a_i, sigma_i^2)$.
        begin{align}
        &beta = E[S_A-T_A] \&=sum_{i=1}^nE[I((X_i-a_i)>(b_isigma_i-a_i))(X_i-a_i)]\
        &= sum_{i=1}^n sigma_i Eleft[Ileft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{X_i-a_i}{sigma_i}right]\
        &=sum_{i=1}^n sigma_iPleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)Eleft[frac{X_i-a_i}{sigma_i}left|frac{X_i-a_i}{sigma}>frac{b_isigma_i-a_i}{sigma}right.right]\
        &=sum_{i=1}^n sigma_i Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)frac{phileft( frac{b_isigma_i-a_i}{sigma_i}right)}{Pleft(frac{X_i-a_i}{sigma_i}>frac{b_isigma_i-a_i}{sigma_i}right)} \
        &=sum_{i=1}^n sigma_i phileft( frac{b_isigma_i-a_i}{sigma_i}right)\
        end{align}



        They then replace some parameters via estimation.







        share|cite|improve this answer














        share|cite|improve this answer



        share|cite|improve this answer








        edited Nov 30 '18 at 18:21

























        answered Nov 30 '18 at 18:11









        Siong Thye GohSiong Thye Goh

        100k1465117




        100k1465117






























            draft saved

            draft discarded




















































            Thanks for contributing an answer to Mathematics Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid



            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.


            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2938671%2fwhat-does-phi-mean-in-this-context%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Quarter-circle Tiles

            build a pushdown automaton that recognizes the reverse language of a given pushdown automaton?

            Mont Emei