How to choose an overcomplete dictionary for dictionary learning in sparse coding?












2














Problem: I'm trying to understand how to choose the dictionary matrix in this paper http://yann.lecun.com/exdb/publis/pdf/gregor-icml-10.pdf. The paper is about Sparse Coding and trained Algorithms for solving the Lasso Problem. Experiments reported in the paper are conducted on datasets of natural image patches and handwritten digits.



Given is the Lasso problem (chapter 1.1 in the paper): $E_{W_d}(x,z) = dfrac12lvertlvert{x}-{W_d}{z}lvertlvert^2_2 + alphalvertlvert{z}lvertlvert_1$, where ${x}inmathbb{R}^n$ is a given input vector, ${z}inmathbb{R}^m$ is the sparse code vector and ${W_d}inmathbb{R}^{ntimes m}$ is a "dictionary matrix whose columns are the (normalized) basis vectors".



My question is: How does $W_d$ and these basis vectors look like, especially in the overcomplete case m > n? How do i choose these basis vectors and the dictionary $W_d$?



In the first Simulation (page 6, chapter 4. Results, first 2 paragraphs) they use a dataset of natural image patches (Screenshot of these 2 parapaghs: enter image description here).



They use image patches of size 10x10 pixels, randomly selected from the Berkeley image database. Therefore they use n = 100 for the input vectors x. For the number of columns in the dictionary they use m = 100 and m = 400 (4 times overcomplete). They then train this dictionary which resulted in gabor-like filters.



How do i choose/initialize this dictionary here before i start training it?
Any answer is appreciated!










share|cite|improve this question





























    2














    Problem: I'm trying to understand how to choose the dictionary matrix in this paper http://yann.lecun.com/exdb/publis/pdf/gregor-icml-10.pdf. The paper is about Sparse Coding and trained Algorithms for solving the Lasso Problem. Experiments reported in the paper are conducted on datasets of natural image patches and handwritten digits.



    Given is the Lasso problem (chapter 1.1 in the paper): $E_{W_d}(x,z) = dfrac12lvertlvert{x}-{W_d}{z}lvertlvert^2_2 + alphalvertlvert{z}lvertlvert_1$, where ${x}inmathbb{R}^n$ is a given input vector, ${z}inmathbb{R}^m$ is the sparse code vector and ${W_d}inmathbb{R}^{ntimes m}$ is a "dictionary matrix whose columns are the (normalized) basis vectors".



    My question is: How does $W_d$ and these basis vectors look like, especially in the overcomplete case m > n? How do i choose these basis vectors and the dictionary $W_d$?



    In the first Simulation (page 6, chapter 4. Results, first 2 paragraphs) they use a dataset of natural image patches (Screenshot of these 2 parapaghs: enter image description here).



    They use image patches of size 10x10 pixels, randomly selected from the Berkeley image database. Therefore they use n = 100 for the input vectors x. For the number of columns in the dictionary they use m = 100 and m = 400 (4 times overcomplete). They then train this dictionary which resulted in gabor-like filters.



    How do i choose/initialize this dictionary here before i start training it?
    Any answer is appreciated!










    share|cite|improve this question



























      2












      2








      2







      Problem: I'm trying to understand how to choose the dictionary matrix in this paper http://yann.lecun.com/exdb/publis/pdf/gregor-icml-10.pdf. The paper is about Sparse Coding and trained Algorithms for solving the Lasso Problem. Experiments reported in the paper are conducted on datasets of natural image patches and handwritten digits.



      Given is the Lasso problem (chapter 1.1 in the paper): $E_{W_d}(x,z) = dfrac12lvertlvert{x}-{W_d}{z}lvertlvert^2_2 + alphalvertlvert{z}lvertlvert_1$, where ${x}inmathbb{R}^n$ is a given input vector, ${z}inmathbb{R}^m$ is the sparse code vector and ${W_d}inmathbb{R}^{ntimes m}$ is a "dictionary matrix whose columns are the (normalized) basis vectors".



      My question is: How does $W_d$ and these basis vectors look like, especially in the overcomplete case m > n? How do i choose these basis vectors and the dictionary $W_d$?



      In the first Simulation (page 6, chapter 4. Results, first 2 paragraphs) they use a dataset of natural image patches (Screenshot of these 2 parapaghs: enter image description here).



      They use image patches of size 10x10 pixels, randomly selected from the Berkeley image database. Therefore they use n = 100 for the input vectors x. For the number of columns in the dictionary they use m = 100 and m = 400 (4 times overcomplete). They then train this dictionary which resulted in gabor-like filters.



      How do i choose/initialize this dictionary here before i start training it?
      Any answer is appreciated!










      share|cite|improve this question















      Problem: I'm trying to understand how to choose the dictionary matrix in this paper http://yann.lecun.com/exdb/publis/pdf/gregor-icml-10.pdf. The paper is about Sparse Coding and trained Algorithms for solving the Lasso Problem. Experiments reported in the paper are conducted on datasets of natural image patches and handwritten digits.



      Given is the Lasso problem (chapter 1.1 in the paper): $E_{W_d}(x,z) = dfrac12lvertlvert{x}-{W_d}{z}lvertlvert^2_2 + alphalvertlvert{z}lvertlvert_1$, where ${x}inmathbb{R}^n$ is a given input vector, ${z}inmathbb{R}^m$ is the sparse code vector and ${W_d}inmathbb{R}^{ntimes m}$ is a "dictionary matrix whose columns are the (normalized) basis vectors".



      My question is: How does $W_d$ and these basis vectors look like, especially in the overcomplete case m > n? How do i choose these basis vectors and the dictionary $W_d$?



      In the first Simulation (page 6, chapter 4. Results, first 2 paragraphs) they use a dataset of natural image patches (Screenshot of these 2 parapaghs: enter image description here).



      They use image patches of size 10x10 pixels, randomly selected from the Berkeley image database. Therefore they use n = 100 for the input vectors x. For the number of columns in the dictionary they use m = 100 and m = 400 (4 times overcomplete). They then train this dictionary which resulted in gabor-like filters.



      How do i choose/initialize this dictionary here before i start training it?
      Any answer is appreciated!







      optimization convex-optimization machine-learning signal-processing






      share|cite|improve this question















      share|cite|improve this question













      share|cite|improve this question




      share|cite|improve this question








      edited Dec 3 at 11:37

























      asked Nov 24 at 18:50









      Sorancon

      113




      113



























          active

          oldest

          votes











          Your Answer





          StackExchange.ifUsing("editor", function () {
          return StackExchange.using("mathjaxEditing", function () {
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          });
          });
          }, "mathjax-editing");

          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "69"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          noCode: true, onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });














          draft saved

          draft discarded


















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3011941%2fhow-to-choose-an-overcomplete-dictionary-for-dictionary-learning-in-sparse-codin%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown






























          active

          oldest

          votes













          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes
















          draft saved

          draft discarded




















































          Thanks for contributing an answer to Mathematics Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.





          Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


          Please pay close attention to the following guidance:


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f3011941%2fhow-to-choose-an-overcomplete-dictionary-for-dictionary-learning-in-sparse-codin%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Quarter-circle Tiles

          build a pushdown automaton that recognizes the reverse language of a given pushdown automaton?

          Mont Emei