Search results

graph Laplacian Regularization for Large-Scale Semidefinite Programming
...comes from expanding the solution of the original problem in terms of the bottom eigenvectors of a graph Laplacian. As the smaller SDPs coming from this fac <math>\,\min_{x_1,...,x_n}\sum_{i\sim j}{(\|x_i-x_j\|^2-d_{ij}^2)^2}</math> (1) ...

12 KB (1,953 words) - 09:45, 30 August 2017
The Curious Case of Degeneration
...the decoding strategies impact machine text. the author For example in the figure below, the GPT2 model tries to generate the continuation text given the con [[File: GPT2_example.png |caption=Example text|center |800px|caption position=bottom]] ...

13 KB (2,144 words) - 05:41, 10 December 2020
deep Generative Stochastic Networks Trainable by Backprop
[[File:figure_1_bengio.png |thumb|upright=1.75| Figure 1 Top: <ref>Bengio, Yoshua, Mesnil, Grégoire, Dauphin, Yann, and Bottom: More generally, a GSN allows the use of arbitrary latent ...

12 KB (1,906 words) - 09:46, 30 August 2017
Conditional Image Synthesis with Auxiliary Classifier GANs
...diagram of the difference in the architecture can be seen in the following figure. ...on (x-axis) and downsized to 32 x 32 (y-axis).|(Odena et al., 2016) Figure 2: (Left) Inception accuracy (y-axis) of two generators with resolution 128 x ...

33 KB (5,219 words) - 10:24, 4 December 2017
the Wake-Sleep Algorithm for Unsupervised Neural Networks
...uction of the original input. This is analogous to learning a data-driven "bottom-up" representation of the input that is used to inform the higher-order "to [[File:Network.png |frame | center |Figure 1: The Helmholtz network structure. ]] ...

16 KB (2,512 words) - 09:46, 30 August 2017
Pixels to Graphs by Associative Embedding
2) Visual relationship detection methods like message passing RNNs and predic 2) Vector embeddings to detect body joints of the various people in an image. ...

17 KB (2,749 words) - 18:26, 16 December 2018
deep Neural Nets as a Method for Quantitative Structure–Activity Relationships
...prediction performance of methods is coefficient of determination (<math>R^2</math>). ...values of one or two parameters at a time, and then calculate the <math>R^2</math> for DNNs trained with the selected parameter settings. These results ...

17 KB (2,705 words) - 09:46, 30 August 2017
Wavelet Pooling CNN
...(p,q) \epsilon R_{ij}} (a_{kpq})</math> with everything defined as before. Figure 1 provides a numerical example that can be followed. ...he data is averaged with values of significantly lower intensities. Figure 2 displays an image of this. ...

15 KB (2,396 words) - 22:57, 20 April 2018
Countering Adversarial Images Using Input Transformations
...This is done by randomly nullifying features within images. Tramèr et al. [2] showed the state-of-the-art Ensemble Adversarial Training Method, which a [[File:non-targeted O.JPG| 600px|center]] ...

32 KB (4,769 words) - 18:45, 16 December 2018
Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations
(2) What set of parameters, <math display="inline"> \vec{\lambda} </math>, bes ...iation is accomplished using a technique called automatic differentiation [2]. Importantly, the weights of the two neural networks will be shared, since ...

23 KB (3,762 words) - 15:51, 6 December 2020
Hierarchical Representations for Efficient Architecture Search
...esentations_for_Efficient_Architecture_Search#Primitive_operations section 2.3] are used to form small networks defined as ''motifs'' by the authors. To ...its bottom and only one complex motif at its top. Any motif in between the bottom and top levels can be defined as the composition of motifs in lower levels ...

30 KB (4,568 words) - 12:53, 11 December 2018
learning Spectral Clustering, With Application To Speech Separation
...} J=\sum^K_{k=1}\sum_{\mathbf x \in C_k}\|\mathbf x - \boldsymbol{\mu}_k\|^2</math> ...math>\mathop{\min_{\mathbf Y}}K-tr(\mathbf{Y^{\rm T}(D^{\rm{1/2}}WD^{\rm{1/2}})Y})</math> ...

35 KB (5,767 words) - 09:45, 30 August 2017
Dynamic Routing Between Capsules
[[File:one-pixel-attack.jpg|500px|middle]] [[File:face2.jpg|200px|middle]] ...

14 KB (2,384 words) - 12:36, 29 March 2018
stat441w18/A New Method of Region Embedding for Text Classification
2. Huang, Jingyue ...fication. It defines <math> region\left ( i,c\right ) </math> as the <math>2\times c+1</math> length region with middle word <math> \omega_i </math> whi ...

13 KB (2,188 words) - 12:42, 15 March 2018
stat946w18/Tensorized LSTMs
a_{t} =h_{t-1}^{cat} W^h + b^h \hspace{2cm} (2) [[File:StdRNN.png|650px|center||Figure 1: Recurrent Neural Network]] ...

25 KB (4,099 words) - 22:50, 20 April 2018
overfeat: integrated recognition, localization and detection using convolutional networks
2. The second idea is to train the system to not only produce a distribution .../en.wikipedia.org/wiki/Information_retrieval#Mean_average_precision mAP]). Figure 1 illustrates the higher difficulty of the detection process. ...

19 KB (2,961 words) - 09:46, 30 August 2017
a fast learning algorithm for deep belief nets
The following figure shows the network used to model the joint distribution The figure below shows a hybrid network where the top two layers have undirected conne ...

12 KB (1,919 words) - 09:46, 30 August 2017
regression on Manifold using Kernel Dimension Reduction
...be a random vector on <math>\,\Omega_{11}\times \Omega_{12} \times \Omega_{2}</math>, where <math>\,X = (U,V)</math>, and let <math>\,H_1 = H_{11} \otim ...decaying, as <math>\,W_{ij} = \exp \left(\frac{- \|x_i - x_j \|^2}{\sigma^2}\right) </math>. Let D denote the diagonal matrix with elements <math>\, D_ ...

26 KB (4,280 words) - 09:45, 30 August 2017
visualizing Data using t-SNE
..._j ||^2/ 2\sigma_i ^2 )}{\sum_{k \neq i} \exp(-||x_i-x_k ||^2/ 2\sigma_i ^2 ) }</math> </center> ...q_{j|i} = \frac{\exp(-||y_i-y_j ||^2)}{\sum_{k \neq i} \exp(-||y_i-y_k ||^2) }</math> </center> ...

19 KB (3,223 words) - 09:45, 30 August 2017
convex and Semi Nonnegative Matrix Factorization
<math> \mathbf {E(F,G) = \|X-FG^T\|^2}</math> <math> J_{K-means} = \sum_{i=1}^n \sum_{k=1}^K g_{ik}||x_i-f_k||^2=||X-FG^T||^2 </math> ...

23 KB (3,920 words) - 09:45, 30 August 2017
