Search results

graph Laplacian Regularization for Larg-Scale Semidefinite Programming
...comes from expanding the solution of the original problem in terms of the bottom eigenvectors of a graph laplacian. As the smaller SDPs coming from this fac <math>\,\min_{x_1,...,x_n}\Sigma_{i\sim j}{(\|x_i-x_j\|^2-d_{ij}^2)^2}</math> (1)<br /> ...

12 KB (1,953 words) - 09:45, 30 August 2017
The Curious Case of Degeneration
...the decoding strategies impact machine text. the author For example in the figure below, the GPT2 model tries to generate the continuation text given the con [[File: GPT2_example.png |caption=Example text|center |800px|caption position=bottom]] ...

13 KB (2,144 words) - 05:41, 10 December 2020
deep Generative Stochastic Networks Trainable by Backprop
[[File:figure_1_bengio.png |thumb|upright=1.75| Figure 1 Top: <ref>Bengio, Yoshua, Mesnil, Gregoire, Dauphin, Yann, and ´ Bottom: More generally, a GSN allows the use of arbitrary latent ...

12 KB (1,906 words) - 09:46, 30 August 2017
Conditional Image Synthesis with Auxiliary Classifier GANs
...diagram of the difference in the architecture can be seen in the following figure. ...on (x-axis) and downsized to 32 x 32 (y-axis).|(Odena et al., 2016) Figure 2: (Left) Inception accuracy (y-axis) of two generators with resolution 128 x ...

33 KB (5,219 words) - 10:24, 4 December 2017
the Wake-Sleep Algorithm for Unsupervised Neural Networks
...uction of the original input. This is analogous to learning a data-driven "bottom-up" representation of the input that is used to inform the higher-order "to [[File:Network.png |frame | center |Figure 1: The Helmholtz network structure. ]] ...

16 KB (2,512 words) - 09:46, 30 August 2017
Pixels to Graphs by Associative Embedding
2) Visual relationship detection methods like message passing RNNs and predic 2) Vector embeddings to detect body joints of the various people in an image. ...

17 KB (2,749 words) - 18:26, 16 December 2018
deep Neural Nets as a Method for Quantitative Structure–Activity Relationships
...prediction performance of methods is coefficient of determination (<math>R^2</math>). ...values of one or two parameters at a time, and then calculate the <math>R^2</math> for DNNs trained with the selected parameter settings. These results ...

17 KB (2,705 words) - 09:46, 30 August 2017
Wavelet Pooling CNN
...(p,q) \epsilon R_{ij}} (a_{kpq})</math> with everything defined as before. Figure 1 provides a numerical example that can be followed. ...he data is averaged with values of significantly lower intensities. Figure 2 displays an image of this. ...

15 KB (2,396 words) - 22:57, 20 April 2018
Countering Adversarial Images Using Input Transformations
...This is done by randomly nullifying features within images. Tramer et al [2], showed the state-of-the-art Ensemble Adversarial Training Method, which a [[File:non-targeted O.JPG| 600px|center]] ...

32 KB (4,769 words) - 18:45, 16 December 2018
Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations
(2) What set of parameters, <math display="inline"> \vec{\lambda} </math>, bes ...iation is accomplished using a technique called automatic differentiation [2]. Importantly, the weights of the two neural networks will be shared, since ...

23 KB (3,762 words) - 15:51, 6 December 2020
Hierarchical Representations for Efficient Architecture Search
...esentations_for_Efficient_Architecture_Search#Primitive_operations section 2.3] are used to form small networks defined as ''motifs'' by the authors. To ...its bottom and only one complex motif at its top. Any motif in between the bottom and top levels can be defined as the composition of motifs in lower levels ...

30 KB (4,568 words) - 12:53, 11 December 2018
learning Spectral Clustering, With Application To Speech Separation
...} J=\sum^K_{k=1}\sum_{\mathbf x \in C_k}\|\mathbf x - \boldsymbol{\mu}_k\|^2</math> ...math>\mathop{\min_{\mathbf Y}}K-tr(\mathbf{Y^{\rm T}(D^{\rm{1/2}}WD^{\rm{1/2}})Y})</math> ...

35 KB (5,767 words) - 09:45, 30 August 2017
Dynamic Routing Between Capsules
[[File:one-pixel-attack.jpg|500px|middle]] [[File:face2.jpg|200px|middle]] ...

14 KB (2,384 words) - 12:36, 29 March 2018
stat441w18/A New Method of Region Embedding for Text Classification
2. Huang, Jingyue ...fication. It defines <math> region\left ( i,c\right ) </math> as the <math>2\times c+1</math> length region with middle word <math> \omega_i </math> whi ...

13 KB (2,188 words) - 12:42, 15 March 2018
stat946w18/Tensorized LSTMs
a_{t} =h_{t-1}^{cat} W^h + b^h \hspace{2cm} (2) [[File:StdRNN.png|650px|center||Figure 1: Recurrent Neural Network]] ...

25 KB (4,099 words) - 22:50, 20 April 2018
overfeat: integrated recognition, localization and detection using convolutional networks
2. The second idea is to train the system to not only produce a distribution .../en.wikipedia.org/wiki/Information_retrieval#Mean_average_precision mAP]). Figure 1 illustrates the higher difficulty of the detection process. ...

19 KB (2,961 words) - 09:46, 30 August 2017
a fast learning algorithm for deep belief nets
The following figure shows the network used to model the joint distribution The figure below shows a hybrid network where the top two layers have undirected conne ...

12 KB (1,919 words) - 09:46, 30 August 2017
regression on Manifold using Kernel Dimension Reduction
...be a random vector on <math>\,\Omega_{11}\times \Omega_{12} \times \Omega_{2}</math>, where <math>\,X = (U,V)</math>, and let <math>\,H_1 = H_{11} \otim ...decaying, as <math>\,W_{ij} = exp \left(\frac{- \|x_i - x_j \|^2}{\sigma^2}\right) </math>. Let D denote the diagonal matrix with elements <math>\, D_ ...

26 KB (4,280 words) - 09:45, 30 August 2017
visualizing Data using t-SNE
..._j ||^2/ 2\sigma_i ^2 )}{\sum_{k \neq i} \exp(-||x_i-x_k ||^2/ 2\sigma_i ^2 ) }</math> </center> ...q_{j|i} = \frac{\exp(-||y_i-y_j ||^2)}{\sum_{k \neq i} \exp(-||y_i-y_k ||^2) }</math> </center> ...

19 KB (3,223 words) - 09:45, 30 August 2017
convex and Semi Nonnegative Matrix Factorization
<math> \mathbf {E(F,G) = \|X-FG^T\|^2}</math> <math> J_{K-means} = \sum_{i=1}^n \sum_{k=1}^K g_{ik}||x_i-f_k||^2=||X-FG^T||^2 </math> ...

23 KB (3,920 words) - 09:45, 30 August 2017
imageNet Classification with Deep Convolutional Neural Networks
In this paper, they trained a large, deep neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the ImageNet Large-Scale Visual Recognition Challenge (ILSVRC) has roughly 1.2 million labeled high-resolution training images, 50 thousand validation ima ...

13 KB (1,939 words) - 09:46, 30 August 2017
Meta-Learning For Domain Generalization
...and transferred as a model that is likely to work on a new source domain [2]. Finally, a domain-invariant feature representation is learned to minimize [[File:ashraf1.jpg |center|600px]] ...

14 KB (2,177 words) - 00:41, 7 December 2020
Learning to Navigate in Cities Without a Map
2. Planning and acting using that map. ...View Images and a dual pathway agent architecture is designed. As shown in figure 1, some parts of the environment are built using Google StreetView images o ...

28 KB (4,494 words) - 00:24, 17 December 2018
One-Shot Object Detection with Co-Attention and Co-Excitation
<div align="center">'''Figure 1:''' Object Detection on an image</div> Figure 1 shows an example where the model identifies and locates all the instances ...

22 KB (3,609 words) - 21:53, 6 December 2020
Patch Based Convolutional Neural Network for Whole Slide Tissue Image Classification
[[File: Whole Slide Tissue Image of a grade IV tumor.png|thumb|300px|upleft|Figure 1: Whole Slide Tissue Image of a grade IV tumor. Features indicating subtyp ...3]. These methods excel when an abundance of patch labels are provided [1, 2], allowing patch-level supervised classifiers to learn the assortment of ca ...

16 KB (2,470 words) - 14:07, 19 November 2021
STAT946F17/Conditional Image Generation with PixelCNN Decoders
...parallelize the training process. In this work, Oord et al. [[#Reference|[2]]] introduced a Gated PixelCNN, which is a convolutional variant of the Pix ...el is a simple generative method wherein given an image of dimension $x_{n^2}$, we iterate, employ feedback and capture pixel densities from every pixel ...

31 KB (4,917 words) - 12:47, 4 December 2017
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
...or meta-learning methods that learn an update function or learning rule [1,2,3,4], this algorithm does not expand the number of learned parameters nor p [[File:model.png|200px|right|thumb|Figure 1: Diagram of the MAML algorithm]] ...

26 KB (4,205 words) - 10:18, 4 December 2017
learning Long-Range Vision for Autonomous Off-Road Driving
...ange vision for autonomous off‐road driving." Journal of Field Robotics 26.2 (2009): 120-144.</ref>; to robustly increase the obstacle and path detecti ...issues, a normalized “pyramid” containing 7 sub-images are extracted (see figure above), where the top row of the pyramid has a range from 112 meters to inf ...

20 KB (3,026 words) - 09:46, 30 August 2017
MarrNet: 3D Shape Reconstruction via 2.5D Sketches
...t varies based on colour, texture, lighting, background, etc. However, the 2.5D sketches (e.g. depth or normal maps) of the object remain constant, and ...n the idea from [Marr, 1982] that human 3D perception relies on recovering 2.5D sketches, which include depth maps (contains information related to the ...

21 KB (3,383 words) - 22:42, 20 April 2018
what game are we playing
...u - \log v - 1 + \nu 1 \\ 1^T u - 1 \\ 1^T v - 1 \\ \end{bmatrix}, \qquad (2) 2. For small changes du, dv, ...

25 KB (4,131 words) - 23:55, 6 December 2020
a neural representation of sketch drawings
...are models that use Hidden Markov Models [25] or Mixture Density Networks [2] to generate human sketches, continuous data points (modelling Chinese char ...classes, each class has 70k training samples, 2.5k validation samples, and 2.5k test samples. ...

30 KB (4,807 words) - 00:40, 17 December 2018
Zero-Shot Visual Imitation
[[File:1-GSP.png | 650px|thumb|center|Figure 1: The goal-conditioned skill policy (GSP) takes as input the current and g ...termines whether the goal has been satisfied with respect to some metrics. Figure 1 shows various GSPs along with diagram (d) showing the forward-consistent ...

31 KB (4,977 words) - 18:42, 16 December 2018
orthogonal gradient descent for continual learning
...utputs. The results of OGD-AVE, ODG-GTL, OGD-ALL are compared to SGD, ECW [2], (a regularization method using Fischer information for importance weights ...sk is a fixed permutation that gets applied to each MNIST digit. The below figure shows the performance comparison of different methods when applied on the p ...

15 KB (2,322 words) - 23:30, 7 December 2020
conditional neural process
[[File:001.jpg|300px|center]] 2. A CNP is permutation invariant in <math display="inline">O</math> and <mat ...

32 KB (4,970 words) - 00:26, 17 December 2018
FeUdal Networks for Hierarchical Reinforcement Learning
...at the top level (manager) produces a meaningful and explicit goal for the bottom level (worker) to achieve. ...ds using auxiliary losses and rewards such as pseudo count for exploration[2] have significantly improved results by stimulating agents to explore new p ...

20 KB (3,237 words) - 01:59, 3 December 2017
Superhuman AI for Multiplayer Poker
* 2 different pairs Consider the Lemonade Stand example from Figure 1 Below. We have 4 players and the goal for each player is to find a spot i ...

26 KB (4,248 words) - 00:06, 8 December 2020
Convolutional Sequence to Sequence Learning
...ndamental idea. This is given by <math>2^{{H(p)}}=2^{{-\sum _{x}p(x)\log _{2}p(x)}} </math> Suppose you have a four-sided dice (not sure what that’d be) The following figure depicts the underlying architecture: ...

27 KB (4,178 words) - 20:37, 28 November 2017
Describtion of Text Mining
'''2. Filtering''' The weights have 2 main models used Boolean model and TF-IDF model: ...

31 KB (4,992 words) - 05:11, 15 December 2020
stat946f11pool
In the first figure (Fig. 21) we have no information about the node Y and so we can not say if ...raphs. Here the Bayes ball rule is simpler and more intuitive. Considering Figure.... , a ball can be thrown either from x to z or from z to x if y is not ob ...

100 KB (18,249 words) - 09:45, 30 August 2017
stat341f11
i.e. From <math>x \sim~f(x)</math> sample <math>\,x_{1}, x_{2}, ..., x_{1000}</math> for ii = 2:1000 ...

139 KB (23,688 words) - 09:45, 30 August 2017
stat340s13
...chers from Cornell University, collected 509 million Twitter messages from 2.4 million users in 84 different countries. The data they used were words co 002: T/Th 1:00-2:20pm DC1351 <br /> ...

370 KB (63,356 words) - 09:46, 30 August 2017
stat841f14
The figure below demonstrates an example of PCA. Data is transformed from original 3D Image:PCA 2.png|Extraction of the principal components ...

220 KB (37,901 words) - 09:46, 30 August 2017
stat841f10
...ally distributed (i.i.d)] ordered pairs <math>\,\{(X_{1},Y_{1}), (X_{2},Y_{2}), \dots , (X_{n},Y_{n})\}</math>, where the values of the <math>\,ith</mat [[File:Data1.jpg]] ...

451 KB (73,277 words) - 09:45, 30 August 2017

Search results

Navigation menu

Search