Search results
- MNs are trained to assign label $\hat{y}$ to probe image $\hat{x}$ using an attention mechanism $a$ acting on image embeddings stored in the support set S: ...as an input sequence. The embedding $f(\hat{x}, S)$ is an LSTM with a read-attention mechanism operating over the entire embedded support set. The input to the ...22 KB (3,531 words) - 20:30, 28 November 2017
- The decoder makes use of the attention mechanism of Bahdanau et al. (2014). To compute the probability of a given with an attention mechanism. Then without any constraint, the auto-encoder attempts to merely c ...28 KB (4,522 words) - 21:29, 20 April 2018
- ...e to be misclassified as a base. Finally the paper hopes that it can raise attention for the important issues of data reliability and data sourcing. ...11 KB (1,590 words) - 18:29, 26 November 2021
- ...d be really curious to test their methodology in Large Models, adding self-attention layers to models improves robustness. To test the abstraction properties, w ...11 KB (1,652 words) - 18:44, 6 December 2020
- ...s in a learned metric space using e.g. Siamese networks or recurrence with attention mechanisms”, the proposed method can be generalized to any other problems i ...xperimented on by the authors) would be to explore methods of attaching an Attention Kernel which results in a simple and differentiable loss. It has been imple ...26 KB (4,205 words) - 10:18, 4 December 2017
- {{DISPLAYTITLE:stat441w18/Saliency-based Sequential Image Attention with Multiset Prediction}} ...12 KB (1,840 words) - 14:09, 20 March 2018
- The translation model uses a standard encoder-decoder model with attention. The encoder is a 2-layer bidirectional RNN, and the decoder is a 2 layer R ...ervised model to perform translations with monolingual corpora by using an attention-based encoder-decoder system and training using denoise and back-translatio ...28 KB (4,293 words) - 00:28, 17 December 2018
- ..., the RNN is attached with an external content-addressable memory bank. An attention mechanism within the controller network does the read-write to the memory b ...memories must be embedded in fixed-size vectors and retrieved through some attention mechanism. In contrast, trainable synaptic plasticity translates into very ...27 KB (4,100 words) - 18:28, 16 December 2018
- ...t annotations to generate each target word; this implements a mechanism of attention in the decoder. ...14 KB (2,221 words) - 09:46, 30 August 2017
- ...ased on specific implementation of neural machine translation that uses an attention mechanism, as recently proposed in <ref> ...14 KB (2,301 words) - 09:46, 30 August 2017
- ...the previous sections, alternatively updating G and D requires significant attention. We modify the way we update the generator G to improve stability and gener ...15 KB (2,279 words) - 22:00, 14 March 2018
- This idea is also successfully used in attention networks[13] such as image captioning and machine translation. In this pape ...o, K., Courville, A., and Bengio, Y. Describing multi- media content using attention-based Encoder–Decoder networks. IEEE Transactions on Multimedia, 17(11): 18 ...29 KB (4,577 words) - 10:13, 14 December 2018
- ...oped in the literature. The latter sub-task has received relatively little attention and is typically borrowed without justification from the PCA context. In th ...20 KB (3,332 words) - 09:45, 30 August 2017
- ...ls which can do few-shot estimations of data. This can be implemented with attention mechanisms (Reed et al., 2017) or additional memory units in a VAE model ( ...their case features of samples are compared with target features using an attention kernel. At a higher level one can interpret this model as a CNP where the a ...32 KB (4,970 words) - 00:26, 17 December 2018
- ...ng generative adversarial networks[8], variational autoencoders (VAE)[17], attention models[18], have shown that a deep network can learn an image distribution ...32 KB (4,965 words) - 15:02, 4 December 2017
- Attention-based models: #Bahdanau et al. (2014): These are a different class of models which use attention modules (different architectures) to help focus the neural network to decide ...31 KB (5,069 words) - 18:21, 16 December 2018
- ...red solution in image recognition and computer vision problems, increasing attention has been dedicated to evolving the network architecture to further improve ...16 KB (2,542 words) - 17:26, 26 November 2018
- field has attracted the attention of a wide research community, which resulted in ...16 KB (2,430 words) - 00:54, 7 December 2020
- Neural networks first caught people’s attention during the 2012 ImageNet contest. A solution using a neural network achieve 8 ...17 KB (2,650 words) - 23:54, 30 March 2018
- Independent component analysis has been given more attention recently. It has become a popular method for estimating the independent feat ...17 KB (2,679 words) - 09:45, 30 August 2017