Learning Attention-Based Models


Today I read an interesting post that argues for dropping RNNs in sequence models.

Unfortunately, the post didn't go into much detail on how one would study attention-based models and start experimenting with them.

The only useful link was to this paper, which uses convolutional networks.

The paper is very dense, which makes it hard for me to follow, and I couldn't find any books on implementing these attention models.

Does anyone have any suggestions?


Posted 2018-09-20

Use GitHub to find them. Books are (partly) no longer needed when it comes to DL, since the latest developments will never make it into a book in time, but they will be on GitHub! – Aditya – 2018-09-20T16:00:07.083

