Deep Learning ACL 2012 Tutorial
Richard Socher,* Yoshua Bengio,† and Christopher Manning*
*Department of Computer Science, Stanford University
†Department of Computer Science and Operations Research, Université de Montréal
July 8, 2012
ACL 2012 Tutorial
References
Ando, Rie Kubota and Tong Zhang. 2005. A framework for learning predictive
structures from multiple tasks and unlabeled data. J. Machine Learning Re-
search 6:1817–1853.
Bengio, Y. 2009. Learning deep architectures for AI. Foundations & Trends in
Mach. Learn. 2(1):1–127.
Bengio, Yoshua, Réjean Ducharme, and Pascal Vincent. 2001. A neural proba-
bilistic language model. In T. K. Leen, T. G. Dietterich, and V. Tresp, eds.,
Advances in NIPS 13, pages 932–938. MIT Press.
Bengio, Yoshua, Réjean Ducharme, Pascal Vincent, and Christian Jauvin. 2003.
A neural probabilistic language model. J. Machine Learning Research 3:1137–
1155.
Bengio, Y., P. Simard, and P. Frasconi. 1994. Learning long-term dependencies
with gradient descent is difficult. IEEE Tr. Neural Networks 5(2):157–166.
Blitzer, John, Kilian Weinberger, Lawrence Saul, and Fernando Pereira. 2005.
Hierarchical distributed representations for statistical language modeling. In
NIPS’2004. Cambridge, MA: MIT Press.
Bordes, Antoine, Xavier Glorot, Jason Weston, and Yoshua Bengio. 2012. Joint
learning of words and meaning representations for open-text semantic parsing.
In AISTATS'2012.
Bordes, Antoine, Jason Weston, Ronan Collobert, and Yoshua Bengio. 2011.
Learning structured embeddings of knowledge bases. In AAAI 2011.
Bottou, L. 2011. From machine learning to machine reasoning. CoRR
abs/1102.1808.
Brown, Peter F., Vincent J. Della Pietra, Peter V. deSouza, Jenifer C. Lai, and
Robert L. Mercer. 1992. Class-based n-gram models of natural language. Com-
putational Linguistics 18(4):467–479.
Clark, Alexander. 2003. Combining distributional and morphological information
for part of speech induction. In EACL 2003, pages 59–66.
Collobert, R. and J. Weston. 2008. A unified architecture for natural language
processing: Deep neural networks with multitask learning. In ICML’2008.
Collobert, Ronan, Jason Weston, Léon Bottou, Michael Karlen, Koray
Kavukcuoglu, and Pavel Kuksa. 2011. Natural language processing (almost)
from scratch. J. Machine Learning Research 12:2493–2537.
Costa, F., P. Frasconi, V. Lombardo, and G. Soda. 2003. Towards incremental
parsing of natural language using recursive neural networks. Applied Intelli-
gence 19.
Dahl, George E., Dong Yu, Li Deng, and Alex Acero. 2012. Context-dependent
pre-trained deep neural networks for large vocabulary speech recognition. IEEE
Transactions on Audio, Speech, and Language Processing 20(1):33–42.
Dauphin, Y., X. Glorot, and Y. Bengio. 2011. Large-scale learning of embed-
dings with reconstruction sampling. In Proceedings of the 28th International
Conference on Machine Learning, ICML '11.
Erhan, Dumitru, Yoshua Bengio, Aaron Courville, Pierre-Antoine Manzagol, Pas-
cal Vincent, and Samy Bengio. 2010. Why does unsupervised pre-training help
deep learning? J. Machine Learning Research 11:625–660.
Glorot, Xavier and Yoshua Bengio. 2010. Understanding the difficulty of training
deep feedforward neural networks. In AISTATS’2010, pages 249–256.
Glorot, Xavier, Antoine Bordes, and Yoshua Bengio. 2011. Deep sparse rectifier
neural networks. In AISTATS’2011.
Goodfellow, Ian, Quoc Le, Andrew Saxe, and Andrew Ng. 2009. Measuring
invariances in deep networks. In NIPS 22, pages 646–654.
Gould, S., R. Fulton, and D. Koller. 2009. Decomposing a scene into geometric
and semantically consistent regions. In ICCV.
Huang, Eric H., Richard Socher, Christopher D. Manning, and Andrew Y. Ng.
2012. Improving word representations via global context and multiple word
prototypes. In ACL 2012.
Koo, Terry, Xavier Carreras, and Michael Collins. 2008. Simple semi-supervised
dependency parsing. In Proceedings of ACL, pages 595–603.
Le, Quoc, Jiquan Ngiam, Adam Coates, Abhik Lahiri, Bobby Prochnow, and
Andrew Ng. 2011. On optimization methods for deep learning. In Proc.
ICML’2011. ACM.
Le, Quoc, Marc’Aurelio Ranzato, Rajat Monga, Matthieu Devin, Greg Corrado,
Kai Chen, Jeff Dean, and Andrew Ng. 2012. Building high-level features using
large scale unsupervised learning. In ICML’2012.
Lee, Honglak, Roger Grosse, Rajesh Ranganath, and Andrew Y. Ng. 2009a. Con-
volutional deep belief networks for scalable unsupervised learning of hierarchi-
cal representations. In ICML’2009.
Lee, Honglak, Peter Pham, Yan Largman, and Andrew Ng. 2009b. Unsuper-
vised feature learning for audio classification using convolutional deep belief
networks. In NIPS’2009.
Martin, Sven, Jörg Liermann, and Hermann Ney. 1998. Algorithms for bigram
and trigram word clustering. Speech Communication 24:19–37.
Menchetti, S., F. Costa, P. Frasconi, and M. Pontil. 2005. Wide coverage natural
language processing using kernel methods and neural networks for structured
data. Pattern Recognition Letters 26(12).
Mikolov, Tomas, Anoop Deoras, Stefan Kombrink, Lukas Burget, and Jan Cer-
nocky. 2011. Empirical evaluation and combination of advanced language mod-
eling techniques. In Proc. 12th Annual Conference of the International Speech
Communication Association (INTERSPEECH 2011).
Mnih, Andriy and Geoffrey E. Hinton. 2007. Three new graphical models for
statistical language modelling. In ICML’2007, pages 641–648.
Morin, Frédéric and Yoshua Bengio. 2005. Hierarchical probabilistic neural net-
work language model. In AISTATS’2005, pages 246–252.
Quattoni, Ariadna, Michael Collins, and Trevor Darrell. 2005. Conditional ran-
dom fields for object recognition. In NIPS’2004, pages 1097–1104. MIT Press.
Mohamed, Abdel-rahman, George Dahl, and Geoffrey Hinton. 2012. Acoustic
modeling using deep belief networks. IEEE Trans. on Audio, Speech and Lan-
guage Processing 20(1):14–22.
Rifai, Salah, Yann Dauphin, Pascal Vincent, and Yoshua Bengio. 2012. A gener-
ative process for contractive auto-encoders. In ICML’2012.
Rifai, Salah, Pascal Vincent, Xavier Muller, Xavier Glorot, and Yoshua Bengio.
2011. Contractive auto-encoders: Explicit invariance during feature extraction.
In ICML’2011.
Schwenk, H. and J-L. Gauvain. 2002. Connectionist language modeling for large
vocabulary continuous speech recognition. In ICASSP, pages 765–768. Or-
lando, Florida.
Schwenk, Holger, Anthony Rousseau, and Mohammed Attik. 2012. Large, pruned
or continuous space language models on a GPU for statistical machine transla-
tion. In Workshop on the Future of Language Modeling for HLT.
Seide, Frank, Gang Li, and Dong Yu. 2011. Conversational speech transcrip-
tion using context-dependent deep neural networks. In Interspeech 2011, pages
437–440.
Sha, Fei and Fernando C. N. Pereira. 2003. Shallow parsing with conditional
random fields. In HLT-NAACL.
Smith, Noah A. and Jason Eisner. 2005. Contrastive estimation: Training log-
linear models on unlabeled data. In Proceedings of the 43rd Annual Meeting of
the Association for Computational Linguistics (ACL’05), pages 354–362. Ann
Arbor, Michigan: Association for Computational Linguistics.
Socher, Richard, Eric H. Huang, Jeffrey Pennington, Andrew Y. Ng, and Christo-
pher D. Manning. 2011a. Dynamic pooling and unfolding recursive autoen-
coders for paraphrase detection. In Advances in Neural Information Processing
Systems 24.
Socher, Richard, Cliff C. Lin, Andrew Y. Ng, and Christopher D. Manning. 2011b.
Parsing natural scenes and natural language with recursive neural networks.
In Proceedings of the 28th International Conference on Machine Learning
(ICML).
Socher, R., C. D. Manning, and A. Y. Ng. 2010. Learning continuous phrase
representations and syntactic parsing with recursive neural networks. In Pro-
ceedings of the NIPS-2010 Deep Learning and Unsupervised Feature Learning
Workshop.
Socher, Richard, Jeffrey Pennington, Eric H. Huang, Andrew Y. Ng, and Christo-
pher D. Manning. 2011c. Semi-supervised recursive autoencoders for predict-
ing sentiment distributions. In Proceedings of the 2011 Conference on Empiri-
cal Methods in Natural Language Processing (EMNLP).
Toutanova, Kristina, Dan Klein, Christopher D. Manning, and Yoram Singer.
2003. Feature-rich part-of-speech tagging with a cyclic dependency network.
In Human Language Technology Conference of the North American Chapter
of the Association for Computational Linguistics (HLT-NAACL 2003), pages
252–259.
Turian, Joseph, Lev Ratinov, and Yoshua Bengio. 2010. Word representations: A
simple and general method for semi-supervised learning. In Proc. ACL’2010,
pages 384–394. Association for Computational Linguistics.
Vincent, Pascal, Hugo Larochelle, Yoshua Bengio, and Pierre-Antoine Manzagol.
2008. Extracting and composing robust features with denoising autoencoders.
In ICML 2008, pages 1096–1103.