57119nam#a2203937#i#450000100050000000500170000500800400002202000230006204400090008508001010009408400390019508400600023408400890029408400890038308400670047208400650053910001010060424501090070526000440081430000120085850006040087051001260147451001120160051002010171251002460191351000930215951000690225251001240232151001350244551000670258051001280264751000630277551001210283851000990295951000960305851000820315451000880323651002020332451001790352651002320370551001440393751000950408151001350417651001200431151001200443151000990455151001140465051001020476451001710486651000560503751002320509351002920532551001280561751000910574551001100583651001460594651001640609251002110625651001920646751003120665951001090697151001130708051002140719351001900740751001830759751001670778051001130794751001330806051001800819351002180837351000980859151001380868951001230882751001740895051001920912451001010931651001260941751001370954351001490968051000850982951001450991451001441005951002401020351001591044351001931060251002101079551001651100551002371117051000741140751001521148151001231163351003591175651001051211551000601222051002411228051001741252151002871269551001121298251002191309451002151331351001741352851001631370251001411386551001611400651001181416751002551428551002101454051002291475051001141497951001391509351001621523251002261539451001791562051001251579951001311592451001541605551001291620951000941633851002041643251001061663651001601674251002291690251001681713151001811729951002281748051002111770851001491791951001971806851001821826551000911844751001871853851001361872551001581886151001681901951001461918751002571933351002401959051001231983051001091995351002852006251002382034751001962058551001202078151002032090151001922110451002032129651001652149951001632166451002122182751001772203951002482221651002092246451001082267351001242278151001612290551001852306651001522325151002192340351001122362251001712373451001272390551002522403251001592428451001872444351002112463051001732484151002052501451001922521951001762
5411510011325587510022325700510022825923510010226151510006426253510019426317510007126511510022926582510018726811510021126998510019127209510011627400510027027516510012727786510007427913510033627987510012628323510010628449510017528555510017728730510027528907510015029182510020329332510023629535510018629771510009229957510011630049510019830165510015230363510020730515510022330722510017230945510010731117510010531224510023731329510011931566510013731685510009131822510016831913510022532081510016132306510011732467510028532584510016532869510016233034510006833196510021133264510026333475510026733738510015134005510017034156510007534326510020134401510013234602510020534734510018034939510021735119510018535336510012935521510017335650510019335823510014736016510011236163510010236275510020636377510011036583510023536693510013736928510009237065510021037157510026837367510022837635510017137863510018638034510020338220510017638423510014838599510018538747510017138932510014439103510013539247510015439382510009339536510014639629510015539775510018439930510016940114510017340283510022040456510014140676510014940817510022340966510025641189510010641445510023041551510035141781510017942132510022342311510028742534510029842821510014843119510018043267510017943447510014743626510023243773510010644005510023144111510023244342510015044574510018944724510011744913510015745030510018845187510027245375510018545647510020945832510018146041510013746222510011146359510013146470510008146601510013146682510016646813510011446979510015347093510013247246510009347378510023447471510015847705510026647863510022448129510021248353510013948565510014148704510013848845510032448983510017349307510018649480510017249666510013049838510009849968510017750066510009950243510019850342510008150540510014350621510009650764510016250860510008451022510013351106510028951239510027251528510019751800510012751997510017552124510016652299510024152465510023852706510018752944533003353131856001753164159120260501150456.5 
20190625d2019####ek#y0engy0150####ca##$a978-5-369-02011-1##$axxu##$aGeneral questions of the mathematical and natural sciences. 50##$aCybernetics. 32812bbk##$aComputer engineering. 32972bbk##$aComputer and information sciences. 02.07.012okso##$aComputer and information sciences. 02.06.012okso##$aPhysical and mathematical sciences. 612tbk##$aArtificial intelligence. 28.232grnti#1$aShumskiy, Sergey Aleksandrovich$aMoscow Institute of Physics and Technology (State University)00$aMACHINE INTELLIGENCE. ESSAYS ON THE THEORY OF MACHINE LEARNING AND ARTIFICIAL INTELLIGENCE$cMonograph1#$aMoscow$bPublishing Center RIOR$c2019##$a340 p.##$aThis book is about the nature of mind, both human and artificial, from the standpoint of the theory of machine learning. It addresses the problem of creating artificial general intelligence. The author shows how one can use the basic mechanisms of our brain to create artificial brains of future robots. How will this ever-stronger artificial intelligence fit into our lives? What awaits us in the next 10-15 years? How can someone who wants to take part in a new scientific revolution participate in developing a new science of mind?$amachine learning, artificial intelligence$a10.29039/02011-10#$aA. A. Ezhov and S.A. Shumskiy. Neyrokomp'yuting i ego prilozheniya v ekonomike i biznese. MIFI, 1998. ISBN 5-722-0252-H.0#$aD.A. Kovalevich and P.G. Schedrovickiy. Konveyer innovaciy. 2015. https://asi.ru/conveyor-of-iimovations/.0#$aB. Zavadovskaya and K. Karpov. Reyting kompaniy po proizvoditel'nosti truda sotrudnikov. 2017. https://bcs-express.ru/novosti-i-analitika/reiting-kompanii-po-proizvoditel-nosti-truda-sotrudnikov.0#$aAleksandr Markov and Mihail Markov. Mnogourovnevyy otbor i problema rosta mozga u pleystocenovyh homo. Opyt komp'yuternogo modelirovaniya sopryazhennoy evolyucii genov i memov, 2019. URL https://www.youtube.com/watch?v=AERQrIyk7og&t=5192s.0#$aVladimir Ivanovich Vernadskiy. Trudy po vseobschey istorii nauki. 
Ripol Klassik, 1988.0#$aLev Semenovich Vygotskiy. Myshlenie i rech'. Directmedia, 2014.0#$aI. R. Agamirzyan. Tehnologicheskoe liderstvo: vospol'zovat'sya shansom. In Vyzov 2035, pages 8-15. Olimp-Biznes, 2016.0#$aLiza Fel'dman Barrett. Kak rozhdayutsya emocii. Revolyuciya v ponimanii mozga i upravlenii emociyami. Mann, Ivanov i Faber, 2018.0#$aTomas Kun. Struktura nauchnyh revolyuciy. M.: Progress, 1977.0#$aS.P. Kapica. Obschaya teoriya rosta chelovechestva: skol'ko lyudey zhilo, zhivet, i budet zhit' na Zemle. M.: Nauka, 1999.0#$aStanislav Lem. Golem XIV. Biblioteka XXI veka. ACT, 2002.0#$aI.M. Nozhov. Morfologicheskaya i sintaksicheskaya obrabotka teksta (modeli i programmy). Kand. dissertaciya, 2003.0#$aMark Beyker. Atomy yazyka: Grammatika v temnom pole soznaniya. LKI, 2008. ISBN 9785382004303.0#$aSaymon Haykin. Neyronnye seti: polnyy kurs, 2-e izdanie. Izdatel'skiy dom Vil'yams, 2008.0#$aKarlota Peres. Tehnologicheskie revolyucii i finansovyy kapital. Delo, 2011.0#$aKris Anderson. Dlinnyy hvost. Effektivnaya model' biznesa v Internete. MIF, 2012.0#$aS.A. Shumskiy. Yazyk i mozg: kak chelovek ponimaet rech'. In Sbornik nauchnyh trudov XV Vserossiyskoy nauchnoy konferencii Neyroinformatika-2013. Lekcii po neyroinformatike, pages 72-105, 2013.0#$aK.V. Anohin. Kognitom: v poiskah obschey teorii kognitivnoy nauki. In Shestaya mezhdunarodnaya konferenciya po kognitivnoy nauke: tez. dokl. Kaliningrad, pages 26-28, 2014.0#$aS.A. Shumskiy. Reinzhiniring arhitektury mozga: rol' i vzaimodeystvie osnovnyh podsistem. In Sbornik nauchnyh trudov XVII Vserossiyskoy nauchnoy konferencii Neyroinformatika-2015. Lekcii po neyroinformatike, pages 13-45, 2015.0#$aEvgeniy Kuznecov. Rossiya i mir tehnologicheskogo diktata: 3 scenariya buduschego, 2016. URL https://www.youtube.com/watch?v=9GtG_kczrFE.0#$aMihail Nikitin. Proishozhdenie zhizni. Ot tumannosti do kletki. Al'pina non-fikshn, 2016.0#$aP.G. Schedrovickiy. 
Istoriya promyshlennyh revolyuciy i vyzovy III promyshlennoy revolyucii. 2016. https://youtu.be/_cpWkGwZMSI.0#$aDzhordzh Lakoff. Zhenschiny, ogon' i opasnye veschi. Chto kategorii yazyka govoryat nam o myshlenii. Litres, 2017.0#$aAleksandr Markov. Evolyuciya razuma i soprotivlenie nauke, 2017. URL https://www.youtube.com/watch?v=qTOyKOryWQY.0#$aK.V. Anohin. Kognitom - gipersetevaya model' mozga, 2018a. URL https://youtu.be/tDalzRYEhss.0#$aK.V. Anohin. Mozg, kak set', i razum, kak set' - vyzovy matematike, 2018b. URL https://youtu.be/tDalzRYEhss.0#$aSvetlana Burlak. Proishozhdenie yazyka: Fakty, issledovaniya, gipotezy. Al'pina Pablisher, 2018.0#$aS.V. Karelov. Vperedi II-nacionalizm i II-nacionalizaciya. 2018. http://russiancouncil.ru/activity/digest/longreads/vperedi-ii-natsionalizm-i-ii-natsionalizatsiya/.0#$aErvin Shredinger. Chto takoe zhizn'? Litres, 2018.0#$aS.A. Shumskiy. Glubokoe strukturnoe obuchenie: Novyy vzglyad na obuchenie s podkrepleniem. In Sbornik nauchnyh trudov XX Vserossiyskoy nauchnoy konferencii Neyroinformatika-2018. Lekcii po neyroinformatike, pages 11-43, 2018.0#$aS.A. Terehov. Tenzornye dekompozicii v statisticheskom prinyatii resheniy. In Sbornik nauchnyh trudov XX Vserossiyskoy nauchnoy konferencii Neyroinformatika-2018. Lekcii po neyroinformatike, pages 53-58, 2018. URL http://raai.org/library/books/Konf_II_problem-2018/bookl_intellect.pdf.0#$aDH Medouz, I Randers, and DL Medouz. Predely rosta: 30 let spustya per. s angl. ES Oganesyan; pod red. NP Tarasovoy. 2012.0#$aYan Gudfellou, Bendzhio Ioshua, and Aaron Kurvill'. Glubokoe obuchenie. Litres, 2018.0#$aA.V. Korotaev, S.Yu. Malkov, and L.E. Grinina. Analiz i modelirovanie global'noy dinamiki. Lenand, 2018.0#$aS. Nikolenko, A. Kadurin, and E. Arhangel'skaya. Glubokoe obuchenie. Pogruzhenie v mir neyronnyh setey. Piter, 2018. ISBN 978-5-496-02536-2.0#$aAlphastar: Mastering the real-time strategy game starcraft ii, 2018. 
URL https://deepmind.com/blog/alphastar-mastering-real-time-strategy-game-starcraft-ii/.0#$aMichal Aharon, Michael Elad, and Alfred Bruckstein. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Transactions on signal processing, 54(11):4311-4322, 2006.0#$aR Aharonov and N Slonim. Watch ibm's ai system debate a human champion live at think 2019. IBM Research blog, 2019. URL https://www.ibm.com/blogs/research/2019/02/ai-debate-think-2019/.0#$aDario Amodei, Sundaram Ananthanarayanan, Rishita Anubhai, Jingliang Bai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Qiang Cheng, Guoliang Chen, et al. Deep speech 2: End-to-end speech recognition in english and mandarin. In International Conference on Machine Learning, pages 173-182, 2016.0#$aRelja Arandjelovic and Andrew Zisserman. Look, listen and learn. arXiv preprint arXiv:1705.08168, 2017.0#$aMartin Arjovsky, Soumith Chintala, and Leon Bottou. Wasserstein gan. arXiv preprint arXiv:1701.07875, 2017.0#$aF Gregory Ashby, Shawn W Ell, Vivian V Valentin, and Michael V Casale. Frost: a distributed neurocomputational model of working memory maintenance. Journal of cognitive neuroscience, 17(11):1728-1743, 2005.0#$aBernard J Baars, Stan Franklin, and Thomas Zoëga Ramsøy. Global workspace dynamics: cortical "binding and propagation" enables conscious contents. Frontiers in psychology, 4:200, 2013.0#$aJoscha Bach. The cortical conductor theory: Towards addressing consciousness in ai models. In Biologically Inspired Cognitive Architectures Meeting, pages 16-26. Springer, 2018.0#$aDzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473, 2014.0#$aLisa Feldman Barrett. How emotions are made: The secret life of the brain. Houghton Mifflin Harcourt, 2017.0#$aLisa Feldman Barrett and W Kyle Simmons. Interoceptive predictions in the brain. 
Nature Reviews Neuroscience, 16(7):419, 2015.0#$aAndre M Bastos, W Martin Usrey, Rick A Adams, George R Mangun, Pascal Fries, and Karl J Friston. Canonical microcircuits for predictive coding. Neuron, 76(4):695-711, 2012.0#$aFrancesco P Battaglia, Karim Benchenane, Anton Sirota, Cyriel MA Pennartz, and Sidney I Wiener. The hippocampus: hub of brain network communication for memory. Trends in cognitive sciences, 15(7):310-318, 2011.0#$aJames A Bednar and Stuart P Wilson. Cortical maps. The Neuroscientist, 22(6):604-617, 2016.0#$aEric D Beinhocker. The origin of wealth: Evolution, complexity, and the radical remaking of economics. Harvard Business Press, 2006.0#$aTimothy S Bell, John G Cleary, and Ian H Witten. Text compression, volume 348. Prentice Hall Englewood Cliffs, 1990.0#$aYoshua Bengio. Deep learning of representations: Looking forward. In International Conference on Statistical Language and Speech Processing, pages 1-37. Springer, 2013.0#$aYoshua Bengio, Pascal Lamblin, Dan Popovici, and Hugo Larochelle. Greedy layer-wise training of deep networks. In Advances in neural information processing systems, pages 153-160, 2007.0#$aYoshua Bengio, Ian J Goodfellow, and Aaron Courville. Deep learning. Nature, 521:436-444, 2015.0#$aYoshua Bengio et al. Learning deep architectures for ai. Foundations and trends® in Machine Learning, 2(1):1-127, 2009.0#$aCharles H Bennett. The thermodynamics of computation - a review. International Journal of Theoretical Physics, 21(12):905-940, 1982.0#$aDavid Berthelot, Tom Schumm, and Luke Metz. Began: Boundary equilibrium generative adversarial networks. arXiv preprint arXiv:1703.10717, 2017.0#$aChristopher M Bishop. Pattern recognition and machine learning. Springer, 2006.0#$aDavid M Blei, Andrew Y Ng, and Michael I Jordan. Latent dirichlet allocation. Journal of machine Learning research, 3(Jan):993-1022, 2003.0#$aMatthew Michael Botvinick. Hierarchical reinforcement learning and decision making. 
Current opinion in neurobiology, 22(6):956-962, 2012.0#$aClemens Boucsein, Martin Nawrot, Philipp Schnepel, and Ad Aertsen. Beyond the cortical column: abundance and physiology of horizontal connections imply a strong role for inputs from the surround. Frontiers in neuroscience, 5:32, 2011.0#$aAlan J Bray and David S Dean. Statistics of critical points of gaussian fields on large-dimensional spaces. Physical review letters, 98(15):150201, 2007.0#$aPeter F Brown, Peter V Desouza, Robert L Mercer, Vincent J Della Pietra, and Jenifer S Lai. Class-based n-gram models of natural language. Computational linguistics, 18(4):467-479, 1992a.0#$aPeter F Brown, Vincent J Della Pietra, Robert L Mercer, Stephen A Della Pietra, and Jennifer S Lai. An estimate of an upper bound for the entropy of english. Computational Linguistics, 18(1):31-40, 1992b.0#$aJ Bughin, J Seong, J Manyika, M Chui, and R Joshi. Notes from the ai frontier: Modeling the impact of ai on the world economy. McKinsey Global Institute, 2018.0#$aJacques Bughin, E Hazan, S Ramaswamy, M Chui, T Allas, P Dahlstrom, N Henke, and M Trench. Artificial intelligence - the next digital frontier. McKinsey Global Institute, 2017. URL https://www.mckinsey.de/files/170620_studie_ai.pdf.0#$aGyorgy Buzsaki. Rhythms of the Brain. Oxford University Press, 2006.0#$aGyorgy Buzsaki and Edvard I Moser. Memory, navigation and theta rhythm in the hippocampal-entorhinal system. Nature neuroscience, 16(2):130, 2013.0#$aBradley P Carlin and Thomas A Louis. Bayes and empirical Bayes methods for data analysis. Chapman and Hall/CRC, 2010.0#$aChung-Cheng Chiu, Tara N Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J Weiss, Kanishka Rao, Ekaterina Gonina, et al. State-of-the-art speech recognition with sequence-to-sequence models. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 4774-4778. IEEE, 2018.0#$aNoam Chomsky. 
Knowledge of language: Its nature, origin, and use. Greenwood Publishing Group, 1986.0#$aNoam Chomsky. The minimalist program. MIT press, 2014.0#$aDan Cireşan, Alessandro Giusti, Luca M Gambardella, and Jürgen Schmidhuber. Deep neural networks segment neuronal membranes in electron microscopy images. In Advances in neural information processing systems, pages 2843-2851, 2012a.0#$aDan Cireşan, Ueli Meier, Jonathan Masci, and Jürgen Schmidhuber. Multi-column deep neural network for traffic sign classification. Neural Networks, 32:333-338, 2012b.0#$aDan S Cireşan, Alessandro Giusti, Luca M Gambardella, and Jürgen Schmidhuber. Mitosis detection in breast cancer histology images with deep neural networks. In International Conference on Medical Image Computing and Computer-assisted Intervention, pages 411-418. Springer, 2013.0#$aAndy Clark. Surfing uncertainty: Prediction, action, and the embodied mind. Oxford University Press, 2015.0#$aMichael W Cole, Jeremy R Reynolds, Jonathan D Power, Grega Repovs, Alan Anticevic, and Todd S Braver. Multi-task connectivity reveals flexible hubs for adaptive task control. Nature neuroscience, 16(9):1348, 2013.0#$aRonan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. Natural language processing (almost) from scratch. Journal of machine learning research, 12(Aug):2493-2537, 2011.0#$aAlexis Conneau, Guillaume Lample, Marc'Aurelio Ranzato, Ludovic Denoyer, and Herve Jegou. Word translation without parallel data. arXiv preprint arXiv:1710.04087, 2017.0#$aGavin E Crooks. Entropy production fluctuation theorem and the nonequilibrium work relation for free energy differences. Physical Review E, 60(3):2721, 1999.0#$aEgidio D'Angelo. Neural circuits of the cerebellum: hypothesis for function. Journal of integrative neuroscience, 10(03):317-352, 2011.0#$aEgidio D'Angelo and CAM Wheeler-Kingshott. Modelling the brain: elementary components to explain ensemble functions. Riv. 
del nuovo Cim, 40:297-333, 2017.0#$aTerrence W Deacon. The symbolic species: The co-evolution of language and the brain. WW Norton & Company, 1998.0#$aJeffrey Dean, Greg Corrado, Rajat Monga, Kai Chen, Matthieu Devin, Mark Mao, Andrew Senior, Paul Tucker, Ke Yang, Quoc V Le, et al. Large scale distributed deep networks. In Advances in neural information processing systems, pages 1223-1231, 2012.0#$aPaul Dean, John Porrill, Carl-Fredrik Ekerot, and Henrik Jorntell. The cerebellar microcircuit as an adaptive filter: experimental and computational evidence. Nature Reviews Neuroscience, 11(1):30, 2010.0#$aThomas Dean. A computational model of the cerebral cortex. In Proceedings of the National Conference on Artificial Intelligence, volume 20, page 938. Menlo Park, CA; Cambridge, MA; London; AAAI Press; MIT Press; 1999, 2005.0#$aStanislas Dehaene. Consciousness and the brain: Deciphering how the brain codes our thoughts. Penguin, 2014.0#$aStanislas Dehaene, Hakwan Lau, and Sid Kouider. What is consciousness, and could machines have it? Science, 358(6362):486-492, 2017.0#$aMarc Peter Deisenroth, Gerhard Neumann, Jan Peters, et al. A survey on policy search for robotics. Foundations and Trends in Robotics, 2(1-2):1-142, 2013.0#$aDori Derdikman, Rina Hildesheim, Ehud Ahissar, Amos Arieli, and Amiram Grinvald. Imaging spatiotemporal dynamics of surround inhibition in the barrels somatosensory cortex. Journal of Neuroscience, 23(8):3100-3105, 2003.0#$aJeff Desjardins. The 8 major forces shaping the future of the global economy. 2018. URL https://worldview.stratfor.com/article/8-major-forces-shaping-future-global-economy.0#$aAlexey Dosovitskiy and Vladlen Koltun. Learning to act by predicting the future. arXiv preprint arXiv:1611.01779, 2016.0#$aRodney J Douglas and Kevan AC Martin. Recurrent neuronal circuits in the neocortex. Current biology, 17(13):R496-R500, 2007.0#$aKenji Doya. Complementary roles of basal ganglia and cerebellum in learning and motor control. 
Current opinion in neurobiology, 10(6):732-739, 2000.0#$aRobin IM Dunbar. Neocortex size as a constraint on group size in primates. Journal of human evolution, 22(6):469-493, 1992.0#$aDavid Eagleman. Incognito: The Secret Lives of the Brain. New York City: Pantheon, 2011.0#$aChris Eliasmith, Terrence S Stewart, Xuan Choo, Trevor Bekolay, Travis DeWolf, Yichuan Tang, and Daniel Rasmussen. A large-scale model of the functioning brain. Science, 338(6111):1202-1205, 2012.0#$aDaniel Everett. How language began: the story of humanity's greatest invention. Profile Books, 2017.0#$aAldo Faisal, Dietrich Stout, Jan Apel, and Bruce Bradley. The manipulative complexity of lower paleolithic stone toolmaking. PloS one, 5(11):e13718, 2010.0#$aBo Fan, Lijuan Wang, Frank K Soong, and Lei Xie. Photo-real talking head with deep bidirectional lstm. In Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on, pages 4884-4888. IEEE, 2015.0#$aManaal Faruqui, Yulia Tsvetkov, Dani Yogatama, Chris Dyer, and Noah Smith. Sparse overcomplete word vector representations. arXiv preprint arXiv:1506.02004, 2015.0#$aMichael J Frank and David Badre. Mechanisms of hierarchical reinforcement learning in corticostriatal circuits 1: computational analysis. Cerebral cortex, 22(3):509-526, 2011.0#$aMichael J Frank, Bryan Loughry, and Randall S O'Reilly. Interactions between frontal cortex and basal ganglia in working memory: a computational model. Cognitive, Affective, & Behavioral Neuroscience, 1(2):137-160, 2001.0#$aStan Franklin, Tamas Madl, Sidney D'mello, and Javier Snaider. Lida: A systems-level architecture for cognition, emotion, and learning. IEEE Transactions on Autonomous Mental Development, 6(1):19-41, 2014.0#$aKarl Friston. A theory of cortical responses. 
Philosophical transactions of the Royal Society B: Biological sciences, 360(1456):815-836, 2005.0#$aKarl Friston, Francesco Rigoli, Dimitri Ognibene, Christoph Mathys, Thomas Fitzgerald, and Giovanni Pezzulo. Active inference and epistemic value. Cognitive neuroscience, 6(4):187-214, 2015.0#$aKunihiko Fukushima. Neural network model for a mechanism of pattern recognition unaffected by shift in position - neocognitron. Electron. & Commun. Japan, 62(10):11-18, 1979.0#$aJoaquin M Fuster. Cortex and mind: Unifying cognition. Oxford university press, 2003.0#$aTimur Garipov, Dmitry Podoprikhin, Alexander Novikov, and Dmitry Vetrov. Ultimate tensorization: compressing convolutional and fc layers alike. arXiv preprint arXiv:1611.03214, 2016.0#$aLeon A Gatys, Alexander S Ecker, and Matthias Bethge. A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576, 2015.0#$aSergey Gavrilets and Aaron Vose. The dynamics of machiavellian intelligence. Proceedings of the National Academy of Sciences, 103(45):16823-16828, 2006.0#$aJonas Gehring, Michael Auli, David Grangier, Denis Yarats, and Yann N Dauphin. Convolutional sequence to sequence learning. arXiv preprint arXiv:1705.03122, 2017.0#$aDileep George and Jeff Hawkins. Towards a mathematical theory of cortical micro-circuits. PLoS computational biology, 5(10):e1000532, 2009.0#$aAvniel Singh Ghuman, Nicolas M Brunet, Yuanning Li, Roma O Konecky, John A Pyles, Shawn A Walls, Vincent Destefino, Wei Wang, and R Mark Richardson. Dynamic encoding of face information in the human fusiform gyrus. Nature communications, 5:5672, 2014.0#$aIan Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. In Advances in neural information processing systems, pages 2672-2680, 2014.0#$aIan Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press, 2016. http://www.deeplearningbook.org.0#$aAlex Graves. 
Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850, 2013.0#$aAlex Graves, Santiago Fernandez, Faustino Gomez, and Jürgen Schmidhuber. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In Proceedings of the 23rd international conference on Machine learning, pages 369-376. ACM, 2006.0#$aAlex Graves, Abdel-rahman Mohamed, and Geoffrey Hinton. Speech recognition with deep recurrent neural networks. In Acoustics, speech and signal processing (icassp), 2013 ieee international conference on, pages 6645-6649. IEEE, 2013.0#$aKevin Gurney, Tony J Prescott, and Peter Redgrave. A computational model of action selection in the basal ganglia. i. a new functional anatomy. Biological cybernetics, 84(6):401-410, 2001.0#$aHabr. Neyroset' Yandeksa stala soavtorom p'esy dlya al'ta s orkestrom, 2019. URL https://habr.com/ru/post/441286/.0#$aPatric Hagmann, Leila Cammoun, Xavier Gigandet, Reto Meuli, Christopher J Honey, Van J Wedeen, and Olaf Sporns. Mapping the structural core of human cerebral cortex. PLoS biology, 6(7):e159, 2008.0#$aSong Han, Huizi Mao, and William J Dally. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding. arXiv preprint arXiv:1510.00149, 2015a.0#$aSong Han, Jeff Pool, John Tran, and William Dally. Learning both weights and connections for efficient neural network. In Advances in neural information processing systems, pages 1135-1143, 2015b.0#$aMarc D Hauser, Noam Chomsky, and W Tecumseh Fitch. The faculty of language: what is it, who has it, and how did it evolve? Science, 298(5598):1569-1579, 2002.0#$aJeff Hawkins and Subutai Ahmad. Why neurons have thousands of synapses, a theory of sequence memory in neocortex. Frontiers in neural circuits, 10:23, 2016.0#$aJeff Hawkins, Dileep George, and Jamie Niemasik. Sequence memory for prediction, inference and behaviour. 
Philosophical Transactions of the Royal Society B: Biological Sciences, 364(1521):1203-1209, 2009.0#$aJeff Hawkins, Subutai Ahmad, and Yuwei Cui. A theory of how columns in the neocortex enable learning the structure of the world. Frontiers in neural circuits, 11:81, 2017.0#$aKaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In Proceedings of the IEEE international conference on computer vision, pages 1026-1034, 2015.0#$aKaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770-778, 2016.0#$aDonald Olding Hebb. The organization of behavior: A neuropsychological theory. Psychology Press, 2005.0#$aSuzana Herculano-Houzel. The human advantage: a new understanding of how our brain became remarkable. MIT Press, 2016.0#$aM Hilbert and P Lopez. The world's technological capacity to store, communicate, and compute information. Science (New York, NY), 332(6025):60-65, 2011.0#$aG Hinton, N Srivastava, and K Swersky. Rmsprop: Divide the gradient by a running average of its recent magnitude. Neural networks for machine learning, Coursera lecture 6e, 2012a.0#$aGeoffrey E Hinton, Simon Osindero, and Yee-Whye Teh. A fast learning algorithm for deep belief nets. Neural computation, 18(7):1527-1554, 2006.0#$aGeoffrey E Hinton, Nitish Srivastava, Alex Krizhevsky, Ilya Sutskever, and Ruslan R Salakhutdinov. Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580, 2012b.0#$aSepp Hochreiter and Jürgen Schmidhuber. Long short-term memory. Neural computation, 9(8):1735-1780, 1997.0#$aSepp Hochreiter, Yoshua Bengio, Paolo Frasconi, Jürgen Schmidhuber, et al. Gradient flow in recurrent nets: the difficulty of learning long-term dependencies, 2001.0#$aThomas Hofmann. 
Unsupervised learning by probabilistic latent semantic analysis. Machine learning, 42(1):177-196, 2001.0#$aFu Jie Huang, Y-Lan Boureau, Yann LeCun, et al. Unsupervised learning of invariant feature hierarchies with applications to object recognition. In Computer Vision and Pattern Recognition, 2007. CVPR'07. IEEE Conference on, pages 1-8. IEEE, 2007.0#$aGao Huang, Zhuang Liu, Kilian Q Weinberger, and Laurens van der Maaten. Densely connected convolutional networks. arXiv preprint arXiv:1608.06993, 2016a.0#$aGao Huang, Yu Sun, Zhuang Liu, Daniel Sedra, and Kilian Q Weinberger. Deep networks with stochastic depth. In European Conference on Computer Vision, pages 646-661. Springer, 2016b.0#$aAlexander G Huth, Wendy A de Heer, Thomas L Griffiths, Frederic E Theunissen, and Jack L Gallant. Natural speech reveals the semantic maps that tile human cerebral cortex. Nature, 532(7600):453-458, 2016.0#$aIFPMA. The pharmaceutical industry and global health. facts and figures, 2017. URL https://www.ifpma.org/wp-content/uploads/2017/02/IFPMA-Facts-And-Figures-2017.pdf.0#$aSergey Ioffe and Christian Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning, pages 448-456, 2015.0#$aMakoto Ito and Kenji Doya. Multiple representations and algorithms for reinforcement learning in the cortico-basal ganglia circuit. Current opinion in neurobiology, 21(3):368-373, 2011.0#$aEugene M Izhikevich and Gerald M Edelman. Large-scale model of mammalian thalamocortical systems. Proceedings of the national academy of sciences, 105(9):3593-3598, 2008.0#$aRay Jackendoff. Language, consciousness, culture: Essays on mental structure, volume 2007. MIT Press, 2007.0#$aMax Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z Leibo, David Silver, and Koray Kavukcuoglu. Reinforcement learning with unsupervised auxiliary tasks. 
arXiv preprint arXiv:1611.05397, 2016.0#$aRafal Jozefowicz, Wojciech Zaremba, and Ilya Sutskever. An empirical exploration of recurrent network architectures. In Proceedings of the 32nd International Conference on Machine Learning (ICML-15), pages 2342-2350, 2015.0#$aDan Jurafsky and James H Martin. Speech and language processing, volume 3. Pearson London, 2014.0#$aDaniel Kahneman. Thinking, fast and slow. Macmillan, 2011.0#$aPentti Kanerva. Hyperdimensional computing: An introduction to computing in distributed representation with high-dimensional random vectors. Cognitive Computation, 1(2):139-159, 2009.0#$aM Kawato. Cerebellum: models. Encyclopedia of neuroscience, 2007.0#$aNitish Shirish Keskar, Dheevatsa Mudigere, Jorge Nocedal, Mikhail Smelyanskiy, and Ping Tak Peter Tang. On large-batch training for deep learning: Generalization gap and sharp minima. arXiv preprint arXiv:1609.04836, 2016.0#$aRaymond R Kesner and Edmund T Rolls. A computational theory of hippocampal function, and tests of the theory: new developments. Neuroscience & Biobehavioral Reviews, 48:92-147, 2015.0#$aMehdi Khamassi and Mark D Humphries. Integrating cortico-limbic-basal ganglia architectures for learning model-based and model-free navigation strategies. Frontiers in behavioral neuroscience, 6:79, 2012.0#$aHyunjik Kim, Andriy Mnih, Jonathan Schwarz, Marta Garnelo, Ali Eslami, Dan Rosenbaum, Oriol Vinyals, and Yee Whye Teh. Attentive neural processes. arXiv preprint arXiv:1901.05761, 2019.0#$aDiederik Kingma and Jimmy Ba. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.0#$aDan Klein and Christopher D Manning. Corpus-based induction of syntactic structure: Models of dependency and constituency. In Proceedings of the Annual Meeting on Association for Computational Linguistics, page 478. Association for Computational Linguistics, 2004.0#$aTeuvo Kohonen. Self-organized formation of topologically correct feature maps. 
Biological cybernetics, 43(1):59-69, 1982.0#$aTeuvo Kohonen. Self-Organizing Maps. Springer-Verlag New York, 2001.0#$aAugustine Kong, Michael L Frigge, Gudmar Thorleifsson, Hreinn Stefansson, Alexander I Young, Florian Zink, Gudrun A Jonsdottir, Aysu Okbay, Patrick Sulem, Gisli Masson, et al. Selection against variants in the genome associated with educational attainment. Proceedings of the National Academy of Sciences, 114(5):E727-E732, 2017.0#$aJonathan Koomey and Samuel Naffziger. Moore's law might be slowing down, but not energy efficiency. IEEE Spectrum, 2015.0#$aEugene V Koonin. The logic of chance: the nature and origin of biological evolution. FT press, 2011.0#$aLeonard F Koziol and Deborah Ely Budding. Subcortical structures and cognition: Implications for neuropsychological assessment. Springer Science & Business Media, 2009.0#$aLeonard F Koziol, Lauren A Barker, Arthur W Joyce, and Skip Hrin. Structure and function of large-scale brain systems. Applied Neuropsychology: Child, 3(4):236-244, 2014a.0#$aLeonard F Koziol, Deborah Budding, Nancy Andreasen, Stefano D'Arrigo, Sara Bulgheroni, Hiroshi Imamizu, Masao Ito, Mario Manto, Cherie Marvel, Krystal Parker, et al. Consensus paper: the cerebellum's role in movement and cognition. The Cerebellum, 13(1):151-177, 2014b.0#$aMichael Kremer. Population growth and technological change: One million B.C. to 1990. The Quarterly Journal of Economics, 108(3):681-716, 1993.0#$aAlex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097-1105, 2012.0#$aJohn E Laird, Christian Lebiere, and Paul S Rosenbloom. A standard model of the mind: Toward a common computational framework across artificial intelligence, cognitive science, neuroscience, and robotics. AI Magazine, 38(4), 2017.0#$aGuillaume Lample, Alexis Conneau, Ludovic Denoyer, and Marc'Aurelio Ranzato. 
Unsupervised machine translation using monolingual corpora only. arXiv preprint arXiv:1711.00043, 2017.0#$aNick Lane. Life ascending: the ten great inventions of evolution. Profile books, 2010.0#$aNick Lane. The vital question: energy, evolution, and the origins of complex life. WW Norton & Company, 2015.0#$aSascha Lange and Martin Riedmiller. Deep auto-encoder neural networks in reinforcement learning. In Neural Networks (IJCNN), The 2010 International Joint Conference on, pages 1-8. IEEE, 2010.0#$aEric Laukien, Richard Crowder, and Fergal Byrne. Feynman machine: The universal dynamical systems computer. arXiv preprint arXiv:1609.03971, 2016.0#$aQuoc V Le. Building high-level features using large scale unsupervised learning. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pages 8595-8598. IEEE, 2013.0#$aYann LeCun, Bernhard Boser, John S Denker, Donnie Henderson, Richard E Howard, Wayne Hubbard, and Lawrence D Jackel. Backpropagation applied to handwritten zip code recognition. Neural computation, 1(4):541-551, 1989.0#$aYann LeCun, Leon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.0#$aYann LeCun, Yoshua Bengio, and Geoffrey Hinton. Deep learning. Nature, 521(7553):436-444, 2015.0#$aKai-Fu Lee. AI Superpowers: China, Silicon Valley, and the New World Order. Houghton Mifflin, 2018.0#$aTao Lei, Yu Zhang, Sida I Wang, Hui Dai, and Yoav Artzi. Simple recurrent units for highly parallelizable recurrence. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 4470-4481, 2018.0#$aEd S Lein, Michael J Hawrylycz, Nancy Ao, Mikael Ayres, Amy Bensinger, Amy Bernard, Andrew F Boe, Mark S Boguski,0#$aKevin S Brockway, Emi J Byrnes, et al. Genome-wide atlas of gene expression in the adult mouse brain. Nature, 445(7124):168, 2007.0#$aPeter Lennie. The cost of cortical computation.
Current biology, 13(6):493-497, 2003.0#$aOmer Levy and Yoav Goldberg. Neural word embedding as implicit matrix factorization. In Advances in neural information processing systems, pages 2177-2185, 2014.0#$aTimothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971, 2015.0#$aJames Manyika, Jaana Remes, Jan Mischke, and Mekala Krishnan. The productivity puzzle: a closer look at the United States. McKinsey Global Institute, 2017.0#$aJames G March. Exploration and exploitation in organizational learning. Organization science, 2(1):71-87, 1991.0#$aHenry Markram, Eilif Muller, Srikanth Ramaswamy, Michael W Reimann, Marwan Abdellah, Carlos Aguado Sanchez, Anastasia Ailamaki, Lidia Alonso-Nanclares, Nicolas Antille, Selim Arsever, et al. Reconstruction and simulation of neocortical microcircuitry. Cell, 163(2):456-492, 2015.0#$aDM Mateos, R Wennberg, R Guevara, and JL Perez Velazquez. Consciousness as a global property of brain dynamic activity. Physical Review E, 96(6):062410, 2017.0#$aTomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013.0#$aMelanie Mitchell. An introduction to genetic algorithms. 1998.0#$aVolodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013.0#$aVolodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, et al. Human-level control through deep reinforcement learning. Nature, 518(7540):529, 2015.0#$aVolodymyr Mnih, Adria Puigdomenech Badia, Mehdi Mirza, Alex Graves, Timothy Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu.
Asynchronous methods for deep reinforcement learning. In International Conference on Machine Learning, pages 1928-1937, 2016.0#$aDmitry Molchanov, Arsenii Ashukha, and Dmitry Vetrov. Variational dropout sparsifies deep neural networks. arXiv preprint arXiv:1701.05369, 2017.0#$aEdvard I Moser, Emilio Kropff, and May-Britt Moser. Place cells, grid cells, and the brain's spatial representation system. Annual review of neuroscience, 31, 2008.0#$aVernon V Mountcastle. Introduction. Cerebral cortex, 13(1):2-4, 2003.0#$aUrs Muller, Jan Ben, Eric Cosatto, Beat Flepp, and Yann L Cun. Off-road obstacle avoidance through end-to-end learning. In Advances in neural information processing systems, pages 739-746, 2006.0#$aVipul Naik. Distribution, 2014. URL https://intelligence.org/wp-content/uploads/2014/02/Naik-Distribution-of-Computation.pdf.0#$aVinod Nair and Geoffrey E Hinton. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th international conference on machine learning (ICML-10), pages 807-814, 2010.0#$aCraig G Nevill-Manning and Ian H Witten. Identifying hierarchical structure in sequences: A linear-time algorithm. Journal of Artificial Intelligence Research, 7:67-82, 1997.0#$aAnh Nguyen, Jason Yosinski, Yoshua Bengio, Alexey Dosovitskiy, and Jeff Clune. Plug & play generative networks: Conditional iterative generation of images in latent space. arXiv preprint arXiv:1612.00005, 2016.0#$aAlexander Novikov, Dmitrii Podoprikhin, Anton Osokin, and Dmitry P Vetrov. Tensorizing neural networks. In Advances in neural information processing systems, pages 442-450, 2015.0#$aErkki Oja. Simplified neuron model as a principal component analyzer. Journal of mathematical biology, 15(3):267-273, 1982.0#$aErkki Oja and Juha Karhunen. Signal separation by nonlinear hebbian learning. In Computational intelligence: A dynamic system perspective, pages 83-97. Citeseer, 1995.0#$aRandall S O'Reilly and Michael J Frank.
Making working memory work: a computational model of learning in the prefrontal cortex and basal ganglia. Neural computation, 18(2):283-328, 2006.0#$aRandall S O'Reilly, Dean Wyatte, and John Rohrlich. Learning through time in the thalamocortical loops. arXiv preprint arXiv:1407.3432, 2014.0#$aIvan V Oseledets. Tensor-train decomposition. SIAM Journal on Scientific Computing, 33(5):2295-2317, 2011.0#$aGünther Palm. Neural associative memories and sparse coding. Neural Networks, 37:165-171, 2013.0#$aK Panetta. 5 Trends Emerge in the Gartner Hype Cycle for Emerging Technologies, 2018. 2018. https://www.gartner.com/smarterwithgartner/5-trends-emerge-in-gartner-hype-cycle-for-emerging-technologie0#$aJudea Pearl and Dana Mackenzie. The Book of Why: The New Science of Cause and Effect. Basic Books, 2018.0#$aJeffrey Pennington, Richard Socher, and Christopher Manning. Glove: Global vectors for word representation. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pages 1532-1543, 2014.0#$aNikolay Perunov, Robert A Marsland, and Jeremy L England. Statistical physics of adaptation. Physical Review X, 6(2):021036, 2016.0#$aSteven Pinker. The language instinct: How the mind creates language. Penguin UK, 2003.0#$aChristopher Poultney, Sumit Chopra, Yann L Cun, et al. Efficient learning of sparse representations with an energy-based model. In Advances in neural information processing systems, pages 1137-1144, 2007.0#$aJonathan D Power, Alexander L Cohen, Steven M Nelson, Gagan S Wig, Kelly Anne Barnes, Jessica A Church, Alecia S Vogel, Timothy O Laumann, Fran M Miezin, Bradley L Schlaggar, et al. Functional network organization of the human brain. Neuron, 72(4):665-678, 2011.0#$aGil Press. The thriving ai landscape in israel and what it means for global ai competition. Forbes, Sep 2018.
https://www.forbes.com/sites/gilpress/2018/09/24/the-thriving-ai-landscape-in-israel-and-what-it-means-for-glot0#$aFriedemann Pulvermüller. How neurons make meaning: brain mechanisms for embodied and abstract-symbolic semantics. Trends in cognitive sciences, 17(9):458-470, 2013.0#$aAlec Radford, Luke Metz, and Soumith Chintala. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434, 2015.0#$aAlec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, and Ilya Sutskever. Language models are unsupervised multitask learners. 2019. URL https://blog.openai.com/better-language-models/.0#$aMaithra Raghu, Ben Poole, Jon Kleinberg, Surya Ganguli, and Jascha Sohl-Dickstein. On the expressive power of deep neural networks. arXiv preprint arXiv:1606.05336, 2016.0#$aMaxwell JD Ramstead, Michael D Kirchhoff, Axel Constant, and Karl J Friston. Multiscale integration: Beyond internalism and externalism, 2019.0#$aScott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, and Honglak Lee. Generative adversarial text to image synthesis. arXiv preprint arXiv:1605.05396, 2016.0#$aAnton Reiner, Loreta Medina, and S Leo Veenman. Structural and functional evolution of the basal ganglia in vertebrates. Brain Research Reviews, 28(3):235-285, 1998.0#$aJeremy R Reynolds and Randall S O'Reilly. Developing pfc representations using reinforcement learning. Cognition, 113(3):281-292, 2009.0#$aUrs Ribary. Dynamics of thalamo-cortical network oscillations and human perception. Progress in brain research, 150:127-142, 2005.0#$aGerard J Rinkus. A cortical sparse distributed coding model linking mini-and macrocolumn-scale functionality. Frontiers in neuroanatomy, 4:17, 2010.0#$aJorma Rissanen. Modeling by shortest data description. Automatica, 14(5):465-471, 1978.0#$aEdmund T Rolls. A computational theory of episodic memory formation in the hippocampus.
Behavioural brain research, 215(2):180-196, 2010.0#$aFrank Rosenblatt. The perceptron: a probabilistic model for information storage and organization in the brain. Psychological review, 65(6):386, 1958.0#$aDaniel J Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband, Zheng Wen, et al. A tutorial on thompson sampling. Foundations and Trends® in Machine Learning, 11(1):1-96, 2018.0#$aSara Sabour, Nicholas Frosst, and Geoffrey E Hinton. Dynamic routing between capsules. In Advances in Neural Information Processing Systems, pages 3859-3869, 2017.0#$aJenny R Saffran, Ann Senghas, and John S Trueswell. The acquisition of language by children. Proceedings of the National Academy of Sciences, 98(23):12874-12875, 2001.0#$aRuslan Salakhutdinov, Andriy Mnih, and Geoffrey Hinton. Restricted boltzmann machines for collaborative filtering. In Proceedings of the 24th international conference on Machine learning, pages 791-798. ACM, 2007.0#$aJared M Saletin and Matthew P Walker. Nocturnal mnemonics: sleep and hippocampal memory processing. Frontiers in neurology, 3:59, 2012.0#$aGerard Salton, Anita Wong, and Chung-Shu Yang. A vector space model for automatic indexing. Communications of the ACM, 18(11):613-620, 1975.0#$aAdam Santoro, David Raposo, David GT Barrett, Mateusz Malinowski, Razvan Pascanu, Peter Battaglia, and Timothy Lillicrap. A simple neural network module for relational reasoning. arXiv preprint arXiv:1706.01427, 2017.0#$aLara Schlaffke, L Schweizer, NN Rüther, R Luerding, Martin Tegenthoff, Christian Bellebaum, and Tobias Schmidt-Wilcke. Dynamic changes of resting state connectivity related to the acquisition of a lexico-semantic skill. NeuroImage, 146:429-437, 2017.0#$aJürgen Schmidhuber. Deep learning in neural networks: An overview. Neural networks, 61:85-117, 2015.0#$aNoam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, and Jeff Dean. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer.
arXiv preprint arXiv:1701.06538, 2017.0#$aJonathan Shen, Ruoming Pang, Ron J Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, Rj Skerry-Ryan, et al. Natural tts synthesis by conditioning wavenet on mel spectrogram predictions. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 4779-4783. IEEE, 2018.0#$aStewart Shipp, Rick A Adams, and Karl J Friston. Reflections on agranular architecture: predictive coding in the motor cortex. Trends in neurosciences, 36(12):706-716, 2013.0#$aYoav Shoham, Raymond Perrault, Erik Brynjolfsson, Jack Clark, James Manyika, Juan Carlos Niebles, Terah Lyons, John Etchemendy, Barbara Grosz, and Zoe Bauer. The AI Index 2018 Annual Report. Stanford University, 2018.0#$aDavid Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. Mastering the game of go with deep neural networks and tree search. Nature, 529(7587):484-489, 2016.0#$aDavid Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, et al. Mastering chess and shogi by self-play with a general reinforcement learning algorithm. arXiv preprint arXiv:1712.01815, 2017.0#$aKaren Simonyan and Andrew Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556, 2014.0#$aSoren Van Hout Solari and Rich Martin Stoner. Cognitive consilience: primate non-primary neuroanatomical circuits underlying cognition. Frontiers in neuroanatomy, 5:65, 2011.0#$aHagen Soltau, Hank Liao, and Hasim Sak. Neural speech recognizer: Acoustic-to-word lstm model for large vocabulary speech recognition. arXiv preprint arXiv:1610.09975, 2016.0#$aSho Sonoda and Noboru Murata. Transport analysis of infinitely deep neural network.
Journal of Machine Learning Research, 20(2):1-52, 2019.0#$aEelke Spaak, Mathilde Bonnefond, Alexander Maier, David A Leopold, and Ole Jensen. Layer-specific entrainment of gamma-band neural activity by the alpha rhythm in monkey visual cortex. Current Biology, 22(24):2313-2318, 2012.0#$aMichael W Spratling. A review of predictive coding algorithms. Brain and cognition, 112:92-97, 2017.0#$aPablo Sprechmann and Guillermo Sapiro. Dictionary learning and sparse coding for unsupervised clustering. In Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on, pages 2042-2045. IEEE, 2010.0#$aNitish Srivastava, Geoffrey E Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: a simple way to prevent neural networks from overfitting. Journal of machine learning research, 15(1):1929-1958, 2014.0#$aKimberly L Stachenfeld, Matthew M Botvinick, and Samuel J Gershman. The hippocampus as a predictive map. Nature neuroscience, 20(11):1643, 2017.0#$aSTATISTA. Number of apps available in leading app stores as of 3rd quarter 2018, 2018. URL https://www.statista.com/statistics/276623/number-of-apps-available-in-leading-app-stores/.0#$aGreg Ver Steeg. Unsupervised learning via total correlation explanation. arXiv preprint arXiv:1706.08984, 2017.0#$aGreg Ver Steeg and Aram Galstyan. Low complexity gaussian latent factor models and a blessing of dimensionality. arXiv preprint arXiv:1706.03353, 2017.0#$aAndreas Stolcke and Stephen Omohundro. Inducing probabilistic grammars by bayesian model merging. In International Colloquium on Grammatical Inference, pages 106-118. Springer, 1994.0#$aXu Sun, Xuancheng Ren, Shuming Ma, Bingzhen Wei, Wei Li, Jingjing Xu, Houfeng Wang, and Yi Zhang. Training simplification and model simplification for deep learning: A minimal effort back propagation method. IEEE Transactions on Knowledge and Data Engineering, 2018.0#$aIlya Sutskever and Geoffrey Hinton.
Learning multilevel distributed representations for high-dimensional sequences. In Artificial Intelligence and Statistics, pages 548-555, 2007.0#$aIlya Sutskever, James Martens, George Dahl, and Geoffrey Hinton. On the importance of initialization and momentum in deep learning. In International conference on machine learning, pages 1139-1147, 2013.0#$aIlya Sutskever, Oriol Vinyals, and Quoc V Le. Sequence to sequence learning with neural networks. In Advances in neural information processing systems, pages 3104-3112, 2014.0#$aRichard S Sutton. Dyna, an integrated architecture for learning, planning, and reacting. ACM SIGART Bulletin, 2(4):160-163, 1991.0#$aTASS.RU. Vliyanie ekosistemy MSP na mirovuyu ekonomiku. 2017. https://tass.ru/pmef-2017/articles/4278934.0#$aEmanuel Todorov. Parallels between sensory and motor information processing. The cognitive neurosciences, pages 613-24, 2009.0#$aMichael Tomasello. Constructing a language. Harvard university press, 2009.0#$aGiulio Tononi and Christof Koch. Consciousness: here, there and everywhere? Phil. Trans. R. Soc. B, 370(1668):20140167, 2015.0#$aTVkultura. Na aukcione Christies vpervye prodali napisannuyu iskusstvennym intellektom kartinu, 2018. URL https://tvkultura.ru/article/show/article_id/302385/.0#$aNaonori Ueda and Ryohei Nakano. Deterministic annealing em algorithm. Neural networks, 11(2):271-282, 1998.0#$aMarylka Uusisaari and Erik De Schutter. The mysterious microcircuitry of the cerebellar nuclei. The Journal of physiology, 589(14):3441-3457, 2011.0#$aKurt Vanlehn and William Ball. A version space approach to learning context-free grammars. Machine learning, 2(1):39-71, 1987.0#$aVladimir Naumovich Vapnik. Statistical learning theory, volume 1. Wiley New York, 1998.0#$aAshish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need.
In Advances in Neural Information Processing Systems, pages 5998-6008, 2017.0#$aPaul FMJ Verschure. Distributed adaptive control: a theory of the mind, brain, body nexus. Biologically Inspired Cognitive Architectures, 1:55-72, 2012.0#$aPaul FMJ Verschure, Cyriel MA Pennartz, and Giovanni Pezzulo. The why, what, where, when and how of goal-directed choice: neuronal and computational principles. Philosophical Transactions of the Royal Society B: Biological Sciences, 369(1655):20130483, 2014.0#$aOriol Vinyals, Alexander Toshev, Samy Bengio, and Dumitru Erhan. Show and tell: A neural image caption generator. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3156-3164, 2015.0#$aAlexander Volokh and Günter Neumann. Task-oriented dependency parsing evaluation methodology. In 2012 IEEE 13th International Conference on Information Reuse & Integration (IRI), pages 132-137. IEEE, 2012.0#$aChristoph Von der Malsburg. Binding in models of perception and brain function. Current opinion in neurobiology, 5(4):520-526, 1995.0#$aHeinz Von Foerster, Patricia M Mora, and Lawrence W Amiot. Doomsday: Friday, 13 november, ad 2026. Science, 132(3436):1291-1295, 1960.0#$aJohn Von Neumann, Arthur W Burks, et al. Theory of self-reproducing automata. IEEE Transactions on Neural Networks, 5(1):3-14, 1966.0#$aJian-Ping Wang, Sachin S Sapatnekar, Chris H Kim, Paul Crowell, Steve Koester, Supriyo Datta, Kaushik Roy, Anand Raghunathan, X Sharon Hu, Michael Niemier, et al. A pathway to enable exponential scaling for the beyond-cmos era. In Proceedings of the 54th Annual Design Automation Conference 2017, page 16. ACM, 2017a.0#$aRuohan Wang, Antoine Cully, Hyung Jin Chang, and Yiannis Demiris. Magan: Margin adaptation for generative adversarial networks. arXiv preprint arXiv:1704.03817, 2017b.0#$aXiaolong Wang and Abhinav Gupta. Generative image modeling using style and structure adversarial networks.
In European Conference on Computer Vision, pages 318-335. Springer, 2016.0#$aYisen Wang, Xuejiao Deng, Songbai Pu, and Zhiheng Huang. Residual convolutional ctc networks for automatic speech recognition. arXiv preprint arXiv:1702.07793, 2017c.0#$aLawrence M Ward. The thalamic dynamic core theory of conscious experience. Consciousness and Cognition, 20(2):464-86, 2011.0#$aChristopher JCH Watkins and Peter Dayan. Q-learning. Machine learning, 8(3-4):279-292, 1992.0#$aNicholas Watters, Andrea Tacchetti, Theophane Weber, Razvan Pascanu, Peter Battaglia, and Daniel Zoran. Visual interaction networks. arXiv preprint arXiv:1706.01433, 2017.0#$aTerry A Welch. Technique for high-performance data compression. Computer, 6(17):8-19, 1984.0#$aAshia S Wilson, Rebecca Roelofs, Mitchell Stern, Nathan Srebro, and Benjamin Recht. The marginal value of adaptive gradient methods in machine learning. arXiv preprint arXiv:1705.08292, 2017.0#$aEdward O Wilson. The social conquest of earth. WW Norton & Company, 2012.0#$aJ Gerard Wolff. An algorithm for the segmentation of an artificial language analogue. British journal of psychology, 66(1):79-90, 1975.0#$aJ Gerard Wolff. Language acquisition, data compression and generalization. Pergamon, 1982.0#$aJ Gerard Wolff. Learning syntax and meanings through optimization and distributional analysis. Categories and processes in language acquisition, 1(1), 1988.0#$aRichard Wrangham. Catching fire: How cooking made us human. Basic Books, 2009.0#$aY. Wu, G. Wayne, A. Graves, and T. Lillicrap. The Kanerva Machine: A Generative Distributed Memory. ArXiv e-prints, April 2018.0#$aYonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, et al. Google's neural machine translation system: Bridging the gap between human and machine translation.
arXiv preprint arXiv:1609.08144, 2016.0#$aKelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron Courville, Ruslan Salakhudinov, Rich Zemel, and Yoshua Bengio. Show, attend and tell: Neural image caption generation with visual attention. In International Conference on Machine Learning, pages 2048-2057, 2015.0#$aTom Young, Devamanyu Hazarika, Soujanya Poria, and Erik Cambria. Recent trends in deep learning based natural language processing. IEEE Computational intelligence magazine, 13(3):55-75, 2018.0#$aHujia Yu, Chang Yue, and Chao Wang. News article summarization with attention-based deep recurrent neural networks, 2016.0#$aYan M Yufik and Karl Friston. Life and understanding: the origins of "understanding" in self-organizing nervous systems. Frontiers in systems neuroscience, 10:98, 2016.0#$aMatthew D Zeiler and Rob Fergus. Visualizing and understanding convolutional networks. In European conference on computer vision, pages 818-833. Springer, 2014.0#$aHan Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaolei Huang, Xiaogang Wang, and Dimitris Metaxas. Stackgan: Text to photo-realistic image synthesis with stacked generative adversarial networks. arXiv preprint arXiv:1612.03242, 2016.0#$aYing Zhang, Mohammad Pezeshki, Philemon Brakel, Saizheng Zhang, Cesar Laurent, Yoshua Bengio, and Aaron Courville. Towards end-to-end speech recognition with deep convolutional neural networks. arXiv preprint arXiv:1701.02720, 2017.0#$aJun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A Efros. Unpaired image-to-image translation using cycle-consistent adversarial networks. arXiv preprint arXiv:1703.10593, 2017.##$aThere is an electronic copy4#$ariorpub.com