On December 5, 2017, the DeepMind team released a preprint paper introducing AlphaZero,[1] which would soon play three games by defeating world-champion chess engines Stockfish, Elmo, and the three-day version of AlphaGo Zero. In each case it made use of custom tensor processing units (TPUs) that the Google programs were optimized to use.[2] AlphaZero was trained solely via self-play using 5,000 first-generation TPUs to generate the games and 64 second-generation TPUs to train the neural networks, all in parallel, with no access to opening books or endgame tables. After four hours of training, DeepMind estimated AlphaZero was playing chess at a higher Elo rating than Stockfish 8; after nine hours of training, the algorithm defeated Stockfish 8 in a time-controlled 100-game tournament (28 wins, 0 losses, and 72 draws).[2][3][4] The trained algorithm played on a single machine with four TPUs.
DeepMind's paper on AlphaZero was published in the journal Science on 7 December 2018.[5] While the actual AlphaZero program has not been released to the public,[6] the algorithm described in the paper has been implemented in publicly available software. In 2019, DeepMind published a new paper detailing MuZero, a new algorithm able to generalize AlphaZero's work, playing both Atari and board games without knowledge of the rules or representations of the game.[7]
AlphaZero (AZ) is a more generalized variant of the AlphaGo Zero (AGZ) algorithm, and is able to play shogi and chess as well as Go. Differences between AZ and AGZ include:[2]
Comparing Monte Carlo tree search searches, AlphaZero searches just 80,000 positions per second in chess and 40,000 in shogi, compared to 70 million for Stockfish and 35 million for Elmo. AlphaZero compensates for the lower number of evaluations by using its deep neural network to focus much more selectively on the most promising variation.[2]
Training
AlphaZero was trained by simply playing against itself multiple times, using 5,000 first-generation TPUs to generate the games and 64 second-generation TPUs to train the neural networks. Training took several days, totaling about 41 TPU-years. It cost 3e22 FLOPs.[8]
In parallel, the in-training AlphaZero was periodically matched against its benchmark (Stockfish, Elmo, or AlphaGo Zero) in brief one-second-per-move games to determine how well the training was progressing. DeepMind judged that AlphaZero's performance exceeded the benchmark after around four hours of training for Stockfish, two hours for Elmo, and eight hours for AlphaGo Zero.[2]
Preliminary results
Outcome
Chess
In AlphaZero's chess match against Stockfish 8 (2016 TCEC world champion), each program was given one minute per move. AlphaZero was flying the English flag, while Stockfish the Norwegian.[9] Stockfish was allocated 64 threads and a hash size of 1 GB,[2] a setting that Stockfish's Tord Romstad later criticized as suboptimal.[10][note 1] AlphaZero was trained on chess for a total of nine hours before the match. During the match, AlphaZero ran on a single machine with four application-specific TPUs. In 100 games from the normal starting position, AlphaZero won 25 games as White, won 3 as Black, and drew the remaining 72.[11] In a series of twelve, 100-game matches (of unspecified time or resource constraints) against Stockfish starting from the 12 most popular human openings, AlphaZero won 290, drew 886 and lost 24.[2]
Shogi
AlphaZero was trained on shogi for a total of two hours before the tournament. In 100 shogi games against Elmo (World Computer Shogi Championship 27 summer 2017 tournament version with YaneuraOu 4.73 search), AlphaZero won 90 times, lost 8 times and drew twice.[11] As in the chess games, each program got one minute per move, and Elmo was given 64 threads and a hash size of 1 GB.[2]
Go
After 34 hours of self-learning of Go and against AlphaGo Zero, AlphaZero won 60 games and lost 40.[2][11]
Analysis
DeepMind stated in its preprint, "The game of chess represented the pinnacle of AI research over several decades. State-of-the-art programs are based on powerful engines that search many millions of positions, leveraging handcrafted domain expertise and sophisticated domain adaptations. AlphaZero is a generic reinforcement learning algorithm – originally devised for the game of go – that achieved superior results within a few hours, searching a thousand times fewer positions, given no domain knowledge except the rules."[2] DeepMind's Demis Hassabis, a chess player himself, called AlphaZero's play style "alien": It sometimes wins by offering counterintuitive sacrifices, like offering up a queen and bishop to exploit a positional advantage. "It's like chess from another dimension."[12]
Given the difficulty in chess of forcing a win against a strong opponent, the +28 –0 =72 result is a significant margin of victory. However, some grandmasters, such as Hikaru Nakamura and Komodo developer Larry Kaufman, downplayed AlphaZero's victory, arguing that the match would have been closer if the programs had access to an opening database (since Stockfish was optimized for that scenario).[13] Romstad additionally pointed out that Stockfish is not optimized for rigidly fixed-time moves and the version used was a year old.[10][14]
Similarly, some shogi observers argued that the Elmo hash size was too low, that the resignation settings and the "EnteringKingRule" settings (cf. shogi § Entering King) may have been inappropriate, and that Elmo is already obsolete compared with newer programs.[15][16]
Reaction and criticism
Papers headlined that the chess training took only four hours: "It was managed in little more than the time between breakfast and lunch."[3][17]Wired described AlphaZero as "the first multi-skilled AI board-game champ".[18] AI expert Joanna Bryson noted that Google's "knack for good publicity" was putting it in a strong position against challengers. "It's not only about hiring the best programmers. It's also very political, as it helps make Google as strong as possible when negotiating with governments and regulators looking at the AI sector."[11]
Human chess grandmasters generally expressed excitement about AlphaZero. Danish grandmaster Peter Heine Nielsen likened AlphaZero's play to that of a superior alien species.[11] Norwegian grandmaster Jon Ludvig Hammer characterized AlphaZero's play as "insane attacking chess" with profound positional understanding.[3] Former championGarry Kasparov said, "It's a remarkable achievement, even if we should have expected it after AlphaGo."[13][19]
Grandmaster Hikaru Nakamura was less impressed, stating: "I don't necessarily put a lot of credibility in the results simply because my understanding is that AlphaZero is basically using the Google supercomputer and Stockfish doesn't run on that hardware; Stockfish was basically running on what would be my laptop. If you wanna have a match that's comparable you have to have Stockfish running on a supercomputer as well."[10]
Top US correspondence chess player Wolff Morrow was also unimpressed, claiming that AlphaZero would probably not make the semifinals of a fair competition such as TCEC where all engines play on equal hardware. Morrow further stated that although he might not be able to beat AlphaZero if AlphaZero played drawish openings such as the Petroff Defence, AlphaZero would not be able to beat him in a correspondence chess game either.[20]
Motohiro Isozaki, the author of YaneuraOu, noted that although AlphaZero did comprehensively beat Elmo, the rating of AlphaZero in shogi stopped growing at a point which is at most 100–200 higher than Elmo. This gap is not that high, and Elmo and other shogi software should be able to catch up in 1–2 years.[21]
Final results
DeepMind addressed many of the criticisms in their final version of the paper, published in December 2018 in Science.[5] They further clarified that AlphaZero was not running on a supercomputer; it was trained using 5,000 tensor processing units (TPUs), but only ran on four TPUs and a 44-core CPU in its matches.[22]
Chess
In the final results, Stockfish 9 dev ran under the same conditions as in the TCEC superfinal: 44 CPU cores, Syzygy endgame tablebases, and a 32 GB hash size. Instead of a fixed time control of one move per minute, both engines were given 3 hours plus 15 seconds per move to finish the game. AlphaZero ran on a machine with four TPUs in addition to 44 CPU cores. In a 1000-game match, AlphaZero won with a score of 155 wins, 6 losses, and 839 draws. DeepMind also played a series of games using the TCEC opening positions; AlphaZero also won convincingly. Stockfish needed 10-to-1 time odds to match AlphaZero.[23]
Shogi
Similar to Stockfish, Elmo ran under the same conditions as in the 2017 CSA championship. The version of Elmo used was WCSC27 in combination with YaneuraOu 2017 Early KPPT 4.79 64AVX2 TOURNAMENT. Elmo operated on the same hardware as Stockfish: 44 CPU cores and a 32 GB hash size. AlphaZero won 98.2% of games when playing sente (i.e. having the first move) and 91.2% overall.
Reactions and criticisms
Human grandmasters were generally impressed with AlphaZero's games against Stockfish.[23] Former world champion Garry Kasparov said it was a pleasure to watch AlphaZero play, especially since its style was open and dynamic like his own.[24][25]
In the computer chess community, Komodo developer Mark Lefler called it a "pretty amazing achievement", but also pointed out that the data was old, since Stockfish had gained a lot of strength since January 2018 (when Stockfish 8 was released). Fellow developer Larry Kaufman said AlphaZero would probably lose a match against the latest version of Stockfish, Stockfish 10, under Top Chess Engine Championship (TCEC) conditions. Kaufman argued that the only advantage of neural network–based engines was that they used a GPU, so if there was no regard for power consumption (e.g. in an equal-hardware contest where both engines had access to the same CPU and GPU) then anything the GPU achieved was "free". Based on this, he stated that the strongest engine was likely to be a hybrid with neural networks and standard alpha–beta search.[26]
AlphaZero inspired the computer chess community to develop Leela Chess Zero, using the same techniques as AlphaZero. Leela contested several championships against Stockfish, where it showed roughly similar strength to Stockfish, although Stockfish has since pulled away.[27]
In 2019 DeepMind published MuZero, a unified system that played excellent chess, shogi, and go, as well as games in the Atari Learning Environment, without being pre-programmed with their rules.[28][29]
The match results by themselves are not particularly meaningful because of the rather strange choice of time controls and Stockfish parameter settings: The games were played at a fixed time of 1 minute/move, which means that Stockfish has no use of its time management heuristics (lot of effort has been put into making Stockfish identify critical points in the game and decide when to spend some extra time on a move; at a fixed time per move, the strength will suffer significantly). The version of Stockfish used is one year old, was playing with far more search threads than has ever received any significant amount of testing, and had way too small hash tables for the number of threads. I believe the percentage of draws would have been much higher in a match with more normal conditions.[10]
^As given in the Science paper, a TPU is "roughly similar in inference speed to a Titan V GPU, although the architectures are not directly comparable" (Ref. 24).
Anteros, lebih populer dengan sebutan Eros, karya Alfred Gilbert, 1885. Erotes adalah sekelompok dewa bersayap dalam mitologi Yunani yang melambangkan cinta dan gairah seksual.Para Erotes adalah anak dari Ares dan Afrodit. Anggota Erotes antara lain Eros, Anteros, Himeros, dan Pothos. Pranala luar (Inggris) Erotes di Theoi (Inggris) Erotes di Greek Mythology Index Diarsipkan 2011-07-03 di Wayback Machine. Artikel bertopik Mitologi Yunani ini adalah sebuah rintisan. Anda dapat membantu Wikiped...
Thomas Perronet Thompson, potret oleh George Hayter Thomas Perronet Thompson (15 Maret 1783 – 6 September 1869)[1] adalah seorang anggota Parlemen Inggris, gubernur Sierra Leone dan seorang reformis radikal. Ia menjadi menonjol pada tahun 1830-an dan 1840-an sebagai aktivis terkemuka di Liga Hukum Anti-Jagung. Ia berspesialisasi dalam mobilisasi opini masyarakat akar rumput melalui pamflet, artikel surat kabar, korespondensi, pidato, dan rapat perencanaan daerah yang t...
العلاقات الفنلندية القبرصية فنلندا قبرص فنلندا قبرص تعديل مصدري - تعديل العلاقات الفنلندية القبرصية هي العلاقات الثنائية التي تجمع بين فنلندا وقبرص.[1][2][3][4][5] مقارنة بين البلدين هذه مقارنة عامة ومرجعية للدولتين: وجه المقارنة فنلندا ق...
العلاقات الأرجنتينية التشيلية الأرجنتين تشيلي تعديل مصدري - تعديل يشير مصطلح العلاقات الأرجنتينة التشيلية إلى العلاقات الدولية التي تجمع جمهورية تشيلي بجمهورية الأرجنتين. تشترك الأرجنتين وتشيلي بثالث أطول حدود دولية في العالم، ويبلغ طولها 5300 كيلومتر (33...
2021 Hoboken mayoral election ← 2017 November 2, 2021 2025 → Turnout30.44%[1] Candidate Ravinder Bhalla Write-in Party Nonpartisan Nonpartisan Popular vote 8,771 612 Percentage 87.95% 6.52% Mayor before election Ravinder Bhalla Democratic Elected Mayor Ravinder Bhalla Democratic Elections in New Jersey Federal government U.S. President 1788-89 1792 1796 1800 1804 1808 1812 1816 1820 1824 1828 1832 1836 1840 1844 1848 1852 1856 1860 1864 1868 1872 187...
Trotoar yang dimajukan ke tengah jalan pada zebra crossPerlambatan lalu lintas yang dilakukan di Yate, South Gloucestershire, Inggris berupa: polisi tidur, marka jalan, rambu, delinator dan jalan yang dipersempit Pelambatan lalu lintas (traffic calming) adalah upaya yang dilakukan untuk memperlambat lalu lintas dalam rangka meningkatkan keselamatan pejalan kaki, pesepeda, pebelanja, dan penduduk serta mengurangi kebisingan dan polusi. Pelambatan lalu lintas biasanya diterapkan di daerah perum...
Arrondissement de Bamberg Landkreis Bamberg Héraldique Localisation Administration Pays Allemagne Land Bavière District(Regierungsbezirk) Haute-Franconie Chef-lieu Bamberg Villes principales Hirschaid, Hallstadt Préfet(Landrat) Johann Kalb Partis au pouvoir CSU Code arrondissemental(Kreisschlüssel) 09 4 71 Immatriculation BA Communes 36 Démographie Population 147 697 hab. (31 décembre 2021) Densité 127 hab./km2 Géographie Superficie 1 167,37 km2 Localisation ...
درابيتسونا خريطة الموقع تقسيم إداري البلد اليونان [1] خصائص جغرافية إحداثيات 37°56′48″N 23°37′30″E / 37.94666667°N 23.625°E / 37.94666667; 23.625 الارتفاع 0 متر السكان التعداد السكاني 13815 (resident population of Greece) (2021)[2]13335 (resident population of Greece) (2001)13116 (resident population of Greece) (1991...
Pieve di CadoreKomuneComune di Pieve di CadoreNegaraItaliaWilayahVenetoProvinsiBelluno (BL)FrazioniDamos, Nebbiù, Pozzale, Sottocastello, TaiPemerintahan • Wali kotaMaria Antonia CiottiLuas • Total66,6 km2 (257 sq mi)Ketinggian878 m (2,881 ft)Populasi (31 Mei 2007) • Total4.087 • Kepadatan6,1/km2 (16/sq mi)DemonimPievaniZona waktuUTC+1 (CET) • Musim panas (DST)UTC+2 (CEST)Kode pos32044Kode area tel...
Order of amphibians This article is about the group of amphibians. For other uses, see Frog (disambiguation). FrogsTemporal range: Early Jurassic – Present, 200–0 Ma PreꞒ Ꞓ O S D C P T J K Pg N Various types of frog Scientific classification Domain: Eukaryota Kingdom: Animalia Phylum: Chordata Class: Amphibia Clade: Salientia Order: AnuraDuméril, 1806 (as Anoures) Subgroups See text Native distribution of frogs (in green) Variegated golden frog (Mantella baroni) in the Ranomafan...
Vous lisez un « bon article » labellisé en 2015. Pour un article plus général, voir Garde impériale (Second Empire). Lanciers de la Garde impériale Lanciers de la Garde impériale, 1857. Richard Knötel, Uniformenkunde, 1890, volume VI, planche 9. Création 20 décembre 1855 Dissolution 28 octobre 1870 Pays France Allégeance Second Empire Branche Cavalerie Type Régiment Fait partie de Garde impériale Garnison Melun Guerres Campagne d'Italie (1859)Guerre franco-allemande d...
Bulgarian footballer This article relies largely or entirely on a single source. Relevant discussion may be found on the talk page. Please help improve this article by introducing citations to additional sources.Find sources: Nikola Yordanov – news · newspapers · books · scholar · JSTOR (December 2023) Nikola YordanovBulgarian: Никола ЙордановPersonal informationDate of birth (1938-10-23)October 23, 1938Place of birth Ruse, BulgariaDate o...
South Korean actor In this Korean name, the family name is Son. Son Suk-kuSon in 2022Born (1983-02-07) February 7, 1983 (age 41)Taepyeong-dong, Jung-gu, Daejeon, South KoreaAlma materSchool of the Art Institute of Chicago (FVNMA)OccupationActorYears active2014–presentAgentSBD Entertainment[1]Korean nameHangul손석구Hanja孫錫久Revised RomanizationSon Seok-guMcCune–ReischauerSon Sŏk-ku Son Suk-ku (Korean: 손석구; Korean pronunciation: [son.sʌk̚.ku]...
Hungarian-American mathematician The native form of this personal name is Szemerédi Endre. This article uses Western name order when mentioning individuals. This biography of a living person needs additional citations for verification. Please help by adding reliable sources. Contentious material about living persons that is unsourced or poorly sourced must be removed immediately from the article and its talk page, especially if potentially libelous.Find sources: Endre Szemerédi...
هذه المقالة يتيمة إذ تصل إليها مقالات أخرى قليلة جدًا. فضلًا، ساعد بإضافة وصلة إليها في مقالات متعلقة بها. (فبراير 2020) يفتقر محتوى هذه المقالة إلى الاستشهاد بمصادر. فضلاً، ساهم في تطوير هذه المقالة من خلال إضافة مصادر موثوق بها. أي معلومات غير موثقة يمكن التشكيك بها وإزالتها...
Kulit putih Amerika SerikatKulit putih Amerika Serikat (murni/satu ras saja) pada tahun 2020Jumlah populasi235.411.507 (71,02%) Kulit putih murni atau campuran 204.277.273 (61,63%) Kulit putih murni 31.134.234 (9,39%) Kulit putih campuran [1]Daerah dengan populasi signifikanSemua wilayah di Amerika SerikatBahasaMayoritas Bahasa InggrisAgamaProtestan 48%Katolik 19%Mormon 2%Yahudi 3%Lainnya 3%Tidak beragama 24%[2] Kulit putih Amerika Serikat (juga disebut sebagai Eropa Amerika S...
The King's ManSutradaraMatthew VaughnProduser Matthew Vaughn David Reid Adam Bohling Ditulis oleh Matthew Vaughn Karl Gajdusek CeritaMatthew VaughnBerdasarkanThe Secret Serviceoleh Mark MillarDave GibbonsPemeran Ralph Fiennes Gemma Arterton Rhys Ifans Matthew Goode Tom Hollander Harris Dickinson Daniel Brühl Djimon Hounsou Charles Dance Penata musik Matthew Margeson Dominic Lewis SinematograferBen Davis[1]Penyunting Jason Ballantine Rob Hall Perusahaanproduksi Marv Studios[...
Эту страницу предлагается переименовать в «Чемпионат мира по футболу среди команд до 17 лет» или «Чемпионат мира по футболу среди юношей до 17 лет».Пояснение причин и обсуждение — на странице Википедия:К переименованию/7 августа 2023. Пожалуйста, основывайте свои аргум...
Journal of Mathematical Physics англ. Journal of Mathematical Physics[1] Сокращённое название(ISO 4) J. Math. Phys. Специализация математическая физика Периодичность ежемесячно Язык английский Адрес редакции 2 Huntington Quadrangle Melville, NY 11747-4502, USA Главный редактор Бруно Начтергейл Страна США Издатель Амери...