What Is ChatGPT Doing ... and Why Does It Work? by Stephen Wolfram



The input is a vector of n tokens (represented, as in the previous section, by integers from 1 to about 50,000). Each of these tokens is converted (by a single-layer neural net) into an embedding vector (of length 768 for GPT-2 and 12,288 for ChatGPT’s GPT-3). Meanwhile, there’s a “secondary pathway” that takes the sequence of (integer) positions of the tokens and creates another embedding vector from these integers. Finally, the embedding vectors for the token value and the token position are added together, producing the final sequence of embedding vectors from the embedding module.
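As a rough illustration (not Wolfram’s own code), here is a minimal numpy sketch of such an embedding module, using the GPT-2 dimensions quoted above; the random tables stand in for weights that would actually be learned in training:

```python
import numpy as np

rng = np.random.default_rng(0)

VOCAB_SIZE = 50257  # GPT-2's token vocabulary ("about 50,000" tokens)
EMBED_DIM = 768     # embedding length for GPT-2 (12,288 for GPT-3)
MAX_POS = 1024      # GPT-2's maximum sequence length

# Learned embedding tables; random here, but trained in a real model.
# Indexing a row of such a table is equivalent to applying a
# single-layer net to a one-hot encoding of the integer.
token_table = rng.standard_normal((VOCAB_SIZE, EMBED_DIM), dtype=np.float32)
position_table = rng.standard_normal((MAX_POS, EMBED_DIM), dtype=np.float32)

def embed(token_ids):
    """Map a sequence of token IDs to final embedding vectors.

    Each token's value indexes the token table, its position indexes
    the position table, and the two vectors are simply added.
    """
    ids = np.asarray(token_ids)
    positions = np.arange(len(ids))
    return token_table[ids] + position_table[positions]
```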

Why does one just add the token-value and token-position embedding vectors together? I don’t think there’s any particular science to this. It’s just that various things have been tried, and this is one that seems to work. And it’s part of the lore of neural nets that, in some sense, so long as the setup one has is “roughly right”, it’s usually possible to home in on the details just by doing sufficient training, without ever really needing to “understand at an engineering level” quite how the neural net has ended up configuring itself.

Here’s what the embedding module does, operating on the string “hello hello hello hello hello hello hello hello hello hello bye bye bye bye bye bye bye bye bye bye”:



[Image: the array of embedding vectors produced by the embedding module for this token sequence]
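Continuing the sketch above, and using hypothetical token IDs in place of the real BPE codes for “hello” and “bye”, the shape of this computation looks like:

```python
# Hypothetical token IDs standing in for "hello" and "bye"
# (the actual GPT-2 BPE token IDs are different).
HELLO, BYE = 101, 202
tokens = [HELLO] * 10 + [BYE] * 10

vectors = embed(tokens)
print(vectors.shape)  # (20, 768): one length-768 vector per token

# The ten "hello" tokens share one token embedding, but each sits at
# a different position, so after the position embedding is added,
# every row of the result is distinct.
print(np.allclose(vectors[0], vectors[1]))  # False
```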


