• Stars
    star
    148
  • Rank 249,221 (Top 5 %)
  • Language
    Jupyter Notebook
  • Created almost 2 years ago
  • Updated 11 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Analysis of OpenAI's ChatGPT

ChatGPT

ChatGPT is a text-based AI assistant by OpenAI. This is an analysis of ChatGPT.

Analysis

Models available via chat.openai.com

GPT-3.5

title slug max_tokens description
Default (GPT-3.5) text-davinci-002-render-sha 8191 Our fastest model, great for most everyday tasks.
Default (GPT-3.5) (Mobile) text-davinci-002-render-sha-mobile 8191 Our fastest model, great for most everyday tasks.
Code Interpreter text-davinci-002-render-sha-code-interpreter nan nan
Plugins text-davinci-002-render-sha-plugin nan nan

GPT-4

title slug max_tokens description
GPT-4 gpt-4 4095 Our most capable model, great for tasks that require creativity and advanced reasoning.
Plugins gpt-4-plugins 8192 An experimental model that knows when and how to use plugins
GPT-4 (Mobile, V2) gpt-4-mobile 4095 Our most capable model, great for tasks that require creativity and advanced reasoning.
Code Interpreter gpt-4-code-interpreter nan nan

Models available via platform.openai.com

Model Max tokens Training data (pre-training)
gpt-4-0613 8192 Up to Sep 2021
gpt-4-32k-0613 32768 Up to Sep 2021
gpt-3.5-turbo-0613 4096 Up to Sep 2021
gpt-3.5-turbo-16k-0613 16384 Up to Sep 2021
gpt-4-0314 8192 Up to Sep 2021
gpt-4-32k-0314 32768 Up to Sep 2021
gpt-3.5-turbo-0301 4096 Up to Sep 2021

Completion

If we send the message [{"role": "user", "content": "13+37="}] to the model, it returns the following chat completion response:

Model Completion
gpt-3.5-turbo-0613 Link
gpt-4-0613 Link

Vocabulary

The ChatGPT models employ a distinct vocabulary compared to their predecessors. The models use a cl100k_base vocabulary with 100,000 tokens, where each token encodes an average of 3.7 characters in English. For a detailed analysis, see vocab.ipynb.

Additionally, these models employ the Chat Markup Language.

Tokenizer

The number of prompt tokens and completion tokens are computed as follows (see OpenAI-Cookbook for details):

prompt_tokens = ['<|im_start|>', 'user', '\n', '13', '+', '37', '=', '<|im_end|>', '<|im_start|>', 'assistant', '<|message|>']
# len(tokens) is 11
completion_tokens = ['50']
# len(tokens) is 1

Performance

We have evaluated the ChatGPT model gpt-4-0613 with the HumanEval dataset. Out of 164 programming problems, the model can solve 86.59%.

Model name Pass@1 Date Comment Results Prompt
 gpt-4-0613 86.59% 2023-06-13 https://platform.openai.com/ Link Complete the following code. Use ```python to put the completed Python code in markdown quotes:\n{code}
 gpt-3.5-turbo-0613 71.34% 2023-06-13 https://platform.openai.com/ Link Complete the following code. Use ```python to put the completed Python code in markdown quotes:\n{code}
 gpt-4-0314 78.66% 2023-03-17 https://platform.openai.com/ Link Complete the following code:\n{code}
 gpt-3.5-turbo-0301 74.39% 2023-04-24 https://platform.openai.com/ Link Complete the following code. Use ```python to put the Python code in markdown quotes:\n{code}
 text-davinci-002-render-sha 70.12% 2023-02-19 https://chat.openai.com/
 text-davinci-002-render 56.10% 2022-12-03 https://chat.openai.com/
cushman-ml 56.10% 2022-10-23 Copilot
code-davinci-002 46.95% 2022-10-23 https://platform.openai.com/
code-cushman-001 32.93% 2022-10-23 https://platform.openai.com/