ChatGPT, GenerativeAI and LLMs Timeline
This repository organizes a timeline of key events (products, services, papers, GitHub, blog posts and news) that occurred before and after the ChatGPT announcement.
It's curating a variety of information in this timeline, with a particular focus on LLM and Generative AI.
Maybe it's a scene from the hottest history, so I thought it would be important to keep those memories well, so I organized them.
Contributing
Issues and Pull Requests are greatly appreciated. If you've never contributed to an open source project before I'm more than happy to walk you through how to create a pull request.
You can start by opening an issue describing the problem that you're looking to resolve and we'll go from there.
Emoji
arXiv
License
This document is licensed under the MIT license ยฉ Jonghong Jeon
Date | Announcement |
---|---|
6.17 | Understanding Encoder And Decoder LLMs (blog) |
6.16 | Language-Guided Music Recommendation for Video via Prompt Analogies ( |
6.16 | QR Code AI Art Generator (tweet), (Hugging face), (SD art) |
6.16 | Standford CRFM - Transparency Index for Foundation Model Provider's Compliance measurement with the Draft EU AI Act (tweet), () |
6.15 | LOVM: Language-Only Vision Model Selection ( |
6.15 | WizardCoder: Empowering Code Large Language Models with Evol-Instruct ( |
6.15 | Segment Any Point Cloud Sequences by Distilling Vision Foundation Models ( |
6.15 | Seeing the World through Your Eyes ( |
6.15 | Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models ( |
6.15 | Can Language Models Teach Weaker Agents? Teacher Explanations Improve Students via Theory of Mind ( |
6.15 | Segment Any Point Cloud Sequences by Distilling Vision Foundation Models ( |
6.15 | ChessGPT: Bridging Policy Learning and Language Modeling ( |
6.15 | [SCIENCE] Art and the science of generative AI, Vol 380, Issue 6650, (DOI: 10.1126/science.adh4451) |
6.14 | Unifying Large Language Models and Knowledge Graphs: A Roadmap ( |
6.14 | Knowledge Distillation of Large Language Models ( |
6.14 | TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement ( |
6.14 | EU MEPs ready to negotiate first-ever rules for safe and transparent AI (news) |
6.14 | TryOnDiffusion: A Tale of Two UNets ( |
6.14 | AssistGPT: A General Multi-modal Assistant that can Plan, Execute, Inspect, and Learn ( |
6.14 | Stable Diffusion with Core ML on Apple Silicon (tweet), () |
6.13 | Scalable 3D Captioning with Pretrained Models ( |
6.13 | Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation ( |
6.13 | arXiVeri: Automatic table verification with GPT ( |
6.13 | AVIS: Autonomous Visual Information Seeking with Large Language Models ( |
6.13 | AniFaceDrawing: Anime Portrait Exploration during Your Sketching ( |
6.13 | h2oGPT: Democratizing Large Language Models ( |
6.13 | 3D molecule generation by denoising voxel grids |
6.13 | GeneCIS: A Benchmark for General Conditional Image Similarity (project page), ( |
6.13 | Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration () |
6.13 | One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning ( |
6.13 | GitHub survey result - 92% of U.S.-based developers are already using AI coding tools both in and outside of work (blog) |
6.13 | ChatGPT Workspaces - Upcoming ChatGPT features: file uploading, profiles, organizations and workspaces (reddit) |
6.12 | Transformers learn through gradual rank increase ( |
6.12 | Large Language Models as Tax Attorneys: A Case Study in Legal Capabilities Emergence ( |
6.12 | Augmenting Language Models with Long-Term Memory ( |
6.12 | Yann LeCun and Geoffrrey Hinton's Consensus on a number of questions about AI and catastrophic risks (tweet) |
6.12 | Conversation of Andrew Ng and Geoffrey Hinton about AI and catastrophic risks (tweet) |
6.12 | Benchmarking Neural Network Training Algorithms ( |
6.12 | Lit-llama - Implementation of the LLaMA language model based on nanoGPT () |
6.12 | OpenAI, DeepMind will open up models to UK government (news) |
6.12 | WizardLM: An Instruction-following LLM Using Evol-Instruct () |
6.11 | Face0: Instantaneously Conditioning a Text-to-Image Model on a Face ( |
6.11 | A Comprehensive Survey on Applications of Transformers for Deep Learning Tasks ( |
6.9 | Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions ( |
6.9 | Judging LLM-as-a-judge with MT-Bench and Chatbot Arena ( |
6.9 | Evaluating the Social Impact of Generative AI Systems in Systems and Society ( |
6.9 | Can Large Language Models Infer Causation from Correlation? ( |
6.9 | FinGPT: Open-Source Financial Large Language Models ( |
6.8 | On the Reliability of Watermarks for Large Language Models ( |
6.8 | PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization ( |
6.8 | How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources ( |
6.8 | StableDiffusion - Clipdrop Launches Uncrop: The Ultimate Aspect Ratio Editor (blog) |
6.8 | Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models ( |
6.8 | Simple and Controllable Music Generation ( |
6.8 | Tracking Everything Everywhere All at Once (project page), ( |
6.8 | Understanding GPT tokenizers (blog) |
6.7 | Learning to Ground Instructional Articles in Videos through Narrations ( |
6.7 | Emergent Correspondence from Image Diffusion ( |
6.7 | Certified Reasoning with Language Models ( |
6.7 | Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions ( |
6.7 | Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks ( |
6.7 | M$^3$IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning ( |
6.7 | PromptBench: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts ( |
6.7 | INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models ( |
6.7 | ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models ( |
6.7 | Deductive Verification of Chain-of-Thought Reasoning ( |
6.6 | ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory ( |
6.6 | InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models ( |
6.6 | HeadSculpt: Crafting 3D Head Avatars with Text ( |
6.6 | MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion ( |
6.6 | Neuralangelo: High-Fidelity Neural Surface Reconstruction ( |
6.6 | PokemonChat: Auditing ChatGPT for Pokรฉmon Universe Knowledge ( |
6.6 | A Static Evaluation of Code Completion by Large Language Models ( |
6.6 | Large Language Models of Code Fail at Completing Code with Potential Bugs ( |
6.6 | Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias ( |
6.6 | Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis ( |
6.6 | Recognize Anything: A Strong Image Tagging Model ( |
6.6 | ATT3D: Amortized Text-to-3D Object Synthesis ( |
6.6 | Falcon-40B-Instruct is a 40B parameters causal decoder-only model built by TII based on Falcon-40B and finetuned on a mixture of Baize (HF) |
6.5 | LLM-Blender: Ensembling Large Language Models with Pairwise Ranking and Generative Fusion ( |
6.5 | Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding ( |
6.5 | PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Mode ( |
6.5 | Orca: Progressive Learning from Complex Explanation Traces of GPT-4 ( |
6.4 | SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model ( |
6.4 | A Technical Report for Polyglot-Ko: Open-Source Large-Scale Korean Language Models ( |
6.3 | VisualGPTScore: Visio-Linguistic Reasoning with Multimodal Generative Pre-Training Scores ( |
6.2 | Segment Anything in High Quality ( |
6.2 | The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only ( |
6.2 | StyleDrop: Text-To-Image Generation in Any Style (project page), ( |
6.1 | StableRep: Synthetic Images from Text-to-Image Models Make Strong Visual Representation Learners ( |
6.1 | The TIME - "The End of Humanity" cover (tweet), ("AI Is Not an Arms Race") |
6.1 | AutoGPTQ - An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm () |
6.1 | Wuerstchen: Efficient Pretraining of Text-to-Image Models ( |
6.1 | StyleGAN knows Normal, Depth, Albedo, and More ( |
6.1 | Diffusion Self-Guidance for Controllable Image Generation ( |
6.1 | Thought Cloning: Learning to Think while Acting by Imitating Human Thinking ( |
6.1 | Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles ( |
6.1 | The Hidden Language of Diffusion Models ( |
6.1 | Inserting Anybody in Diffusion Models via Celeb Basis ( |
6.1 | LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day ( |
6.1 | Birth of a Transformer: A Memory Viewpoint ( |
6.1 | SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds ( |
6.1 | Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance ( |
6.1 | ReviewerGPT? An Exploratory Study on Using Large Language Models for Paper Reviewing ( |
5.31 | The Impact of Positional Encoding on Length Generalization in Transformers ( |
5.31 | Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust ( |
5.31 | Discovering New Interpretable Conservation Laws as Sparse Invariants ( |
5.31 | Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor ( |
5.31 | Understanding and Mitigating Copying in Diffusion Models ( |
5.31 | PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning ( |
5.31 | Human or Not? A Gamified Approach to the Turing Test ( |
5.31 | OpenAI - Letโs Verify Step by Step (paper), ( |
5.31 | Humans in 4D: Reconstructing and Tracking Humans with Transformers ( |
5.31 | Improving CLIP Training with Language Rewrites ( |
5.31 | MuseCoco: Generating Symbolic Music from Text ( |
5.31 | MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training ( |
5.31 | CodeTF: One-stop Transformer Library for State-of-the-art Code LLM ( |
5.30 | Re-evaluating Word Mover's Distance ( |
5.30 | Bigger, Better, Faster: Human-level Atari with human-level efficiency ( |
5.30 | Japan Goes All In: Copyright Doesnโt Apply To AI Training (news) |
5.30 | A.I. Poses โRisk of Extinction,โ Industry Leaders Warn - (NYT news) |
5.30 | Statement on AI Risk - AI experts and public figures express their concern about AI risk (statement) |
5.30 | GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction ( |
5.30 | Nested Diffusion Processes for Anytime Image Generation ( |
5.30 | StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation ( |
5.30 | HiFA: High-fidelity Text-to-3D with Advanced Diffusion Guidance ( |
5.30 | Grammar Prompting for Domain-Specific Language Generation with Large Language Models ( |
5.30 | AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation ( |
5.30 | Ambient Diffusion: Learning Clean Distributions from Corrupted Data ( |
5.30 | ChatGPT and large language models in gastroenterology, (Nature Reviews Gastroenterology & Hepatology) |
5.30 | Blockwise Parallel Transformer for Long Context Large Models ( |
5.29 | A lawyer used ChatGPT to prepare a court filing. It went horribly awry. (CBS news) |
5.29 | Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors ( |
5.29 | RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths ( |
5.29 | Photoswap: Personalized Subject Swapping in Images ( |
5.29 | TaleCrafter: Interactive Story Visualization with Multiple Characters ( |
5.29 | GlyphControl: Glyph Conditional Control for Visual Text Generation ( |
5.29 | Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation ( |
5.29 | Faith and Fate: Limits of Transformers on Compositionality ( |
5.29 | PaLI-X: On Scaling up a Multilingual Vision and Language Model ( |
5.29 | Controllable Text-to-Image Generation with GPT-4 ( |
5.29 | Brainformers: Trading Simplicity for Efficiency ( |
5.28 | Geometric Algebra Transformers ( |
5.28 | Tab-CoT: Zero-shot Tabular Chain of Thought ( |
5.28 | Tab-CoT: Zero-shot Tabular Chain of Thought ( |
5.28 | FuseCap: Leveraging Large Language Models to Fuse Visual Data into Enriched Image Captions ( |
5.28 | Introducing NVIDIA ACE For Games - Spark Life Into Virtual Characters With Generative AI (blog) |
5.27 | SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks ( |
5.27 | The Curse of Recursion: Training on Generated Data Makes Models Forget ( |
5.27 | DNA-GPT: Divergent N-Gram Analysis for Training-Free Detection of GPT-Generated Text ( |
5.27 | What indeed can GPT models do in chemistry? A comprehensive benchmark on eight tasks ( |
5.27 | WingmanAI - real-time transcription of audio, integrated with ChatGPT for interactive use () |
5.27 | ToolBench - Large-scale instruction tuning SFT data to equip LLMs with general tool-use capability () |
5.27 | G7 officials to hold first meeting on AI regulation next week (news) |
5.26 | Chain-of-Thought Hub: A Continuous Effort to Measure Large Language Models' Reasoning Performance ( |
5.26 | A new antibiotic, discovered with artificial intelligence, may defeat a dangerous superbug (CNN news) |
5.26 | Generating Images with Multimodal Language Models ( |
5.26 | Backpack Language Models ( |
5.26 | Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing ( |
5.26 | Playing repeated games with Large Language Models ( |
5.26 | Training Socially Aligned Language Models in Simulated Human Society ( |
5.26 | BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks ( |
5.26 | Large Language Models as Tool Makers ( |
5.26 | ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation ( |
5.25 | Deep learning-guided discovery of an antibiotic targeting Acinetobacter baumannii, (nature chemical biology https://doi.org/10.1038/s41589-023-01349-8), (), (Cloned snapshot) |
5.25 | Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory ( |
5.25 | Role-Play with Large Language Models ( |
5.25 | Break-A-Scene: Extracting Multiple Concepts from a Single Image ( |
5.25 | Voyager: An Open-Ended Embodied Agent with Large Language Models (Project page), ( |
5.25 | Efficient Neural Music Generation ( |
5.25 | Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models ( |
5.25 | On Architectural Compression of Text-to-Image Diffusion Models ( |
5.25 | Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models ( |
5.25 | The False Promise of Imitating Proprietary LLMs ( |
5.25 | the new Stable Diffusion โReimagine XLโ model on @ClipdropApp x @StabilityAI (tweet), (Clipdrop) |
5.25 | Gorilla: Large Language Model Connected with Massive APIs (tweet), (project page), ( |
5.25 | OpenAI - Democratic Inputs to AI (Tweet), (Blog) |
5.24 | Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models ( |
5.24 | Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models ( |
5.24 | Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models ( |
5.24 | A majority of Americans have heard of ChatGPT, but few have tried it themselves (Pew Research Center news) |
5.24 | Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective ( |
5.24 | Think Before You Act: Decision Transformers with Internal Working Memory ( |
5.24 | PandaGPT: One Model to Instruction-Follow Them All (project page), ( |
5.24 | SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning ( |
5.24 | Manifold Diffusion Fields ( |
5.24 | A Neural Space-Time Representation for Text-to-Image Personalization ( |
5.24 | Can Transformers Learn to Solve Problems Recursively? ( |
5.24 | This Land is {Your, My} Land: Evaluating Geopolitical Biases in Language Models ( |
5.24 | Model evaluation for extreme risks ( |
5.24 | State of GPT and RLHF LLMs - Andrej Karpathy, OpenAI (session), (video) |
5.24 | LMs with a Voice: Spoken Language Modeling beyond Speech Tokens ( |
5.24 | BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing ( |
5.23 | Transformer-based Vulnerability Detection in Code at EditTime: Zero-shot, Few-shot, or Fine-tuning? |
5.23 | Unityโs Project Barracuda Injects Generative AI Into Games To Kickstart Exponential Growth (Forbes news) |
5.23 | VisorGPT: Learning Visual Prior via Generative Pre-Training ( |
5.23 | ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models ( |
5.23 | OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities ( |
5.23 | Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks ( |
5.23 | Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach ( |
5.23 | Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models ( |
5.23 | Aligning Large Language Models through Synthetic Feedback ( |
5.23 | LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond ( |
5.23 | Lost in Translation: Large Language Models in Non-English Content Analysis (news) |
5.23 | Anchor Prediction: Automatic Refinement of Internet Links ( |
5.23 | Coarse-to-Fine Contrastive Learning in Image-Text-Graph Space for Improved Vision-Language Compositionality ( |
5.23 | Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training ( |
5.23 | PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents ( |
5.23 | Bing at Microsoft Build 2023: Continuing the Transformation of Search (blog) |
5.23 | Bringing the power of AI to Windows 11 โ unlocking a new era of productivity for customers and developers with Windows Copilot and Dev Home (blog) |
5.23 | Adobe Unveils Future of Creative Cloud With Generative AI as a Creative Co-Pilot in Photoshop (news), (blog) |
5.23 | QLoRA: Efficient Finetuning of Quantized LLMs ( |
5.22 | SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation ( |
5.22 | Meta-in-context learning in large language models ( |
5.22 | AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation ( |
5.22 | Iterative Forward Tuning Boosts In-context Learning in Language Models ( |
5.22 | How Language Model Hallucinations Can Snowball ( |
5.22 | Intel Announces Aurora genAI, Generative AI Model With 1 Trillion Parameters (news), (Intel newsroom) |
5.22 | Introducing Mind-Video (Tweet), (demo), (data) |
5.22 | Reflective Linguistic Programming (RLP): A Stepping Stone in Socially-Aware AGI (SocialAGI) ( |
5.22 | GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints ( |
5.22 | LM vs LM: Detecting Factual Errors via Cross Examination ( |
5.22 | XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages ( |
5.22 | VideoLLM: Modeling Video Sequence with Large Language Models ( |
5.22 | RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text ( |
5.22 | RWKV: Reinventing RNNs for the Transformer Era ( |
5.22 | Introducing speech-to-text, text-to-speech, and more for 1,100+ languages (Blog), ( |
5.21 | Augmenting Autotelic Agents with Large Language Models ( |
5.21 | XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models (), (Video) |
5.20 | G7 Hiroshima Leadersโ Communiquรฉ (statement), (html) |
5.20 | G7 calls for developing global technical standards for AI (news) |
5.20 | Labour should pledge ยฃ11bn to build โBritGPTโ AI, thinktank says (news) |
5.20 | CodeCompose: A Large-Scale Industrial Deployment of AI-assisted Code Authoring ( |
5.19 | OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models ( |
5.19 | Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes ( |
5.19 | Clinical Camel: An Open-Source Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding ( |
5.19 | New York City public schools remove ChatGPT ban (news) |
5.19 | Graphologue: Exploring Large Language Model Responses with Interactive Diagrams ( |
5.19 | The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics ( |
5.19 | HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation ( |
5.19 | Characterizing tradeoffs between teaching via language and demonstrations in multi-agent systems ( |
5.19 | TELeR: A General Taxonomy of LLM Prompts for Benchmarking Complex Tasks ( |
5.19 | Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields ( |
5.19 | Chupa: Carving 3D Clothed Humans from Skinned Shape Priors using 2D Diffusion Probabilistic Models ( |
5.19 | Comparing Software Developers with ChatGPT: An Empirical Investigation ( |
5.19 | CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing ( |
5.19 | Multimodal Web Navigation with Instruction-Finetuned Foundation Models ( |
5.19 | Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity ( |
5.19 | Scaling laws for language encoding models in fMRI ( |
5.19 | Any-to-Any Generation via Composable Diffusion ( |
5.19 | ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings ( |
5.19 | Apple Bans Employees From Using ChatGPT Amid Its Own AI Efforts (news) |
5.18 | Brain-inspired learning in artificial neural networks: a review ( |
5.18 | ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities ( |
5.18 | RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture ( |
5.18 | LIMA: Less Is More for Alignment ( |
5.18 | GETMusic: Generating Any Music Tracks with a Unified Representation and Diffusion Framework ( |
5.18 | SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities ( |
5.18 | mLongT5: A Multilingual and Efficient Text-To-Text Transformer for Longer Sequences ( |
5.18 | Language Models Meet World Models: Embodied Experiences Enhance Language Models ( |
5.18 | Roundhill Investments Launches Generative AI & Technology ETF (NYSE Arca: CHAT) (news), (CHAT ETF) |
5.18 | VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks ( |
5.18 | Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold ( |
5.18 | PyLLMs - a minimal Python library to connect to LLMs (OpenAI, Anthropic, Google, AI21, Cohere, Aleph Alpha, HuggingfaceHub) () |
5.18 | Evidence of Meaning in Language Models Trained on Programs ( |
5.18 | Introducing the ChatGPT app for iOS (blog), (Download on the App Stor) |
5.18 | MTIA v1: Metaโs first-generation AI inference accelerator (blog) |
5.18 | Pursuing groundbreaking scale and accelerating research using Metaโs Research SuperCluster (blog) |
5.18 | Reimagining Metaโs infrastructure for the AI age (blog) |
5.17 | Chain-of-Symbol Prompting Elicits Planning in Large Langauge Models ( |
5.17 | DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining ( |
5.17 | Explaining black box text modules in natural language with language models ( |
5.17 | Tree of Thoughts: Deliberate Problem Solving with Large Language Models ( |
5.17 | Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback ( |
5.17 | PMC-VQA: Visual Instruction Tuning for Medical Visual Question Answering ( |
5.17 | What You See is What You Read? Improving Text-Image Alignment Evaluation ( |
5.17 | PaLM 2 Technical Report ( |
5.17 | Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback ( |
5.17 | SoundStorm: Efficient Parallel Audio Generation ( |
5.16 | Performance of ChatGPT on a Radiology Board-style Examination: Insights into Current Strengths and Limitations (RSNA Radiology, https://doi.org/10.1148/radiol.230582) |
5.16 | AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation ( |
5.16 | Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation ( |
5.16 | ChatGPT versus human in generating medical graduate exam questions โ An international prospective study (medRxiv), ( |
5.16 | Understanding 3D Object Interaction from a Single Image ( |
5.16 | StructGPT: A General Framework for Large Language Model to Reason over Structured Data ( |
5.16 | FitMe: Deep Photorealistic 3D Morphable Model Avatars ( |
5.16 | Pre-Training to Learn in Context ( |
5.16 | Towards Expert-Level Medical Question Answering with Large Language Models ( |
5.16 | GPTeam: Collaborative AI Agents () |
5.16 | WATCH LIVE: OpenAI CEO Sam Altman testifies on artificial intelligence before Senate committee (Youtube) |
5.16 | NYT - Microsoft Says New A.I. Shows Signs of Human Reasoning |
5.15 | Common Diffusion Noise Schedules and Sample Steps are Flawed ( |
5.15 | Symbol tuning improves in-context learning in language models ( |
5.15 | Interpretability at Scale: Identifying Causal Mechanisms in Alpaca ( |
5.15 | DarkBERT: A Language Model for the Dark Side of the Internet ( |
5.15 | AutoRecon: Automated 3D Object Discovery and Reconstruction ( |
5.15 | RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs ( |
5.15 | Small Models are Valuable Plug-ins for Large Language Models ( |
5.15 | "ChatGPT can pick stocks better then top fund managers" - The ChatGPT Fund - (tweet), (website) |
5.15 | officially launching the Poe API - (Tweet, (): (poe-protocol), (api-bot-tutorial) |
5.15 | Guidance - A guidance language for controlling large language models () |
5.15 | BriefGPT - Locally hosted tool that connects documents to LLMs for summarization and querying, with a simple GUI () |
5.15 | Iโm an ER doctor. Hereโs how Iโm already using ChatGPT to help treat patients (blog) |
5.14 | How to run Llama 13B with a 6GB graphics card (Gist) |
5.13 | Leaked Copilot Chat's confidential rules (tweet) |
5.13 | GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content (arXiv](https://arxiv.org/abs/2305.07969)), ( |
5.13 | Everything-LLMs-And-Robotics - The world's largest GitHub Repository for LLMs + Robotics () |
5.13 | CodeT5+: Open Code Large Language Models for Code Understanding and Generation |
5.13 | EU AI Act To Target US Open Source Software (Blog) |
5.13 | PCAST Working Group on Generative AI Invites Public Input (Blog) |
5.12 | spacy-llm, an extension for integrating LLMs into structured NLP pipelines! (), (tweet) |
5.12 | TinyStories: How Small Can Language Models Be and Still Speak Coherent English? ( |
5.12 | Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation ( |
5.12 | ArtGPT-4: Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4 ( |
5.12 | MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers ( |
5.12 | AI FILM -The Carnival of the Ages - Runway gen2 (Youtube), (Reddit) |
5.11 | Large Language Models Can Be Used To Effectively Scale Spear Phishing Campaigns ( |
5.11 | Towards best practices in AGI safety and governance: A survey of expert opinion ( |
5.11 | Optimizing Memory Mapping Using Deep Reinforcement Learning ( |
5.11 | Universal Source Separation with Weakly Labelled Data ( |
5.11 | Active Retrieval Augmented Generation ( |
5.11 | Anthropic - Introducing 100K Context Windows (Blog) |
5.11 | CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model ( |
5.11 | Exploiting Diffusion Prior for Real-World Image Super-Resolution ( |
5.11 | Domain Incremental Lifelong Learning in an Open World ( |
5.11 | Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting ( |
5.11 | Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers ( |
5.11 | EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention ( |
5.11 | InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning ( |
5.11 | Huggingface Transformers Agent (API) |
5.11 | Google PaLM 2 Technical Report ( |
5.11 | Google MusicLM (Demo), (news) |
5.10 | HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion ( |
5.10 | VideoChat: Chat-Centric Video Understanding ( |
5.10 | Bot or Human? Detecting ChatGPT Imposters with A Single Question ( |
5.10 | Do LLMs Understand User Preferences? Evaluating LLMs On User Rating Prediction ( |
5.10 | Relightify: Relightable 3D Faces from a Single Image via Diffusion Models ( |
5.10 | Similarity of Neural Network Models: A Survey of Functional and Representational Measures ( |
5.10 | Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era ( |
5.10 | MPT-7B StoryWriter- new open-source language model that can handle really long inputs (Replicate) |
5.10 | Humata.ai - Ask AI anything about your files (Tweet) |
5.10 | IMAGEBIND: One Embedding Space To Bind Them All ( |
5.9 | StarCoder: may the source be with you! ( |
5.9 | Towards Building the Federated GPT: Federated Instruction Tuning ( |
5.9 | Large Language Model Programs ( |
5.9 | FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance ( |
5.9 | OpenAI - Language models can explain neurons in language models (Blog), (Paper), (), (Tweet) |
5.9 | AvatarReX: Real-time Expressive Full-body Avatars ( |
5.8 | Augmented Large Language Models with Parametric Knowledge Guiding ( |
5.8 | We had ChatGPT take the CPA exam โ and it failed (news) |
5.8 | Comparison of GPT-3.5, GPT-4, and human user performance on a practice ophthalmology written examination (Nature) |
5.8 | MultiModal-GPT: A Vision and Language Model for Dialogue with Humans ( |
5.7 | SuperAgent - Deploy LLM Agents to production () |
5.7 | Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models ( |
5.7 | X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages ( |
5.7 | Multi-Space Neural Radiance Fields ( |
5.7 | Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting ( |
5.7 | Yoshua Bengio - AI Scientists: Safe and Useful AI? (Blog) |
5.5 | privateGPT - Interact privately with your documents using the power of GPT, 100% privately, no data leaks (), (star history) |
5.5 | Open LLMs : A list of open LLMs available for commercial use - () |
5.5 | A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding ( |
5.5 | Otter: A Multi-Modal Model with In-Context Instruction Tuning ( |
5.5 | Composite Motion Learning with Task Control ( |
5.5 | StarCoderBase: trained on 1T tokens in 80+ programming languages (Huggingface) |
5.5 | Dolphin: General video interaction platform based on LLMs (Demo), (), (Tweet) |
5.5 | MPT-7B: A New Standard for Open-Source, Commercially Usable LLMs (Blog), Commercially usable: (MPT-7B) (MPT-7B-Instruct), (MPT-7B-StoryWriter), For non-commerical use: (MPT-7B-Chat) |
5.5 | StarCoder: A State-of-the-Art LLM for Code (Blog), (), (HuggingFace), (Tweet) |
5.5 | OpenAlpaca, an instruction-following model based on OpenLLaMA (), (Huggingface), (Tweet) |
5.4 | Seeing is Believing: Brain-Inspired Modular Training for Mechanistic Interpretability ( |
5.4 | Evaluating the Performance of ChatGPT in Ophthalmology: An Analysis of its Successes and Shortcomings (Ophthalmology Science) |
5.4 | Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction ( |
5.4 | Governance of the AI, by the AI, and for the AI ( |
5.4 | Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs ( |
5.4 | Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion ( |
5.4 | AttentionViz: A Global View of Transformer Attention ( |
5.4 | Reddit - OpenAI lost $540M in 2022, will need $100B more to develop AGI, says Altman. My breakdown on why this matters and what it means for other AI startups |
5.4 | FACT SHEET: Biden-โ Harris Administration Announces New Actions to Promote Responsible AI Innovation that Protects Americansโ Rights and Safety - (White house) |
5.4 | Google "We Have No Moat, And Neither Does OpenAI" - (Blog) |
5.4 | CNBC - Britain launches probe into ChatGPT-style A.I. as regulators grow concerned by risks |
5.4 | Personalize Segment Anything Model with One Shot ( |
5.4 | AutoML-GPT: Automatic Machine Learning with GPT ( |
5.4 | NeRSemble: Multi-view Radiance Field Reconstruction of Human Heads ( |
5.4 | An automatically discovered chain-of-thought prompt generalizes to novel models and datasets ( |
5.4 | NYT - White House Pushes Tech C.E.O.s to Limit Risks of A.I. |
5.4 | Microsoft Bing AI chatbot and Edge browser get massive AI upgrades. See the list. (Blog) |
5.4 | Introducing Slack GPT (Blog) |
5.3 | Chatbot Arena: Benchmarking LLMs in the Wild with Elo Ratings - (Blog) |
5.3 | CodeGen2: Lessons for Training LLMs on Programming and Natural Languages ( |
5.3 | Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes ( |
5.3 | Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings ( |
5.3 | AG3D: Learning to Generate 3D Avatars from 2D Image Collections ( |
5.3 | Shap-E: Generating Conditional 3D Implicit Functions ( |
5.3 | 100 Practical Applications and Use Cases of Generative AI - ( |
5.3 | Comprehensive LLM model zoo - Ecosystem Graphs to track the foundation model ecosystem assets (datasets, models, and applications) and their relationship (Table), (Graph), () |
5.3 | GPTutor: a ChatGPT-powered programming tool for code explanation ( |
5.3 | Midjourney 5.1 Arrives - And Itโs Another Leap Forward For AI Art - (Forbes) |
5.3 | Mojo |
5.3 | #NeurIPS2023 Creative AI Track (Blog), (Call for proposal) |
5.3 | HeyPi - Personal AI |
5.2 | Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl ( |
5.2 | Andrew Ng - ChatGPT Prompt Engineering for Developers - (online course), (Tweet) |
5.2 | DreamPaint: Few-Shot Inpainting of E-Commerce Items for Virtual Try-On without 3D Modeling ( |
5.2 | Generalizing Dataset Distillation via Deep Generative Prior ( |
5.2 | Multimodal Procedural Planning via Dual Text-Image Prompting ( |
5.2 | WSJ - Google DeepMind CEO Says Some Form of AGI Possible in a Few Years |
5.2 | Latest NVIDIA Graphics Research Advances Generative AIโs Next Frontier (Blog) |
5.2 | Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation ( |
5.2 | TMR: Text-to-Motion Retrieval Using Contrastive 3D Human Motion Synthesis ( |
5.2 | Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation ( |
5.2 | Unlimiformer: Long-Range Transformers with Unlimited Length Input ( |
5.2 | Bark - Text-Prompted Generative Audio Model () |
5.2 | Jsonformer: A Bulletproof Way to Generate Structured JSON from Language Models () |
5.1 | scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI (bioXiv), ( |
5.1 | The Guardian - AI makes non-invasive mind-reading possible by turning thoughts into text |
5.1 | Learning to Reason and Memorize with Self-Notes ( |
5.1 | Poisoning Language Models During Instruction Tuning ( |
5.1 | What Do Self-Supervised Vision Transformers Learn? ( |
5.1 | NYT - โThe Godfather of A.I.โ Leaves Google and Warns of Danger Ahead (Archive) |
4.30 | ChatGPT: Is this version good for healthcare and research? - (ScienceDirect) |
4.30 | Understanding Parameter-Efficient LLM Finetuning: Prompt Tuning And Prefix Tuning (Blog) |
4.30 | A brief history of LLaMA models (Blog) |
4.30 | BabyBeeAGI: Task Management and Functionality Expansion on top of BabyAGI (blog), (Replit), (), (OG BaybyAGI) |
4.30 | Results of G7 Digital and Tech Ministersโ Meeting in Takasaki, Gunma - (Summary), (Declaration), (Annex1), (Annex2), (Annex3), (Annex4), (Annex5) |
4.30 | PandaLM: Reproducible and Automated Language Model Assessment () |
4.29 | Can ChatGPT Pass An Introductory Level Functional Language Programming Course? ( |
4.29 | A Review of ChatGPT Applications in Education, Marketing, Software Engineering, and Healthcare: Benefits, Drawbacks, and Research Directions ( |
4.29 | ChatGPT-2D, which can generate mind maps with AI - (Tweet), (ChatGPT-2D) |
4.29 | MLC LLM - an open framework that brings language models (LLMs) directly into a broad class of platforms (CUDA, Vulkan, Metal) with GPU acceleration (Tweet), (Demo), () |
4.29 | GenOs Index - The April (aka the Frenetic Pace) Edition - (blog) |
4.29 | StableVicuna, the AI Worldโs First Open Source RLHF LLM Chatbot! - (Blog), (Tweet) |
4.29 | DeepFloyd - a state-of-the-art text-to-image model (Web), (), (HuggingFace demo), (Tweet) |
4.29 | When Patient Questions Are Answered With Higher Quality and Empathy by ChatGPT than Physicians - (Blog) |
4.29 | BMTools - Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins () |
4.29 | FastChat-T5 (), (Tweet) |
4.29 | Lamini, the LLM Engine for Rapidly Customizing Models - (Blog) |
4.28 | EU proposes new copyright rules for generative AI - (Reuter), (Economic times) |
4.28 | PROMPTENGINEERING FORCHATGPTA QUICKGUIDE TOTECHNIQUES, TIPS,ANDBESTPRACTICES - ( |
4.28 | ResiDual: Transformer with Dual Residual Connections ( |
4.28 | Causal Reasoning and Large Language Models: Opening a New Frontier for Causality ( |
4.28 | We Interviewed the Engineer Google Fired for Saying Its AI Had Come to Life (Futurism) |
4.28 | LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model ( |
4.28 | MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks ( |
4.28 | Are Emergent Abilities of Large Language Models a Mirage? ( |
4.28 | The Ultimate Battle of Language Models: Lit-LLaMA vs GPT3.5 vs Bloom vs โฆ. (Blog) |
4.28 | Otter, a multi-modal in-context learning model with instruction tuning - (), (Demo), (Youtube) |
4.28 | Economist - Yuval Noah Harari argues that AI has hacked the operating system of human civilisation (Archive) |
4.28 | Assessing the Potential of USMLE-Like Exam Questions Generated by GPT-4 (medRxiv), ( |
4.28 | JAMA - Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum - (paper) |
4.27 | ChatGPT as an Attack Tool: Stealthy Textual Backdoor Attack via Blackbox Generative Model Trigger ( |
4.27 | PMC-LLaMA: Further Finetuning LLaMA on Medical Papers ( |
4.27 | "Can ChatGPT Diagnose Me?" How Large Language Models will Transform Clinical Care - (Youtube) |
4.27 | Large Language Models Are State-of-the-Art Evaluators of Code Generation ( |
4.27 | Controlled Text Generation with Natural Language Instructions ( |
4.27 | A Survey of Large Language Models - version 8 ( |
4.27 | LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ( |
4.27 | DataComp: In search of the next generation of multimodal datasets ( |
4.27 | We're Afraid Language Models Aren't Modeling Ambiguity ( |
4.27 | Boston Dynamics robot dog can answer your questions now, thanks to ChatGPT - (ZDNet), (YouTube) |
4.27 | LlamaIndex & Deep Lake for Financial Statement Analysis (Blog) |
4.26 | Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning ( |
4.26 | Multidimensional Evaluation for Text Style Transfer Using ChatGPT ( |
4.26 | NPJ - Comparing scientific abstracts generated by ChatGPT to real abstracts with detectors and blinded human reviewers (Paper), ( |
4.26 | TopGPT โ the worldโs first Andrew Tate large language model |
4.26 | Multi-Party Chat: Conversational Agents in Group Settings with Humans and Models ( |
4.26 | MOSS, a 16B tool-augmented conversational language model (Tweet), () |
4.26 | Exploring the Curious Case of Code Prompts ( |
4.26 | Controllable Image Generation via Collage Representations ( |
4.26 | Unleashing Infinite-Length Input Capacity for Large-scale Language Models with Self-Controlled Memory System ( |
4.26 | TextDeformer: Geometry Manipulation using Text Guidance ( |
4.26 | Evaluation of GPT-3.5 and GPT-4 for supporting real-world information needs in healthcare delivery ( |
4.26 | Ray Conditioning: Trading Photo-consistency for Photo-realism in Multi-view Image Generation ( |
4.26 | Harnessing the Power of LLMs in Practice: A Survey on ChatGPT and Beyond ( |
4.26 | HuggingChat - the first open source alternative to ChatGPT |
4.25 | Time - The 'Don't Look Up' Thinking That Could Doom Us With AI (Archive) |
4.25 | AI-assisted coding: Experiments with GPT-4 ( |
4.25 | NVIDIA NeMo Guardrails helps enterprises keep applications built on large language models aligned with their safety and security requirements (Blog), () |
4.25 | Stable and low-precision training for large-scale vision-language models ( |
4.25 | AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head ( |
4.25 | Answering Questions by Meta-Reasoning over Multiple Chains of Thought ( |
4.25 | Patch-based 3D Natural Scene Generation from a Single Example ( |
4.25 | Generative AI at Work - (NBER), ( |
4.25 | Chatbot Arena |
4.24 | Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model ( |
4.24 | AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays ( |
4.24 | Pointersect: Neural Rendering with Cloud-Ray Intersection ( |
4.24 | A Cookbook of Self-Supervised Learning ( |
4.24 | On the Challenges of Using Black-Box APIs for Toxicity Evaluation in Research ( |
4.24 | Towards Realistic Generative 3D Face Models ( |
4.24 | TextMesh: Generation of Realistic 3D Meshes From Text Prompts ( |
4.24 | Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training Exam (TXIT): Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation Oncology ( |
4.24 | Social AGI - SAMANTHA (Self-Reflective Artificial Mind Attuned to Naturalistic Thought and Human Adaptability) () |
4.24 | Segment Anything in Medical Images ( |
4.24 | Segment Anything in 3D with NeRFs ( |
4.24 | WizardLM: Empowering Large Language Models to Follow Complex Instructions ( |
4.24 | Track Anything: Segment Anything Meets Videos ( |
4.24 | OpenAI Brand guidelines - (blog) |
4.24 | GPT4Tools: Teaching LLM to Use Tools via Self-instruction - (Project page), (), (Video), |
4.24 | RAM: Relate-Anything-Model (), (Demo) |
4.24 | Chart-GPT 1.0 |
4.23 | Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models ( |
4.23 | Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness ( |
4.22 | Boosting Theory-of-Mind Performance in Large Language Models via Prompting ( |
4.22 | LaMP: When Large Language Models Meet Personalization ( |
4.22 | Finetuning Large Language Models (Blog) |
4.21 | Can GPT-4 Perform Neural Architecture Search? ( |
4.21 | Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition ( |
4.21 | Emergent and Predictable Memorization in Large Language Models ( |
4.21 | CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval ( |
4.21 | Bard now helps you code with support for 20+ langs (Python, C++, JS, Go, etc.). (Blog) |
4.21 | Inducing anxiety in large language models increases exploration and bias ( |
4.20 | Why Does ChatGPT Fall Short in Answering Questions Faithfully? ( |
4.20 | FinChat.io - The Chat GPT for Finance |
4.20 | LlamaAcademy: Teaching Llamas How to Code () |
4.20 | Announcing Google DeepMind: DeepMind + Brain = Google DeepMind (Blog) |
4.20 | "Can ChatGPT Diagnose Me?" How Large Language Models will Transform Clinical Care. Thursday, April 27th, 2023 (RSVP) |
4.20 | StableLM: Stability AI Language Models (), (Blog) |
4.19 | Fundamental Limitations of Alignment in Large Language Models ( |
4.19 | Scaling Transformer to 1M tokens and beyond with RMT ( |
4.19 | Occupational Heterogeneity in Exposure to Generative AI - (paper), ( |
4.19 | The Unintended Consequences of Censoring Digital Technology -- Evidence from Italy's ChatGPT Ban ( |
4.19 | CompressGPT: Decrease Token Usage by ~70% (blog) |
4.19 | Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes ( |
4.19 | LLM as A Robotic Brain: Unifying Egocentric Memory and Control ( |
4.19 | Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agent ( |
4.19 | Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models ( |
4.19 | h2oai's LLM repositories - (h2ogpt), (h2o-llmstudio), (Huggingface) |
4.19 | Evaluating Verifiability in Generative Search Engines ( |
4.19 | How to train your own Large Language Models (Blog) |
4.19 | AI Playground from Vercel Labs (tweet) |
4.19 | StanfordBDHG HealthGPT (tweet), () |
4.19 | GPT4All-J : the first Apache-2 Licensed Chatbot that runs locally on your machine (), ( |
4.19 | PersonalPrivate.AI - system to advise on new patent ideas (tweet) |
4.18 | Economist - The world needs an international agency for artificial intelligence, say two AI experts (Archive) |
4.18 | CancerGPT: Few-shot Drug Pair Synergy Prediction using Large Pre-trained Language Models ( |
4.18 | Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions ( |
4.18 | Nature - Why open-source generative AI models are an ethical way forward for science |
4.18 | Autonomous Agents(BabyAGI, AutoGPT) & Agent Simulations(CAMEL, Generative Agents) (Blog) |
4.18 | AutoTaskFormer: Searching Vision Transformers for Multi-task Learning ( |
4.18 | SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, and More ( |
4.18 | Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models ( |
4.18 | Google - Differentially private heatmaps (Blog) |
4.18 | The Complete Beginners Guide To Autonomous Agents |
4.18 | Llama Lab - A repo dedicated to building cutting-edge AGI projects: llama_agi (inspired by babyagi) and auto_llama (inspired by autogpt) (), (Llama Hub) |
4.18 | Elon Musk to start ChatGPT rival called โTruthGPTโ (tweet) |
4.17 | MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing ( |
4.17 | Notice of the Cyberspace Administration of China on Public Comments on the "Administrative Measures for Generative Artificial Intelligence Services (Draft for Comment)" (Announcement) |
4.17 | Pretrained Language Models as Visual Planners for Human Assistance ( |
4.17 | An Evaluation on Large Language Model Outputs: Discourse and Memorization ( |
4.17 | Epic, Microsoft bring generative AI to EHRs - ([Microsoft announcement](Microsoft and Epic expand strategic collaboration with integration of Azure OpenAI Service)) |
4.17 | BenchMD: A Benchmark for Modality-Agnostic Learning on Medical Images and Sensors ( |
4.17 | Towards Robust Prompts on Vision-Language Models ( |
4.17 | Tool Learning with Foundation Models ( |
4.17 | Low-code LLM: Visual Programming over LLMs ( |
4.17 | Wired - OpenAIโs CEO Says the Age of Giant AI Models Is Already Over |
4.17 | Synthetic Data from Diffusion Models Improves ImageNet Classification ( |
4.17 | RedPajama-Data: An Open Source Recipe to Reproduce LLaMA training dataset (GitHib) |
4.17 | Visual Instruction Tuning ( |
4.17 | Learning to Compress Prompts with Gist Tokens ( |
4.17 | ImpressionGPT: An Iterative Optimizing Framework for Radiology Report Summarization with ChatGPT ( |
4.17 | Meta - DINOv2: State-of-the-art computer vision models with self-supervised learning (blog), (), (Demo), ( |
4.17 | TypingMind - A better UI for ChatGPT (tweet) |
4.16 | Understanding Large Language Models (Blog) |
4.16 | INSIGHT - an autonomous AI that can do medical research () |
4.16 | GPT4free - use ChatGPT, for free!! - () |
4.16 | Solving Math Word Problems by Combining Language Models With Symbolic Solvers ( |
4.16 | ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human ( |
4.16 | Driving and suppressing the human language network using large language models (bioRxiv), ( |
4.16 | MultiGPT (). (tweet) |
4.16 | OpenAssistant Conversations - Democratizing Large Language Model Alignment ( |
4.16 | Auto-evaluator - lightweight evaluation tool for question-answering using Langchain () |
4.16 | NYT - Google Devising Radical Search Changes to Beat Back A.I. Rivals (Archive) |
4.15 | Brex's Prompt Engineering Guide () |
4.15 | Graphologue and Sensecape by UCSD Creativity Lab |
4.15 | Tractable Control for Autoregressive Language Generation ( |
4.15 | Web LLM - language model chats directly onto web browsers (Site), () |
4.15 | MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models (Project page). (Paper), (), (YouTube) |
4.15 | OpenAssistant - The world's largest open-source replication of ChatGPT (site), (), (Dataset - OASST1), (Paper), (YouTube), (Reddit) |
4.14 | HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge ( |
4.14 | ChatGPT: Applications, Opportunities, and Threats ( |
4.14 | Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding ( |
4.14 | OpenBB Terminal V3.0.0rc2 - () |
4.14 | Delta Denoising Score ( |
4.14 | DINOv2: Learning Robust Visual Features without Supervision ( |
4.14 | Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text ( |
4.14 | WSJ - Elon Musk Creates New Artificial Intelligence Company X.AI (archive), (FT) |
4.14 | Google Med-PaLM 2 - A responsible path to generative AI in healthcare |
4.14 | Meta's open source Animated Drawings - (Blog) |
4.14 | ControlNet v1.1 nightly - () |
4.13 | Teenage-AGI () |
4.13 | Boosted Prompt Ensembles for Large Language Models ( |
4.13 | ChatGPT-4 Outperforms Experts and Crowd Workers in Annotating Political Twitter Messages with Zero-Shot Learning ( |
4.13 | Soundini: Sound-Guided Diffusion for Natural Video Editing ( |
4.13 | Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study ( |
4.13 | Inpaint Anything: Segment Anything Meets Image Inpainting ( |
4.13 | GoalGPT by Nando.ai |
4.13 | Power-seeking can be probable and predictive for trained agents ( |
4.13 | GoalGPT by Nando.ai |
4.13 | Stable Diffusion XL Beta Available for API Customers and DreamStudio Users |
4.13 | NAB 2023: Introducing Text-Based Editing in Premiere Pro, Properties panel in After Effects, and much more |
4.13 | Announcing New Tools for Building with Generative AI on AWS - Amazon LLM (Titan), AWS fine-tuning model (Bedrock), Amazon copilot competitor (Code whisperer) |
4.13 | FT - We must slow down the race to God-like AI (archive) |
4.13 | Segment Everything Everywhere All at Once ( |
4.13 | Expressive Text-to-Image Generation with Rich Text ( |
4.13 | AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models ( |
4.12 | Can Large Language Models Transform Computational Social Science? ( |
4.12 | Galactic ChitChat: Using Large Language Models to Converse with Astronomy Literature ( |
4.12 | Performance of ChatGPT, GPT-4, and Google Bard on a Neurosurgery Oral Boards Preparation Question Bank (medRxiv), ( |
4.12 | ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning ( |
4.12 | Nature -Foundation models for generalist medical artificial intelligence ( |
4.12 | Dolly v2 - 12B parameter language model (Model weight), (), (Blog) |
4.11 | Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond ( |
4.11 | Toxicity in ChatGPT: Analyzing Persona-assigned Language Models ( |
4.11 | Multi-step Jailbreaking Privacy Attacks on ChatGPT ( |
4.11 | Building LLM applications for production |
4.11 | Emergent autonomous scientific research capabilities of large language models ( |
4.11 | OpenAIโs Bug Bounty Program |
4.11 | NTIAโs โAI Accountability Policy Request for Commentโ |
4.11 | WSJ - Biden Administration Weighs Possible Rules for AI Tools Like ChatGPT, (archive) |
4.11 | ChemCrow: Augmenting large-language models with chemistry tools ( |
4.11 | LangChainJS Support for Multiple JS Environments (tweet) |
4.11 | Teaching Large Language Models to Self-Debug ( |
4.10 | Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models (Paper), ( |
4.10 | On the Possibilities of AI-Generated Text Detection ( |
4.10 | OpenAGI: When LLM Meets Domain Experts ( |
4.9 | ChatAll - oncurrently chat with ChatGPT, Bing Chat, bard, Alpaca, Vincuna, Claude, ChatGLM, MOSS, iFlytek Spark, ERNIE and more, discover the best answers () |
4.9 | BabyAGI JS - () |
4.9 | AgentGPT - Auto-GPT directly in the browser (tweet), (), (demo) |
4.8 | A Recipe for Training Large Models |
4.7 | SuperPrompt Engineer Encourages ChatGPT Hallucinations |
4.7 | Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster ( |
4.7 | Why think step-by-step? Reasoning emerges from the locality of experience ( |
4.7 | Generative Agents: Interactive Simulacra of Human Behavior ( |
4.7 | Vicuna-7B: small, efficient, yet capable (), (Weight) |
4.7 | StackLlama (Blog), (Demo), () |
4.7 | SegGPT: Segmenting Everything In Context ( |
4.6 | Chrome ships WebGPU (Blog) |
4.6 | GPT detectors are biased against non-native English writers ( |
4.6 | ChaosGPT: Empowering GPT with Internet and Memory to Destroy Humanity (YouTube) |
4.6 | InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning ( |
4.6 | Wired - AI Desperately Needs Global Oversight |
4.6 | Instruction Tuning with GPT-4 ( |
4.6 | GeNVS: Generative Novel View Synthesis with 3D-Aware Diffusion Models ( |
4.6 | Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark ( |
4.5 | Yoshua Bengio - Slowing down development of AI systems passing the Turing test |
4.5 | Language models are on Replicate - FLAN-T5, GPT-J, and LLaMA (Blog) |
4.5 | Meta's Segment Anything Model (SAM) (Paper), ( |
4.4 | Calibrated Chaos: Variance Between Runs of Neural Network Training is Harmless and Inevitable ( |
4.4 | One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era ( |
4.4 | LangCahin raised $10 million in seed funding |
4.4 | Kandinsky 2.1 (), (HuggingFace) |
4.4 | The weights of Vicuna-13B released (WebUI demo) () |
4.4 | LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models ( |
4.4 | Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models ( |
4.3 | Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling ( |
4.3 | Vicuna-13B: An Open-Source ChatGPT Alternative That Impresses GPT-4 (Blog), () |
4.3 | Baby AGI () |
4.3 | Berkley just released Koala-13B! (Demo) |
4.3 | 2023 Artificial Intelligence (AI) Index Report Published by Stanford Institute for Human-Centered Artificial Intelligence (HAI) |
4.3 | The LLM playground - open source () |
4.3 | Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data ( |
4.2 | GPTCache : A Library for Creating Semantic Cache for LLM Queries - () |
4.2 | Better Language Models of Code through Self-Improvement ( |
4.2 | Eight Things to Know about Large Language Models ( |
4.2 | LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models ( |
4.1 | Italy curbs ChatGPT, starts probe over privacy concerns |
3.31 | Choose Your Weapon: Survival Strategies for Depressed AI Academics ( |
3.31 | CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society ( |
3.31 | A Survey of Large Language Models - Version 1 ( |
3.31 | (SCIENTIFIC AMERICAN) AI Chatbots Can Diagnose Medical Conditions at Home. How Good Are They? |
3.30 | ChatGPT in Healthcare: A Taxonomy and Systematic Review (medRxiv), ( |
3.30 | Launching the Generative AI Open Source (GenOS) Index - (Index), (Tweet) |
3.30 | Whose Opinions Do Language Models Reflect? ( |
3.30 | Language Models can Solve Computer Tasks ( |
3.30 | Self-Refine: Iterative Refinement with Self-Feedback ( |
3.30 | Humans in Humans Out: On GPT Converging Toward Common Sense in both Success and Failure ( |
3.30 | List of Open Sourced Fine-Tuned Large Language Models (LLM) |
3.30 | NEJM - Benefits, Limits, and Risks of GPT-4 as an AI Chatbot for Medicine |
3.30 | BloombergGPT: A Large Language Model for Finance ( |
3.30 | Got It AIโs ELMAR challenges GPT-4 and LLaMa, scores well on hallucination benchmarks |
3.30 | HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace ( |
3.30 | CAIDP claims "The FTC should investigate OpenAI and block GPT over โdeceptiveโ behavior" |
3.30 | Epic to use Microsoft's GPT-4 in EHRs |
3.30 | Auto-GPT: An Autonomous GPT-4 Experiment () |
3.29 | AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators ( |
3.29 | nucleotide transformers - genomics LLM, ranging from 500M to 2.5B parameters - () |
3.29 | GeoV-9b - 9 billion parameter causal language model (code, weights, colab) |
3.29 | GPT4All - 7B param language model finetuned from a curated set of 400k GPT-Turbo-3.5 |
3.29 | LLaMA-Adapter!: Efficient Fine-tuning of Language Models with Zero-init Attention |
3.29 | MacGPT 3.2 |
3.29 | GPTEval: NLG Evaluation using GPT-4 with Better Human Alignment ( |
3.29 | TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs ( |
3.28 | Natural Selection Favors AIs over Humans |
3.28 | ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks ( |
3.28 | LLaMA voice chat + Siri TTS |
3.28 | Cerebras-GPT - 111M to 13B parameters trained using the Chinchilla formula |
3.28 | Microsoft Security Copilot: Empowering defenders at the speed of AI |
3.28 | Google pix2struct launched today, a multimodal model specializing in screenshot data |
3.28 | OpenFlamingo - a framework that enables training and evaluation of large multimodal models (LMMs) |
3.27 | Microsoft JARVIS () |
3.27 | ChatGPT Survey: Performance on NLP datasets |
3.27 | GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models ( |
3.26 | Nature Language Reasoning, A Survey ( |
3.26 | Sam Altman: OpenAI CEO on GPT-4, ChatGPT, and the Future of AI - Lex Fridman Podcast #367 |
3.26 | LLaMA voice chat |
3.26 | Japanese Alpaca LoRA |
3.24 | Progressively Optimized Local Radiance Fields for Robust View Synthesis ( |
3.24 | Efficient Methods for Natural Language Processing: A Survey ( |
3.24 | NYT OPINION - You Can Have the Blue Pill or the Red Pill, and Weโre Out of Blue Pills (archive) |
3.24 | Dolly - open source LLM |
3.24 | Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators |
3.24 | ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge ( |
3.24 | Do large language models need sensory grounding for meaning and understanding? @YannLeCun |
3.23 | OpenAI: ChatGPT Plugins |
3.23 | Opera brings AI ChatGPT bot sidebar to browsers |
3.22 | Artificial muses: Generative Artificial Intelligence Chatbots Have Risen to Human-Level Creativity ( |
3.22 | GitHub: Copilot X |
3.22 | Sparks of Artificial General Intelligence: Early experiments with GPT-4 ( |
3.22 | Pause Giant AI Experiments: An Open Letter |
3.21 | WSJ - Generative AI Makes Headway in Healthcare |
3.21 | NVIDIA Brings Generative AI to Worldโs Enterprises |
3.21 | Adobe launches Firefly |
3.21 | Google launches Bard in the US and UK |
3.21 | Microsoft: Bing Image Creator |
3.21 | Stability AI Launches Stable Diffusion Reimagine |
3.20 | Reflexion: an autonomous agent with dynamic memory and self-reflection ( |
3.20 | March 20 ChatGPT outage: Hereโs what happened |
3.20 | Runway Gen-2 |
3.20 | Paper: Capabilities of GPT-4 on Medical Challenge Problems |
3.20 | Making Music with GPT 4 by (Wavtool) |
3.19 | Simple LLM Finetuner () |
3.18 | Data-centric Artificial Intelligence: A Survey ( |
3.17 | Can AI-Generated Text be Reliably Detected? ( |
3.17 | GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models ( |
3.16 | WebSHAP: Towards Explaining Any Machine Learning Models Anywhere ( |
3.16 | LERF: Language Embedded Radiance Fields ( |
3.16 | Microsoft: Microsoft 365 Copilot |
3.16 | Alpaca LoRA: instruct tune LLAMA on consumer hardware |
3.16 | OpenAI CEO Sam Altman says AI will reshape society, acknowledges risks: 'A little bit scared of this' |
3.15 | A new era for AI and Google Workspace |
3.15 | PyTorch 2.0: Our next generation release |
3.15 | Baidu: ERNIE Bot |
3.15 | Midjourney: Midjourney V5 |
3.15 | arXiv - GPT-4 Technical report |
3.14 | The Lancet - Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine |
3.14 | THUDM releases ChatGLM-6B |
3.14 | Langflow - a UI for LangChain () |
3.14 | Anthropic: Claude |
3.14 | Google: PaLM API & Workspace |
3.14 | OpenAI: GPT-4 |
3.13 | Stanford Alpaca 7B |
3.13 | Microsoft lays off team that taught employees how to make AI tools responsibly |
3.13 | MiniLLM: Large Language Models on Consumer GPUs |
3.13 | Chatbot UI ((https://img.shields.io/github/stars/mckaywrigley/chatbot-ui?style=social)) |
3.12 | GM explores using ChatGPT in vehicles |
3.10 | Google: PaLM-E |
3.9 | multi-model playground - https://nat.dev |
3.9 | GPT-4 is coming next week |
3.8 | Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models ( |
3.8 | NYT, Opinion - Noam Chomsky: The False Promise of ChatGPT (archive) |
3.7 | A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT ( |
3.7 | Radiology - The Role and Limitations of Large Language Models Such as ChatGPT in Clinical Settings and Medical Journalism |
3.7 | Stability AI Acquires Image Editing App Clipdrop |
3.6 | Google: Universal Speech Model |
3.5 | Generative AI: Perspectives from Stanford HAI |
3.5 | UpStage, ChatGPT bot (Askup) on Line |
3.5 | UpStage, ChatGPT bot (Askup) on KakaoTalk |
3.2 | Consistency Models ( |
3.1 | OpenAI: ChatGPT and Whisper API |
2.28 | Large Language Models Are State-of-the-Art Evaluators of Translation Quality ( |
2.27 | Best Practices for Using AI When Writing Scientific Manuscripts (ACS Nano 2023, 17, 5, 4091โ4093) |
2.27 | Fighting โWoke AI,โ Musk Recruits Team to Develop OpenAI Rival |
2.25 | The Lancet - The promise of large language models in health care |
2.25 | AugGPT: Leveraging ChatGPT for Text Data Augmentation ( |
2.24 | Sam Altman, Planning for AGI and beyond |
2.24 | Meta: LLaMA |
2.23 | Radiology - ChatGPT and the Future of Medical Writing |
2.23 | Instagram co-founders launch AI-powered news app Artifact on Android, iOS |
2.23 | Notion.AI launch |
2.22 | The alignment problem from a deep learning perspective ( |
2.22 | Microsoft: Bing announcement on mobile and Skype |
2.22 | Science - As scientists explore AI-written text, journals hammer out policies |
2.21 | BadGPT: Exploring Security Vulnerabilities of ChatGPT via Backdoor Attacks to InstructGPT ( |
2.21 | Hyena Hierarchy: Towards Larger Convolutional Language Models ( |
2.21 | The PNAS Journals Outline Their Policies for ChatGPT and Generative AI |
2.21 | ChatGPT: Jack of all trades, master of none ( |
2.17 | Time, ChatGPT cover |
2.17 | OpenAI, Foundry Product Brief |
2.17 | Generative AI on Roblox: Our Vision for the Future of Creation |
2.16 | Do We Still Need Clinical Language Models? ( |
2.16 | Startup Replit launches a ChatGPT-like bot for coders |
2.15 | A&O announces exclusive launch partnership with Harvey |
2.14 | What Is ChatGPT Doing โฆ and Why Does It Work? (Stephen Wolfram Writings) |
2.14 | 1M ChatGPT plus user |
2.14 | The Gen AI Conference Hosted by Jasper |
2.13 | Google: Vision Transformer 22B |
2.12 | Transformer models: an introduction and catalog ( |
2.10 | arXivGPT launches |
2.10 | OpenAI, ChatGPT plus announce (20$) |
2.9 | Disastrous Chatbot Demo Costs Google $140 Billion |
2.9 | Meta: Toolformer |
2.8 | A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity ( |
2.8 | Runway launches ground-breaking Gen-1 video generation AI system |
2.7 | Microsoft: Bing ChatGPT |
2.7 | Getty Images sues AI art generator Stable Diffusion in the US for copyright infringement |
2.6 | The Lancet - ChatGPT: friend or foe? |
2.6 | Google: Bard announcement |
2.4 | Theory of Mind May Have Spontaneously Emerged in Large Language Models ( |
2.4 | POE.com open |
2.3 | Google invests in Anthropic, maker of ChatGPT rival |
2.3 | Naver, SearchGPT announcement |
2.2 | Creating a Large Language Model of a Philosopher ( |
2.2 | ChatGPT reaches 100 million users two months after launch |
2.1 | The Diagnostic and Triage Accuracy of the GPT-3 Artificial Intelligence Model (medrXiv |
2.1 | OpenAI, released a software tool to help identify text generated by AI |
1.31 | JAMA Network - Nonhuman โAuthorsโ and Implications for the Integrity of Scientific Publication and Medical Knowledge |
1.30 | SingSong: Generating musical accompaniments from singing ( |
1.30 | China's biggest search engine is to set launch a ChatGPT rival in March |
1.26 | Science Journal - ChatGPT is fun, but not an author |
1.26 | DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature ( |
1.26 | ChatGPT Is Coming for Classrooms. Don't Panic |
1.26 | ChatGPT passes exams from law and business schools |
1.26 | Googleโs new AI turns text into music - MusicLM |
1.24 | Putting ChatGPT's Medical Advice to the (Turing) Test ( |
1.24 | Nature policy - Tools such as ChatGPT threaten transparent science; here are our ground rules for their use |
1.20 | WAME policy - Chatbots, ChatGPT, and Scholarly Manuscripts |
1.17 | Meet Claude: Anthropicโs Rival to ChatGPT |
1.14 | Microsoft in talks to acquire a 49% stake in ChatGPT owner OpenAI |
1.12 | Multimodal Deep Learning ( |
1.11 | This Voice Doesn't Exist - Generative Voice AI |
1.9 | Microsoft is looking at OpenAIโs GPT for Word, Outlook, and PowerPoint |
1.5 | Apple launches AI-powered book narrations |
1.5 | Microsoft, VALL-E |
1.4 | ICML conference responds to LLM ethics rule |
1.3 | Enter GPTZeo |
2023.01.01 | Collected by Jonghong Jeon ([email protected]) |
12.29 | GPT Takes the Bar Exam ( |
12.27 | bioarXiv - Comparing scientific abstracts generated by ChatGPT to original abstracts using an artificial intelligence output detector, plagiarism detector, and blinded human reviewers |
12. 15 | Constitutional AI: Harmlessness from AI Feedback ( |
11.30 | OpenAI, ChatGPT service |
11.28 | NeurIPS 2022 conference |
11.17 | InstructPix2Pix: Learning to Follow Image Editing Instructions |
11.16 | Holistic Evaluation of Language Models ( |
10.30 | LlamaIndex (GPT Index) GitHub project |
10.23 | LangChain GitHub project |
9.19 | SEQUOIA - Generative AI: A Creative New World |
8.25 | Understanding Diffusion Models: A Unified Perspective ( |
3.29 | Training Compute-Optimal Large Language Models ( |
3.15 | OpenAI, GPT 3.5 announce |
2.11 | Compute Trends Across Three Eras of Machine Learning ( |
2022.01.01 | |
8.16 | On the Opportunities and Risks of Foundation Models ( |
4.18 | The Power of Scale for Parameter-Efficient Prompt Tuning ( |
2021.01.01 | |
Last Modified 2023/04/14 PM19:40 KST |
Additional Links
- LLM Collection
- Open LLM Leaderboard
- AI Incident Database
- Daily papers by AK
- Awesome-Generative-RecSys - A curated list of Generative Recommender Systems (Paper & Code)
- Prompt Engineering Guide - papers -
- awesome-ChatGPT-repositories
- The Rundown
- WEEKLY PAPERS
- Primo.ai LLM wiki
- ML Papers of the Week
- CS 324 - Advances in Foundation Models
- ML timeline
- ChatGPT Timeline
- OpenAI Timeline
- The Rise and Rise of A.I. LLMs