The ChatGPT Paradox: Impressive Yet Incomplete

39.836 Lượt nghe
00:00
Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin.
Tải MP3
MÔ TẢ MP3TIẾP THEO
The ChatGPT Paradox: Impressive Yet Incomplete
Prof. Thomas G. Dietterich discusses the current state of large language models like ChatGPT. He explains their capabilities and limitations, emphasizing their statistical nature and tendency to hallucinate. Dietterich explores the challenges in uncertainty quantification for these models and proposes integrating them with formal reasoning systems. He advocates for more robust knowledge representation methods, such as knowledge graphs, and discusses the importance of safety in AI development. Dietterich also touches on the changing landscape of academic AI research and the potential of open-source models to accelerate innovation in the field.

MLST is sponsored by Tufa Labs:
Are you interested in working on ARC and cutting-edge AI research with the MindsAI team (current ARC winners)?
Focus: ARC, LLMs, test-time-compute, active inference, system2 reasoning, and more.
Future plans: Expanding to complex environments like Warcraft 2 and Starcraft 2.
Interested? Apply for an ML research position: [email protected]

Thomas G. Dietterich
Distinguished Professor (Emeritus), Computer Science, Oregon State University
https://scholar.google.com/citations?user=09kJn28AAAAJ&hl=en
https://medium.com/@tdietterich
https://x.com/tdietterich

TOC:
00:00 LLMs: Capabilities, Limitations, and Statistical Nature
09:25 Uncertainty Quantification in Large Language Models
15:49 Integrating Reasoning and Formal Systems with LLMs
19:39 Knowledge Structures and Future AI Architectures
20:11 Knowledge Extraction and Representation in AI
23:51 Challenges in LLM-based Knowledge Systems
33:12 Scientific Publishing and Knowledge Graphs
37:14 AI Safety and Truth in Knowledge Representation
39:56 AI Regulation and Safety Engineering
41:57 Challenges in AI Perception and Novelty Detection
50:04 Verification and Safety in Complex AI Systems
59:23 Open-Source Models and Academic AI Research
03:16 Limitations of Transformers and Future AI Architectures


REFS:
00:16 Meta releases the biggest and best open-source AI model yet, https://www.theverge.com/2024/7/23/24204055/meta-ai-llama-3-1-open-source-assistant-openai-chatgpt
00:27 Tamper-Resistant Safeguards for Open-Weight LLMs, https://arxiv.org/pdf/2408.00761
00:41 Identifying latent disease factors, https://arxiv.org/pdf/2410.07890
00:53 EY position paper on AI, https://www.ey.com/en_ch/news/2023/12/new-ey-position-paper-on-artificial-intelligence
00:53 RAG for Knowledge-Intensive NLP Tasks, https://arxiv.org/pdf/2005.11401
00:55 AI's international research networks mapped, https://www.nature.com/articles/d41586-024-02986-2
01:15 A Survey of Large Language Models, https://arxiv.org/pdf/2303.18223
03:30 Embers of Autoregression, https://arxiv.org/pdf/2309.13638
07:33 GPT-4 Technical Report, https://cdn.openai.com/papers/gpt-4.pdf
10:52 LM-Polygraph, https://arxiv.org/pdf/2311.07383
14:16 Shifting Attention to Relevance, https://arxiv.org/pdf/2307.01379
16:00 Retrieval-Augmented Generation, https://arxiv.org/pdf/2005.11401
20:34 Baldur: Whole-Proof Generation, https://arxiv.org/pdf/2303.04910
22:11 Never-Ending Learning, https://www.cs.cmu.edu/~tom/pubs/NELL_aaai15.pdf
25:36 Machine Unlearning, https://arxiv.org/pdf/2404.01206
36:09 Papers with code or without code, https://www.sciencedirect.com/science/article/abs/pii/S0306457323002145
39:12 CYC: a large-scale investment in knowledge infrastructure, https://dl.acm.org/doi/10.1145/219717.219745
40:23 Community Notes, https://communitynotes.x.com/guide/en/about/introduction
41:39 REGULATION (EU), https://eur-lex.europa.eu/eli/reg/2024/1689/oj
43:44 Open Category Problem, https://futureoflife.org/ai-researcher-profile/ai-researcher-thomas-dietterich/
50:38 BO as Assisstant, https://dl.acm.org/doi/pdf/10.1145/3526113.3545664
50:59 Testing of Autonomous Vehicles, https://research-assets.waabi.ai/GUARD/paper.pdf
54:02 Engineering a Safer World, https://mitpress.mit.edu/9780262533690/engineering-a-safer-world/
54:50 The nature of the Internet, https://www.pnas.org/doi/epdf/10.1073/pnas.0501426102
58:43 Anomaly and Novelty Detection, https://dl.acm.org/doi/10.1145/3447548.3469453
59:06 Causality 2nd Edition, https://www.amazon.ca/Causality-Judea-Pearl/dp/052189560X
03:38 LLaMA: Open and Efficient Foundation Language Models, https://arxiv.org/pdf/2302.13971					
The ChatGPT Paradox: Impressive Yet Incomplete

Nhạc Theo Chủ Đề

Liên kết website