$4 Self-Modifying Coding Agents?! Evan Boyle Breaks Down GenSX

$4 Self-Modifying Coding Agents?! Evan Boyle Breaks Down GenSX

343 Lượt nghe
$4 Self-Modifying Coding Agents?! Evan Boyle Breaks Down GenSX
What if building complex AI agents felt as natural as composing React components—and they could even rewrite their own code? 🤯 In this episode of One Shot / AI Tinkerers, host Joe sits down with Evan Boyle, founder of GenSX, to explore a radically new way to design, run, and ship long-running agent workflows: 🔑 Key takeaways - React-inspired component model for agents – why JSX-style, type-safe functions beat static graphs for scalability and code reuse. - Traces, telemetry & evals baked-in – see every prompt, variable, and LLM call in real time. - $4 self-modifying coding agent – Evan demos an agent that checks out its own repo, refactors 3 K lines, runs tests, and pushes to GitHub… iteratively. - Real-world production use cases – from million-document legal discovery to inbox-wide entity extraction and analytics. - Durable execution & infra shift – why 5-second latencies and massive parallelism are forcing a rethink of serverless, queues, and caching. - Developer experience first – faster dev loops with component-level caching, cursor rules, and LLM “rubber-duck” debugging tricks. 🛠️ Tools & frameworks mentioned GenSX, React/JSX, OpenAI & Anthropic models, Temporal, Pulumi, Cursor, LangChain, LlamaIndex, Crew AI…and more. 🔗 Try GenSX → https://www.gensx.com 💬 Join the community → https://github.com/gensx-inc/gensx 🐦 Follow Evan on X/Twitter → @evanboyle 🙌 Enjoyed the conversation? 👍 Like, 🔔 subscribe, and drop your questions or aha moments in the comments. It helps more builders discover the pod! 📍 Chapters 00:00 Intro & Evan’s background 04:28 Why existing agent frameworks break at scale 12:55 Inside the React-style component model 23:10 Live demo: Hacker News Analyzer (1,000 LLM calls in parallel) 32:45 Tracing, telemetry, and evals 38:20 The self-modifying code agent ($4/iteration) 50:40 Real production agent use cases 59:05 Dev-tooling tips: caching, logging-only debug loops 1:08:30 The future of AI infrastructure & closing thoughts #GenSX #AIAgents #DeveloperExperience #React #SelfModifyingCode #AIWorkflow #OneShotPodcast