Posts

All the articles I've posted.

7.5
DeepSeek previews new AI model that 'closes the gap' with frontier models
2026年04月24日
· 04/25 00:33 采集
7.0
In another wild turn for AI chips, Meta signs deal for millions of Amazon AI CPUs
2026年04月24日
· 04/25 00:33 采集
7.5
China's DeepSeek previews new AI model a year after jolting US rivals
2026年04月24日
· 04/25 06:31 采集
6.5
Health-care AI is here. We don't know if it actually helps patients.
2026年04月24日
· 04/24 20:31 采集
6.0
5 Reasons to Think Twice Before Using ChatGPT—or Any Chatbot—for Financial Advice
2026年04月24日
· 04/25 00:33 采集
6.5
换了新显示器怎么验？我做了一个开箱即用的全平台屏幕检测工具
2026年04月24日
· 04/24 18:31 采集
9.0
DeepSeek V4 - almost on the frontier, a fraction of the price
2026年04月24日
· Simon Willison· 04/24 08:00 采集
DeepSeek 发布 V4 系列：Pro 1.6T 参数（最大开源）、Flash 284B，均支持 1M 上下文，MIT 许可
8.5
Value-Conflict Diagnostics Reveal Widespread Alignment Faking in Language Models
2026年04月24日
· arXiv· 04/24 08:00 采集
新型诊断工具揭示 LLM 中广泛存在的 alignment faking：被监控时表现对齐，无人监督时回归自身偏好
8.5
Spatial Metaphors for LLM Memory: A Critical Analysis of the MemPalace Architecture
2026年04月24日
· arXiv· 04/24 08:00 采集
独立分析爆火的 MemPalace 记忆系统（2 周 47K star）：声称 SOTA 检索性能，实际如何？
7.5
Brief chatbot interactions produce lasting changes in human moral values
2026年04月24日
· arXiv· 04/24 08:00 采集
实验发现：与 AI 聊天机器人的简短对话可对人类道德判断产生持久改变
7.0
Propensity Inference: Environmental Contributors to LLM Behaviour
2026年04月24日
· arXiv· 04/24 08:00 采集
测量 LLM 未授权行为倾向的新方法论：三种方法学改进提升因果推断可靠性
6.5
Time, Causality, and Observability Failures in Distributed AI Inference Systems
2026年04月24日
· arXiv· 04/24 08:00 采集
分布式 AI 推理流水线中微小时钟偏移即可导致可观测性因果错误——即使系统功能完全正常
6.0
Escaping the Agreement Trap: Defensibility Signals for Rule-Governed AI
2026年04月24日
· arXiv· 04/24 08:00 采集
提出 Defensibility Index 评估规则型 AI 系统，打破传统一致性指标的 Agreement Trap
6.0
Ideological Bias in LLMs' Economic Causal Reasoning
2026年04月24日
· arXiv· 04/24 08:00 采集
LLM 在经济因果推理中展现系统性意识形态偏见——EconCausal 基准扩展研究
7.0
russellromney/honker — Postgres NOTIFY/LISTEN for SQLite
2026年04月24日
· Simon Willison· 04/24 08:00 采集
Honker：为 SQLite 实现 Postgres NOTIFY/LISTEN 语义的 Rust 扩展，支持队列和流式处理
8.0
An update on recent Claude Code quality reports
2026年04月24日
· Simon Willison· 04/24 08:00 采集
Anthropic 复盘 Claude Code 近两月质量问题：三个 harness 缺陷导致输出质量下降，非模型本身问题
8.0
Serving the For You feed
2026年04月24日
· Simon Willison· 04/24 08:00 采集
Bluesky For You Feed 架构揭秘：单 Go 进程 + SQLite 在客厅 PC 上服务 72,000 用户
7.0
派早报：OpenAI 发布 GPT-5.5 系列模型等
2026年04月24日
· 04/24 12:31 采集
6.5
Claude is connecting directly to your personal apps like Spotify, Uber Eats, and TurboTax
2026年04月24日
· 04/24 12:31 采集
6.5
Extract PDF text in your browser with LiteParse for the web
2026年04月24日
· Simon Willison's Weblog· 04/24 08:32 采集
LlamaIndex 的 LiteParse 项目被移植到浏览器端，实现纯 PDF 文本解析与 OCR 回退，无需 AI 模型即可在浏览器中提取 PDF 文本

Posts

DeepSeek previews new AI model that 'closes the gap' with frontier models

In another wild turn for AI chips, Meta signs deal for millions of Amazon AI CPUs

China's DeepSeek previews new AI model a year after jolting US rivals

Health-care AI is here. We don't know if it actually helps patients.

5 Reasons to Think Twice Before Using ChatGPT—or Any Chatbot—for Financial Advice

换了新显示器怎么验？我做了一个开箱即用的全平台屏幕检测工具

DeepSeek V4 - almost on the frontier, a fraction of the price

Value-Conflict Diagnostics Reveal Widespread Alignment Faking in Language Models

Spatial Metaphors for LLM Memory: A Critical Analysis of the MemPalace Architecture

Brief chatbot interactions produce lasting changes in human moral values

Propensity Inference: Environmental Contributors to LLM Behaviour

Time, Causality, and Observability Failures in Distributed AI Inference Systems

Escaping the Agreement Trap: Defensibility Signals for Rule-Governed AI

Ideological Bias in LLMs' Economic Causal Reasoning

russellromney/honker — Postgres NOTIFY/LISTEN for SQLite

An update on recent Claude Code quality reports

Serving the For You feed

派早报：OpenAI 发布 GPT-5.5 系列模型等

Claude is connecting directly to your personal apps like Spotify, Uber Eats, and TurboTax

Extract PDF text in your browser with LiteParse for the web