12-month outlook anticipates less line-by-line coding and more AI agent orchestration, architectural decisions, and output review from increasingly capable coding tools.
The Collective Intelligence Project builds infrastructure enabling global input into AI system development and governance. The organization combines large-scale deliberation, participatory evaluation, and institutional partnerships. It operates as a small team supported by major foundations including Google.org, Omidyar Network, and Future of Life Foundation, working with AI labs and governments.
About the Role: The position involves building and maintaining full-stack platforms with complex data, visualizations, and user experiences. The core challenge centers on articulating complicated data for mainstream audiences including journalists, academics, and engineers.
Primary focus will be continuing development of Weval (weval.org), an evaluation platform used by AI labs and governments to assess frontier models on questions automated benchmarks cannot address — such as mental health crisis handling, accurate legal advice delivery in Indian languages, and political bias detection.
Secondary work includes Global Dialogues (70+ countries gathering public input on AI), Digital Twin evaluations, and democratic AI governance tool deployments.
12-Month Outlook: The role anticipates less line-by-line coding and more AI agent orchestration, architectural decisions, and output review from increasingly capable coding tools. Value shifts toward system design judgment, quality standards, and managing parallel workstreams with AI execution.
Compensation: $150,000 + health/dental/vision insurance, 403(b), generous PTO Flexible hours, life accommodation, output-focused culture; hybrid in-office/remote on Pacific time.