FeaturesTemplatesShowcaseTownie
AI
BlogDocsPricing
Log inSign up
varun1352
varun1352AlwaysHere_migrated
Remix of stevekrouse/cerebrasTemplate
Public
Like
1
AlwaysHere_migrated
Home
Code
2
README.md
H
main.tsx
Branches
1
Pull requests
Remixes
1
History
Environment variables
Settings
Val Town is a collaborative website to build and scale JavaScript apps.
Deploy APIs, crons, & store data – all from the browser, and deployed in miliseconds.
Sign up now
Code
/
Code
/
Search
main.tsx
https://varun1352--7e3c5f4c278511f0b46e569c3dd06744.web.val.run
README.md

Cerebras Inference template

This val shows you how you can deploy an app using Cerebras Inference on Val Town in seconds.

What is Cerebras?

Cerebras is an American chip manufacturer that produces large wafer chips that deliver mind-blowing LLM inference speeds. As of this writing on Jan 17, 2025, Cerebras Inference provides Llama 3.18b, 3.1 70b, and 3.370b at a jaw-dropping 2k tokens per second – that's 50x faster than what the frontier labs produce.

Llama 3.370b at 2k tokens per second is particularly noteworthy because it is a GPT-4-class model. This level of intelligence at that level of speed will unlock whole new classes of applications.

Quick start

Set up Cerebras

  1. Sign up for Cerebras
  2. Get a Cerebras API Key
  3. Save it in a Val Town environment variable called CEREBRAS_API_KEY

Once Cerebras is set up in your Val Town account, there are two ways to get started:

  1. Fork this app and customize it (or ask Townie AI to customize it)

  2. Start a new chat with Townie AI and copy & paste the following instructions:.

Use Cerebras for AI on the backend like so:

const { OpenAI } = await import("https://esm.sh/openai");
const client = new OpenAI({
  apiKey: Deno.env.get("CEREBRAS_API_KEY"),
  baseURL: "https://api.cerebras.ai/v1"
});
const response = await client.chat.completions.create({
  model: "llama-3.3-70b",
  messages: [],
});
const generatedText = response.choices[0].message.content;

For example, the val in this template was created by asking Townie AI to "Make a chatgpt clone", then I hit shift-enter twice, and then pasted in the instructions on how to use Cerebras from above, then hit enter. Townie built this app on its first try, in about 20 seconds.

Sample apps

  • Cerebras Searcher - a Perplexity clone that uses the SerpAPI to do RAG and summaries with Cerebras (requires a SerpAPI key)
  • Cerebras Coder - an app that generates websites in a second with Cerebras
  • Cerebras Debater - an app that truly shows Cerebras's speed: it's Cerebras talking to Cerebras in a debate
HTTP
  • main.tsx
    varun1352--7e…44.web.val.run
Code
README.md
H
main.tsx
Go to top
X (Twitter)
Discord community
GitHub discussions
YouTube channel
Bluesky
Product
FeaturesPricing
Developers
DocsStatusAPI ExamplesNPM Package Examples
Explore
ShowcaseTemplatesNewest ValsTrending ValsNewsletter
Company
AboutBlogCareersBrandhi@val.town
Terms of usePrivacy policyAbuse contact
© 2025 Val Town, Inc.