Serverless GPU

Name: Podstack GPU Cloud
Brand: Podstack
SKU: PODSTACK-GPU-CLOUD
Availability: InStock
Rating: 4.9 (180 reviews)

Serverless GPU provides instant notebook environments with GPU access. Start coding immediately without configuring pods or infrastructure. Includes reactive notebooks with DAG-based execution, checkpoints, and cost tracking.

What is Serverless GPU?

Serverless GPU is a managed notebook environment that:

Starts in seconds
Pre-configured with ML libraries
Automatic resource management
Pay only for active time
No infrastructure to manage
Reactive cell execution with dependency tracking

Getting Started

Create a Notebook

Navigate to Serverless GPU
Click Create Notebook
Configure:
- Name: Notebook identifier
- GPU Type: Select from available GPUs (view pricing per type)
- Provider: Select compute provider
- Idle Shutdown: Auto-stop after inactivity (optional)
Click Create

GPU Selection

Browse available GPU types with pricing:

View hourly rates per GPU type
Check real-time availability
Compare GPU specifications
Estimate costs before creating

Idle Shutdown Configuration

Configure automatic shutdown to save costs:

Idle Timeout Options

Never: Notebook runs until manually stopped
30 minutes: Stops after 30 minutes of inactivity
1 hour: Stops after 1 hour of inactivity
2 hours: Stops after 2 hours of inactivity
4 hours: Stops after 4 hours of inactivity

What Counts as Activity

Executing code cells
Interacting with the notebook interface
Terminal activity
File operations

Before Auto-Shutdown

Warning notification displayed
Chance to cancel the shutdown
Save all work before timeout

Access Your Notebook

Wait for status to show Running
Click the notebook link
The notebook editor opens
Start coding!

Notebook Editor

Cell Types

The notebook supports multiple cell types:

Code Cells: Execute Python code with GPU access
Markdown Cells: Add formatted documentation
Rich Text Cells: Formatted text with editing toolbar

Cell Operations

Add Cell: Insert code, markdown, or rich text cells
Delete Cell: Remove a cell and its output
Reorder Cells: Drag and drop to rearrange
Execute Cell: Run individual cells or all cells
Clear Output: Clear a cell’s output or all outputs
Cancel Execution: Stop a running cell

Code Execution

# Cells execute on the remote GPU
import torch
print(f"GPU: {torch.cuda.get_device_name(0)}")

# Full ML workflow support
model = AutoModel.from_pretrained("bert-base")
model.to("cuda")

Cell Output

Cells display rich output:

Text output and print statements
Tables and DataFrames
Matplotlib/Seaborn plots
Images and visualizations
Error tracebacks with formatting

Reactive Execution (DAG)

Notebooks support reactive execution based on cell dependencies.

How It Works

Podstack automatically tracks variable dependencies between cells. When you modify a cell, all dependent cells are marked as “dirty” and can be re-executed automatically.

DAG View

Visualize cell dependencies:

Click the DAG tab in the notebook
See a directed acyclic graph showing which cells depend on which
Identify execution order and dependency chains

Dirty Cell Detection

When you edit a cell:

Downstream cells that depend on its variables are marked as dirty
Click Execute Dirty Cells to re-run only affected cells
Saves time by skipping cells that don’t need re-execution

Cell Analysis

Analyze individual cells or the entire notebook:

View which variables a cell reads and writes
See upstream and downstream dependencies
Identify potential circular dependencies

Checkpoints (Time-Travel)

Save and restore notebook state at any point.

Creating Checkpoints

Click Create Checkpoint in the notebook toolbar
Enter a descriptive name
The current state of all cells, outputs, and variables is saved

Restoring Checkpoints

Open the Checkpoints panel
Browse saved checkpoints by name and timestamp
Click Restore to return to that state
All cells and outputs revert to the checkpoint state

Managing Checkpoints

View all checkpoints with timestamps
Delete old checkpoints to save space
Compare checkpoint states

Timeline

View the execution history of your notebook:

Chronological list of all cell executions
Execution timestamps and durations
Success/failure status for each execution
Navigate back to specific points in history

Variable Inspector

Track variables across your notebook:

View all defined variables and their current values
See variable types and sizes
Track which cells define and use each variable
Identify stale variables from cleared cells

Cost Tracking

Monitor notebook costs in real-time:

Cost Breakdown

Click the Cost tab in the notebook
View costs broken down by:
- GPU time
- Compute time per cell
- Total session cost
Track spending trends over time

Per-Cell Costs

Each cell execution shows its compute cost, helping you identify expensive operations and optimize your workflow.

Volume Mounts

Attach persistent storage to your notebook:

Adding Volumes

Go to notebook settings
Click Add Volume
Select from your existing NFS volumes
Specify the mount path
Volume is available in the notebook filesystem

Removing Volumes

Go to notebook settings
Click the remove icon on the volume
Volume is detached (data is preserved)

Dataset Integration

Attach datasets directly to your notebook:

Attaching Datasets

Go to notebook settings
Click Attach Dataset
Browse available datasets from your project
Dataset is mounted and accessible in your notebook

Using Datasets

import pandas as pd
# Datasets are mounted at the configured path
df = pd.read_csv("/data/my-dataset/train.csv")

Import and Export

Import Notebooks

Import existing notebooks:

From .ipynb: Upload Jupyter notebook files
From URL: Import from a public URL

Export Notebooks

Export your work in multiple formats:

.ipynb: Standard Jupyter format
PDF: For sharing and documentation
HTML: For web viewing

Replay

Replay your notebook execution:

Click Replay in the toolbar
Watch cells execute in order
View outputs as they were generated
Useful for demos and debugging

Managing Notebooks

Viewing Notebooks

The Serverless GPU page shows:

All your notebooks
Current status
GPU allocation
Running time
Cost to date

Starting a Notebook

For stopped notebooks:

Click Start
Wait for Running status
Access via notebook link

Stopping a Notebook

To pause and save costs:

Click Stop
Notebook enters Stopped state
Billing pauses

Note: Unsaved work in memory is lost. Create a checkpoint or save files before stopping.

Deleting a Notebook

To permanently remove:

Click Delete
Confirm deletion
All files, cells, and checkpoints are removed

Notebook Lifecycle

Status States

Status	Description
Creating	Notebook being provisioned
Starting	GPU being allocated and notebook starting
Running	Active and ready to use
Stopping	Shutting down
Stopped	Inactive, can be restarted
Failed	Error occurred, check logs

Real-Time Updates

Notebook status updates automatically via WebSocket:

Execution progress
Resource usage
Status changes
Cell output streaming

No need to refresh the page.

Pre-installed Libraries

Common ML libraries ready to use:

PyTorch
TensorFlow
Transformers
NumPy, Pandas, Scikit-learn
Matplotlib, Seaborn
Jupyter extensions

Installing Additional Packages

From a code cell:

!pip install package_name

Or use the built-in terminal.

Packages persist within the session until the notebook is stopped.

Web Terminal

Access terminal directly:

Find your notebook
Click Terminal
Browser terminal opens

Use for:

Installing packages
Running scripts
System commands
File operations

GPU Access

Verify GPU availability:

import torch
print(f"GPU Available: {torch.cuda.is_available()}")
print(f"GPU Name: {torch.cuda.get_device_name(0)}")
print(f"GPU Memory: {torch.cuda.get_device_properties(0).total_mem / 1e9:.1f} GB")

Use Cases

Quick Experiments

Test ideas without setup:

# Instant GPU access
model = AutoModel.from_pretrained("bert-base")
model.to("cuda")

Data Exploration

Analyze datasets interactively:

import pandas as pd
df = pd.read_csv("data.csv")
df.describe()

Prototyping

Build proof-of-concepts with reactive execution - modify a cell and all dependent cells update automatically.

Learning

Practice ML concepts with instant GPU access and pre-installed libraries.

Billing

Serverless GPU is billed:

Per-second when running
Based on GPU type
No charge when stopped
View real-time cost breakdown in the notebook

Monitor costs:

Check the cost breakdown tab in the notebook
View running time in notebook list
View spending in wallet

Limitations

Compared to full pods:

Fixed GPU configurations per provider
No custom Docker images
No direct SSH access (use web terminal)
Single GPU typically

For advanced needs, use Pods.

AI Assistant

Each notebook has an AI Assistant side panel (sparkle icon) that can read your dataset context, suggest cells, and edit cells in place.

Conversational Authoring

Type a prompt into the AI panel and the assistant generates one or more cells in response. Generated cells appear inline with an approve / reject affordance so nothing executes without your consent.

Dataset-Aware Generation

If you’ve attached datasets to the notebook (via the Resource Picker), the AI panel surfaces a dataset chip showing what data the AI can see, plus a Generate from dataset jumpstart button for empty notebooks. The jumpstart produces a multi-cell scaffold — load → inspect → visualize → summarize — instead of a single dump cell.

Per-Cell AI Actions

Each cell has two AI buttons in its header:

Fix with AI — diagnose and rewrite a cell that errored. Reads the traceback, proposes a fix, and shows a diff before applying.
Optimize with AI — refactor a working cell for a specific goal. Pick the optimization target (speed, memory, readability, GPU efficiency) from the popover.

Auto Mode

When enabled, the AI can chain multiple cell edits in response to a single prompt without per-edit approval — useful for “scaffold an entire pipeline” prompts. A status indicator shows when auto mode is active.

Pod Templates for Cells

Reactive notebooks let each cell run on a different pod spec. Pod Templates (/notebooks/pod-templates) are reusable specs — image, CPU/memory/GPU, env vars, startup command — that you can attach to specific cells.

Use cases:

Run a single GPU-heavy training cell on an H100 while keeping the rest of the notebook on CPU
Pin a cell to a custom image with non-standard dependencies
Share a template across teammates so everyone runs the cell with the same spec

Create, edit, and delete templates from the Pod Templates page. Attach a template to a cell via the cell’s resource picker.

Image-as-Code Builds

Notebooks support image-as-code via podstack.Image.build() — declare your dependencies in Python, and the platform builds a Docker image for you to use as a cell runner.

Build History

The Image Builds page (/notebooks/image-builds) shows every build submitted from this project. The list auto-refreshes every 3 seconds while builds are in flight.

Each row shows:

Status (queued, building, success, failed)
Base image
A content hash of the Dockerfile (so identical builds dedupe)
Queued timestamp and duration
The pushed image URI

Using a Built Image

Once a build succeeds, copy its URI and pass it to @podstack.cell(runner_image=...) to run that cell on the custom image. The same image can be reused across cells and notebooks.

Real-Time Collaboration

When REACT_APP_ENABLE_COLLABORATION=true, notebooks support multi-user editing via Y.js:

A Presence Bar at the top shows everyone currently in the notebook.
Cell content syncs character-by-character; cursor positions are broadcast.
The platform seeds the shared Y.Text from the server, so opening a notebook never produces duplicated cell content.
Self-echo is skipped in cell_updated broadcasts so your cursor stays stable while typing.

Deploy from Notebook

The notebook toolbar has a Deploy button that packages the notebook into a serving artifact and sends it through the deployment flow. Use this when a notebook has matured into a callable endpoint.

Best Practices

Save Work Frequently

Create checkpoints before major changes
Export notebooks in .ipynb format for backup
Attach NFS volumes for persistent data

Use Reactive Execution

Let the DAG track dependencies automatically
Use “Execute Dirty Cells” instead of re-running everything
Keep cells focused on single operations for better reactivity

Cost Management

Monitor the cost breakdown tab
Stop notebooks when taking breaks
Use idle shutdown to prevent runaway costs
Choose the right GPU for your workload

Use for Right Workloads

Good for:

Experimentation
Quick tasks
Learning
Prototyping
Data exploration

Better with Pods:

Long training jobs
Production workloads
Custom environments
Multi-GPU needs

Next Steps

Learn about AI Studio for model management
Use Pods for more control
Explore MLOps for experiment tracking