Serverless GPU
Serverless GPU provides instant Jupyter notebook environments with GPU access. Start coding immediately without configuring pods or infrastructure.
What is Serverless GPU?
Serverless GPU is a managed JupyterHub environment that:
- Starts in seconds
- Comes pre-configured with common ML libraries
- Manages resources automatically
- Bills only for active time
- Requires no infrastructure management
Getting Started
Create a Notebook
- Navigate to Serverless GPU
- Click Create Notebook
- Configure:
  - Name: a notebook identifier
  - GPU Type: select an available GPU
- Click Create
Access Your Notebook
- Wait for status to show Running
- Click the notebook link
- JupyterLab opens in a new tab
- Start coding!
Notebook Features
Pre-installed Libraries
Common ML libraries ready to use:
- PyTorch
- TensorFlow
- Transformers
- NumPy, Pandas, Scikit-learn
- Matplotlib, Seaborn
- Jupyter extensions
GPU Access
Verify GPU availability (guarding the name lookup so the cell does not error when no GPU is attached):

```python
import torch

print(f"GPU Available: {torch.cuda.is_available()}")
if torch.cuda.is_available():
    print(f"GPU Name: {torch.cuda.get_device_name(0)}")
```
File Storage
Files persist within your notebook session:
- /home/jovyan/ is the home directory
- Upload files via the JupyterLab UI
- Download results before stopping
Managing Notebooks
Viewing Notebooks
The Serverless GPU page shows:
- All your notebooks
- Current status
- GPU allocation
- Running time
Starting a Notebook
For stopped notebooks:
- Click Start
- Wait for Running status
- Access via notebook link
Stopping a Notebook
To pause and save costs:
- Click Stop
- Notebook enters Stopped state
- Billing pauses
Note: Unsaved work in memory is lost. Save files before stopping.
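Before stopping, anything held only in memory can be flushed to a file; a minimal sketch (the results dict and filename are hypothetical):

```python
import json
from pathlib import Path

# Hypothetical in-memory results from the current session
results = {"accuracy": 0.93, "epochs": 5}

# Write them to disk so they are on the notebook's storage before stopping
out = Path("results.json")
out.write_text(json.dumps(results, indent=2))
print(f"Saved {out.resolve()}")
```

The saved file can then be downloaded via the JupyterLab file browser before the notebook is stopped.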
Deleting a Notebook
To permanently remove:
- Click Delete
- Confirm deletion
- All files are removed
Web Terminal
Access a terminal directly:
- Find your notebook
- Click Terminal
- Browser terminal opens
Use for:
- Installing packages
- Running scripts
- System commands
Real-Time Updates
Notebook status updates automatically via WebSocket:
- Execution progress
- Resource usage
- Status changes
No need to refresh the page.
Use Cases
Quick Experiments
Test ideas without setup:
```python
# Instant GPU access
from transformers import AutoModel

model = AutoModel.from_pretrained("bert-base-uncased")
model.to("cuda")
```
Data Exploration
Analyze datasets interactively:
```python
import pandas as pd

df = pd.read_csv("data.csv")
df.describe()
```
Prototyping
Build proof-of-concepts:
```python
# Quick model training (X_train, y_train, X_test are your own data)
model.fit(X_train, y_train)
predictions = model.predict(X_test)
```
Learning
Practice ML concepts:
- Follow tutorials
- Experiment with models
- Learn new libraries
Installing Additional Packages
Using pip

```
!pip install package_name
```

Using conda

```
conda install package_name
```

Requirements File

```
!pip install -r requirements.txt
```

Installed packages persist only until the notebook is stopped; reinstall them after a restart.
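A cell can also guard against reinstalling something that is already present; a small sketch (the `ensure_installed` helper is ours, and `"json"` stands in for a real package name):

```python
import importlib.util
import subprocess
import sys

def ensure_installed(package: str) -> None:
    """Run pip only if the package is not already importable.

    Note: this assumes the pip name matches the import name, which
    is not always true (e.g. scikit-learn imports as sklearn).
    """
    if importlib.util.find_spec(package) is None:
        subprocess.check_call([sys.executable, "-m", "pip", "install", package])

ensure_installed("json")  # already importable, so nothing is installed
```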
Data Transfer
Upload Files
- Use JupyterLab file browser
- Click upload button
- Select files
- Files appear in current directory
Download Files
- Right-click file in browser
- Select Download
- File downloads to your computer
From Object Storage
```python
import boto3

s3 = boto3.client(
    "s3",
    endpoint_url="https://s3.podstack.ai",
    aws_access_key_id="YOUR_KEY",
    aws_secret_access_key="YOUR_SECRET",
)
s3.download_file("bucket", "data.csv", "local_data.csv")
```
Billing
Serverless GPU is billed:
- Per second while the notebook is running
- At a rate based on GPU type
- Not at all when stopped
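Per-second billing makes rough cost estimates straightforward; a quick sketch (the $1.80/hour rate is hypothetical, not a published price):

```python
def estimate_cost(seconds_running: float, hourly_rate: float) -> float:
    """Prorate an hourly GPU rate down to per-second billing."""
    return seconds_running * hourly_rate / 3600

# 30 minutes on a hypothetical $1.80/hour GPU
print(f"${estimate_cost(1800, 1.80):.2f}")  # → $0.90
```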
Monitor costs:
- Check running time in notebook list
- View spending in wallet
Limitations
Compared to full pods:
- Fixed resource configurations
- No custom Docker images
- No SSH access
- Limited persistence
- Typically limited to a single GPU
For advanced needs, use Pods.
Best Practices
Save Work Frequently
- Save notebooks often
- Download important results
- Use object storage for persistence
Stop When Idle
- Stop notebooks when taking breaks
- Billing continues while running
- Data on disk is preserved
Use for Right Workloads
Good for:
- Experimentation
- Quick tasks
- Learning
- Prototyping
Better with Pods:
- Long training jobs
- Production workloads
- Custom environments
- Multi-GPU needs