AI Studio

Name: Podstack GPU Cloud
Brand: Podstack
SKU: PODSTACK-GPU-CLOUD
Availability: InStock
Rating: 4.9 (180 reviews)

AI Studio provides end-to-end model management capabilities: browse model catalogues, import from Hugging Face, fine-tune models on your data, evaluate performance, and deploy for inference.

Overview

AI Studio includes:

Model Catalogue - Browse available base and fine-tuned models
Hugging Face Integration - Search and import models from Hugging Face Hub
Model Playground - Test models interactively before deploying
Fine-Tuning - Train models using Unsloth or Axolotl engines
Evaluation - Assess model performance with custom datasets and metrics
Deployments - Serve models for inference via API

Model Catalogue

Browsing Models

Navigate to AI Studio > Models to explore:

Base Models Pre-trained foundation models ready for deployment or fine-tuning:

Large language models (LLMs)
Vision models
Multi-modal models
Embedding models

Fine-Tuned Models Your custom-trained models appear here after training completes.

Model Information

Each model card shows:

Name and Description: Model identity and capabilities
Parameters: Model size (7B, 13B, 70B, etc.)
Architecture: Model architecture type
Tags: Categorization labels (text-generation, NLP, computer-vision, etc.)
License: Usage rights (Apache 2.0, MIT, commercial, etc.)
Pricing: Cost per million tokens (input and output separately)
Status: Available, coming soon, or deprecated

Filtering and Search

Search Type model name or keywords to find specific models.

Filter by Tags

text-generation
language-model
computer-vision
NLP
embedding
multi-modal

Filter by Model Type

Base models
Fine-tuned models
Your models

Model Statistics View aggregate stats including total models, active deployments, and fine-tuning jobs.

Hugging Face Integration

Import models directly from the Hugging Face Hub.

Searching Hugging Face

Go to AI Studio > Models
Click Import from Hugging Face
Search by model name, task, or keyword
Browse popular models by category

Importing a Model

Find the model on Hugging Face
Click Import
Podstack validates model access and compatibility
Model is added to your catalogue
Ready for fine-tuning or deployment

Model Categories

Browse Hugging Face models by category:

Text Generation
Text Classification
Question Answering
Summarization
Translation
Image Classification
Object Detection
Audio Processing

Access Validation

For gated models on Hugging Face:

Podstack checks if you have access to the model
You may need to accept the model’s license on Hugging Face first
Provide your Hugging Face token if required

Model Playground

Test models interactively before deploying:

Accessing the Playground

Navigate to a model’s detail page
Click Playground
Or go to AI Studio > Playground and select a model

Playground Features

Chat Interface

Send prompts and receive responses
View conversation history
Clear conversation and start fresh

Parameter Controls

Temperature: Control randomness (0.0 - 2.0)
Max Tokens: Limit response length
Top P: Nucleus sampling parameter
Stop Sequences: Custom stop tokens

Playground Configuration Each model can have a custom playground config with:

Default parameters
System prompt templates
Example prompts

Use Cases

Evaluate model capabilities before deployment
Test prompts and prompt engineering
Compare models for specific tasks
Demo models to stakeholders

Fine-Tuning

Train models on your custom data to improve performance for specific tasks.

Training Engines

Podstack supports multiple training engines:

Unsloth (Recommended for LoRA)

Optimized for fast LoRA fine-tuning
2-5x faster than standard training
Lower memory requirements
Pre-configured templates for popular models
Best for: Quick fine-tuning, LoRA adapters, memory-constrained setups

Axolotl (Full Fine-Tuning)

Full parameter fine-tuning support
Advanced configuration options
Custom training recipes
Best for: Full fine-tuning, advanced users, custom training pipelines

Selecting an Engine

When creating a fine-tuning job, choose your engine
Use Get Recommendation to let Podstack suggest the best engine for your model and use case
View engine capabilities and compatible models

Preparing Data

Dataset Requirements

Format: JSONL, CSV, or supported formats
Quality: Clean, representative data
Size: Minimum samples depend on model and engine

Upload Dataset

Go to AI Studio > Fine-Tuning
Click Upload Dataset
Select your file
Dataset is automatically validated for format and compatibility
View dataset details including sample count, size, and format

Dataset Validation Podstack validates datasets before training:

Schema validation against model requirements
Format checking (JSONL structure, required fields)
Sample preview for verification

Creating a Fine-Tuning Job

Click Create Fine-Tuning Job
Configure:

Model Selection

Choose base model to fine-tune
Or select from Hugging Face quickstart models

Training Engine

Select Unsloth or Axolotl
Or use the recommendation engine

Dataset

Select uploaded training dataset
Optionally select validation dataset

Training Parameters

Parameter	Description
Learning Rate	How fast the model learns (e.g., 1e-5)
Batch Size	Samples per training step
Epochs	Passes through the dataset
Max Sequence Length	Maximum input length
LoRA Rank	LoRA adapter dimension (Unsloth)
LoRA Alpha	LoRA scaling factor (Unsloth)

Unsloth Templates When using Unsloth, select from optimized templates:

Templates are pre-configured for specific model families
Automatic parameter recommendations
Validated configurations that work out-of-the-box

Cost Estimation Before starting, review the estimated training cost based on:

GPU type and count
Estimated training duration
Dataset size and epoch count

Click Start Training

Monitoring Training

Track progress on the fine-tuning page:

Training Status: Queued, running, completed, failed
Current Epoch: Progress through the dataset
Loss Metrics: Training and validation loss curves (real-time via WebSocket)
Training Time: Elapsed and estimated remaining time
Resource Usage: GPU utilization during training

Training Results

When training completes successfully:

Model saved to your catalogue under “My Fine-Tuned Models”
Training metrics and loss curves available for review
Model ready for evaluation or direct deployment
Download option for model weights
Deploy directly to AI Studio or Inference

Managing Training Jobs

View all your training jobs:

Active Jobs: Currently running
Completed Jobs: Successfully finished
Failed Jobs: Jobs that encountered errors

For each job you can:

View detailed logs
View resource usage metrics
Cancel running jobs
Delete completed jobs

Fine-Tuning Analytics

View aggregate analytics across all fine-tuning activity:

Total jobs run
GPU hours consumed
Success/failure rates
Resource utilization trends

Evaluation

Assess how well your models perform with structured evaluation jobs.

Evaluation Datasets

Before running evaluations, prepare your evaluation data:

Upload Dataset

Go to AI Studio > Evaluation
Click Upload Dataset
Select your evaluation file
Dataset is validated for format

Generate Dataset Podstack can help generate evaluation datasets:

Click Generate Dataset
Configure generation parameters
Review and edit generated samples

Dataset Templates Use pre-built templates for common evaluation tasks:

Question-answering evaluation
Summarization evaluation
Classification evaluation
Custom task evaluation

Supported Formats View supported dataset formats for your evaluation type.

Creating an Evaluation Job

Go to AI Studio > Evaluation
Click New Evaluation
Configure:
- Model to evaluate
- Evaluation dataset
- Metrics to compute
Run evaluation

Evaluation Metrics

Depending on task type:

Accuracy: Correct predictions
Perplexity: Language model quality
BLEU/ROUGE: Text generation quality
F1 Score: Precision and recall balance
Custom metrics: Task-specific measures

View all available metric types in the metrics catalog.

Comparing Models

Compare multiple models:

Side-by-side metrics
Performance across datasets
Cost vs. quality tradeoffs

Deployments

Deploy models for inference via API.

Creating a Deployment

Go to AI Studio > Deployments
Click Deploy Model
Configure:
- Model to deploy (base, fine-tuned, or from registry)
- GPU type for inference
- Scaling settings
Click Deploy

Deployment Settings

Setting	Description
Model	Which model to serve
GPU Type	Hardware for inference
Min Replicas	Minimum instances
Max Replicas	Maximum for scaling

Deployment Lifecycle

Deploy: Start serving the model

Model → Deploying → Active

Undeploy: Stop serving without deleting configuration

Active → Undeploying → Inactive

Using Deployed Models

Access via API:

import requests

response = requests.post(
    'https://api.podstack.ai/v1/inference/deployments/<deployment_id>/predict',
    headers={'Authorization': 'Bearer YOUR_TOKEN'},
    json={'prompt': 'Your input text'}
)

print(response.json()['output'])

Batch Predictions Send multiple inputs in a single request:

response = requests.post(
    'https://api.podstack.ai/v1/inference/deployments/<deployment_id>/batch-predict',
    headers={'Authorization': 'Bearer YOUR_TOKEN'},
    json={'inputs': ['Input 1', 'Input 2', 'Input 3']}
)

Managing Deployments

Monitor

Request count and latency metrics
Error rates and status codes
GPU utilization per replica
Token throughput (tokens per second)
Deployment logs

Deployment Actions

Deploy: Start serving the model
Undeploy: Stop serving requests
Update: Change model version or settings
Delete: Permanently remove deployment

Endpoint Details

View deployment information:

Endpoint URL: Full API endpoint for inference
Health Status: Check deployment health
Model Info: Currently deployed model details
Uptime: Time since last deployment/restart

Direct Deployment from Registry

Deploy models directly from the Model Registry:

Go to MLOps > Model Registry
Find a model with a Production stage version
Click Deploy to AI Studio
Configure deployment settings
Model is deployed for inference

Pricing

AI Studio costs include:

Fine-Tuning

GPU time during training
Based on GPU type and duration
Cost estimation available before starting

Inference

Per-token pricing
Varies by model size
Volume discounts available

Storage

Dataset storage
Model storage

Best Practices

Data Preparation

Clean data thoroughly - Remove noise and errors
Balance classes - Avoid skewed distributions
Validate format - Use Podstack’s validation tool
Start small - Test with subset first

Fine-Tuning

Use Unsloth for LoRA - Faster and more memory-efficient
Start with templates - Use pre-configured Unsloth templates
Monitor loss curves - Watch for overfitting via real-time metrics
Use validation set - Track generalization
Review cost estimates - Check estimated cost before starting

Evaluation

Use held-out data - Don’t evaluate on training data
Multiple metrics - Don’t rely on single measure
Compare baselines - Measure improvement over base model
Use dataset templates - Start with structured evaluation formats

Deployment

Test in playground first - Validate before deploying
Start small - Begin with minimal replicas
Monitor usage - Track latency and errors
Scale gradually - Increase based on demand

Feature Availability

AI Studio requires:

Feature flag enabled (REACT_APP_ENABLE_AI_STUDIO)
Sufficient wallet balance for training and inference

Contact support if AI Studio isn’t visible in your portal.

Next Steps

Create a Pod for custom training setups
Set up Storage for datasets
Use the Model Registry to version models
Manage Billing for AI Studio costs