Files
deltaglider-beshu-tech/docs/sdk/getting-started.md
Simone Scarduzio edcbd2c7d0 Add simplified SDK client API and comprehensive documentation
- Create DeltaGliderClient with user-friendly interface
- Add create_client() factory function with sensible defaults
- Implement UploadSummary dataclass with helpful properties
- Expose simplified API through main package
- Add comprehensive SDK documentation under docs/sdk/:
  - Getting started guide with installation and examples
  - Complete API reference documentation
  - Real-world usage examples for 8 common scenarios
  - Architecture deep dive explaining how DeltaGlider works
  - Automatic documentation generation scripts
- Update CONTRIBUTING.md with SDK documentation guidelines
- All tests pass and code quality checks succeed

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-09-23 13:44:38 +02:00

5.5 KiB

Getting Started with DeltaGlider SDK

This guide will help you get up and running with the DeltaGlider Python SDK in minutes.

Prerequisites

  • Python 3.11 or higher
  • AWS credentials configured (or access to MinIO/S3-compatible storage)
  • xdelta3 installed on your system (installed automatically with the package)

Installation

Using pip

pip install deltaglider

Using uv (faster)

uv pip install deltaglider

Development Installation

git clone https://github.com/beshu-tech/deltaglider
cd deltaglider
pip install -e ".[dev]"

Configuration

AWS Credentials

DeltaGlider uses standard AWS credential discovery:

  1. Environment Variables
export AWS_ACCESS_KEY_ID=your_access_key
export AWS_SECRET_ACCESS_KEY=your_secret_key
export AWS_DEFAULT_REGION=us-west-2
  1. AWS Credentials File (~/.aws/credentials)
[default]
aws_access_key_id = your_access_key
aws_secret_access_key = your_secret_key
region = us-west-2
  1. IAM Role (when running on EC2/ECS/Lambda) Automatically uses instance/task role credentials

Custom S3 Endpoints

For MinIO, Cloudflare R2, or other S3-compatible storage:

from deltaglider import create_client

client = create_client(endpoint_url="http://minio.local:9000")

Or via environment variable:

export AWS_ENDPOINT_URL=http://minio.local:9000

Your First Upload

Basic Example

from deltaglider import create_client

# Create a client
client = create_client()

# Upload a file
summary = client.upload(
    file_path="my-app-v1.0.0.zip",
    s3_url="s3://my-bucket/releases/v1.0.0/"
)

# Check the results
print(f"Upload completed!")
print(f"Original size: {summary.original_size_mb:.1f} MB")
print(f"Stored size: {summary.stored_size_mb:.1f} MB")
print(f"Compression: {summary.savings_percent:.0f}%")
print(f"Is delta: {summary.is_delta}")

Understanding the Results

When you upload a file, DeltaGlider returns an UploadSummary with:

  • operation: What was done (PUT for new reference, PUT_DELTA for delta)
  • original_size_mb: Original file size in MB
  • stored_size_mb: Actual size stored in S3
  • savings_percent: Percentage of storage saved
  • is_delta: Whether delta compression was used
  • delta_ratio: Ratio of delta size to original (smaller is better)

Downloading Files

# Download a file
client.download(
    s3_url="s3://my-bucket/releases/v1.0.0/my-app-v1.0.0.zip",
    output_path="downloaded-app.zip"
)

# The file is automatically reconstructed if it was stored as a delta

Working with Multiple Versions

Here's where DeltaGlider shines - uploading multiple versions:

from deltaglider import create_client
from pathlib import Path

client = create_client()

# Upload multiple versions
versions = ["v1.0.0", "v1.0.1", "v1.0.2", "v1.1.0"]

for version in versions:
    file = f"builds/my-app-{version}.zip"

    summary = client.upload(
        file_path=file,
        s3_url=f"s3://releases/{version}/"
    )

    if summary.is_delta:
        print(f"{version}: Compressed to {summary.stored_size_mb:.1f}MB "
              f"(saved {summary.savings_percent:.0f}%)")
    else:
        print(f"{version}: Stored as reference ({summary.original_size_mb:.1f}MB)")

# Typical output:
# v1.0.0: Stored as reference (100.0MB)
# v1.0.1: Compressed to 0.2MB (saved 99.8%)
# v1.0.2: Compressed to 0.3MB (saved 99.7%)
# v1.1.0: Compressed to 5.2MB (saved 94.8%)

Verification

Verify the integrity of stored files:

# Verify a stored file
is_valid = client.verify("s3://releases/v1.0.0/my-app-v1.0.0.zip")
print(f"File integrity: {'✓ Valid' if is_valid else '✗ Corrupted'}")

Error Handling

from deltaglider import create_client

client = create_client()

try:
    summary = client.upload("app.zip", "s3://bucket/path/")
except FileNotFoundError:
    print("Local file not found")
except PermissionError:
    print("S3 access denied - check credentials")
except Exception as e:
    print(f"Upload failed: {e}")

Logging

Control logging verbosity:

# Debug logging for troubleshooting
client = create_client(log_level="DEBUG")

# Quiet mode
client = create_client(log_level="WARNING")

# Default is INFO
client = create_client()  # INFO level

Local Testing with MinIO

For development and testing without AWS:

  1. Start MinIO
docker run -p 9000:9000 -p 9001:9001 \
  -e MINIO_ROOT_USER=minioadmin \
  -e MINIO_ROOT_PASSWORD=minioadmin \
  minio/minio server /data --console-address ":9001"
  1. Create a bucket (via MinIO Console at http://localhost:9001)

  2. Use DeltaGlider

from deltaglider import create_client

client = create_client(
    endpoint_url="http://localhost:9000"
)

# Set credentials via environment
import os
os.environ["AWS_ACCESS_KEY_ID"] = "minioadmin"
os.environ["AWS_SECRET_ACCESS_KEY"] = "minioadmin"

# Now use normally
summary = client.upload("test.zip", "s3://test-bucket/")

Best Practices

  1. Group Similar Files: Upload related files to the same S3 prefix for optimal compression
  2. Version Naming: Use consistent naming for versions (e.g., app-v1.0.0.zip, app-v1.0.1.zip)
  3. Cache Management: The local reference cache improves performance - don't clear it unnecessarily
  4. Error Recovery: Always handle exceptions for production code
  5. Monitoring: Log compression ratios to track effectiveness

Next Steps