
Pack-A-Mal Development Environment

Docker Compose-based development environment for Pack-A-Mal with separate services for backend, frontend, and database.

Architecture

┌─────────────────────────────────────────────────────────┐
│                    Nginx (Port 8080)                    │
│            Static Files Serving + Reverse Proxy         │
└──────────────────────┬──────────────────────────────────┘
                       │
                       ↓
┌─────────────────────────────────────────────────────────┐
│              Django Backend (Port 8001)                 │
│              Pack-A-Mal Web Application                 │
└──────────────────────┬──────────────────────────────────┘
                       │
                       ↓
┌─────────────────────────────────────────────────────────┐
│            PostgreSQL Database (Port 5433)              │
│                   packamal_dev                          │
└─────────────────────────────────────────────────────────┘

Table of Contents

  1. Local Testing with Docker Compose
  2. Local Testing with Kubernetes (Minikube)
  3. Production Deployment on Azure Kubernetes Service (AKS)
  4. CI/CD with GitHub Actions
  5. Task Status API and Lifecycle

1. Local Testing with Docker Compose

The simplest way to run Pack-A-Mal locally for development and testing.

Prerequisites

  • Docker Desktop installed and running
  • Docker Compose V2

Quick Start

# Build all images
docker-compose build

# Start all services
docker-compose up -d

# View logs
docker-compose logs -f

# Access:
# - Frontend: http://localhost:8080
# - Backend API: http://localhost:8001
# - Database: localhost:5433

Services

🐳 Backend (Django)

  • Container: packamal-backend-dev
  • Port: 8001 (mapped from 8000)
  • Framework: Django 5.1.6
  • Python: 3.12
  • Features: Hot reload via volume mounting, Gunicorn with 4 workers

🌐 Frontend (Nginx)

  • Container: packamal-frontend-dev
  • Port: 8080 (mapped from 80)
  • Purpose: Serves static files and proxies to backend
  • Features: Gzip compression, caching headers

💾 Database (PostgreSQL)

  • Container: packamal-db-dev
  • Port: 5433 (mapped from 5432)
  • Version: PostgreSQL 15 Alpine
  • Database: packamal

🔴 Redis

  • Container: packamal-redis-dev
  • Port: 6379
  • Purpose: Message broker for Celery

⚙️ Celery Workers

  • Worker 1: Processes analysis queue (single worker for single-container execution)
  • Worker 2: Processes maintenance and celery queues
  • Beat: Periodic task scheduler
  • Flower: Monitoring dashboard on port 5555
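
A rough sketch of how this queue split and schedule could be wired up in Celery is shown below. The module paths, the analysis task name, and the broker URL are assumptions; the queue names and intervals follow the descriptions in this README.

# celery_sketch.py - illustrative only, not the project's actual configuration
from celery import Celery
from celery.schedules import crontab

app = Celery("packamal", broker="redis://redis:6379/0")

# Route heavy analysis work to the "analysis" queue (Worker 1) and
# housekeeping tasks to the "maintenance" queue (Worker 2).
app.conf.task_routes = {
    "package_analysis.tasks.run_analysis": {"queue": "analysis"},
    "package_analysis.tasks.sync_k8s_job_status": {"queue": "maintenance"},
    "package_analysis.tasks.cleanup_old_tasks": {"queue": "maintenance"},
}

# Celery Beat schedule matching the intervals described later in this README.
app.conf.beat_schedule = {
    "sync-k8s-job-status": {
        "task": "package_analysis.tasks.sync_k8s_job_status",
        "schedule": 60.0,  # every 60 seconds
    },
    "cleanup-old-tasks": {
        "task": "package_analysis.tasks.cleanup_old_tasks",
        "schedule": crontab(minute=0),  # hourly
    },
}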

Using Makefile (Recommended)

# Start services
make up

# View logs
make logs

# Run migrations
make migrate

# Create superuser
make createsuperuser

# Access backend shell
make shell-backend

# Access database shell
make shell-db

# Stop services
make down

# See all commands
make help

Configuration

Environment Variables

Copy .env.example to .env and customize:

cp .env.example .env

Default values:

  • POSTGRES_DB: packamal
  • POSTGRES_USER: packamal_db
  • POSTGRES_PASSWORD: rock-beryl-say-devices
  • DEBUG: True
  • BACKEND_PORT: 8001
  • FRONTEND_PORT: 8080
  • DB_PORT: 5433
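
These are plain environment variables; the sketch below shows how a Django settings module could consume them. POSTGRES_HOST, POSTGRES_PORT, and the defaults are assumptions; inside the Compose network the database service listens on 5432, and 5433 is only the host-side mapping.

# settings_sketch.py - illustrative only; the real settings live in the backend project
import os

DEBUG = os.environ.get("DEBUG", "False") == "True"

DATABASES = {
    "default": {
        "ENGINE": "django.db.backends.postgresql",
        "NAME": os.environ.get("POSTGRES_DB", "packamal"),
        "USER": os.environ.get("POSTGRES_USER", "packamal_db"),
        "PASSWORD": os.environ.get("POSTGRES_PASSWORD", ""),
        "HOST": os.environ.get("POSTGRES_HOST", "database"),
        "PORT": os.environ.get("POSTGRES_PORT", "5432"),
    }
}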

Development Workflow

Making Code Changes

  1. Edit files in backend/ directory
  2. Changes auto-reload (Django development server watches for changes)
  3. Refresh browser to see updates

Database Migrations

# Create migrations
docker-compose exec backend python manage.py makemigrations

# Apply migrations
docker-compose exec backend python manage.py migrate

# Or use Makefile
make migrate

Installing New Dependencies

# Add to backend/requirements.txt
echo "new-package==1.0.0" >> backend/requirements.txt

# Rebuild backend image
docker-compose build backend

# Restart backend
docker-compose restart backend

Scaling

# Scale to 3 instances
docker-compose up -d --scale backend=3

# Or using Makefile
make scale-backend

# View running instances
docker-compose ps

Nginx automatically load balances between backend instances.

Data Persistence

All data persists in ./volumes/:

  • postgres_data/ - Database files
  • media/ - Uploaded media files
  • static/ - Collected static files
  • logs/ - Application logs
  • analysis-results/ - Dynamic analysis results (Docker volume)

Useful Commands

# View service status
docker-compose ps

# View logs
docker-compose logs -f backend
docker-compose logs -f frontend
docker-compose logs -f database

# Execute commands in containers
docker-compose exec backend python manage.py shell
docker-compose exec database psql -U packamal_db -d packamal
docker-compose exec backend bash

# Collect static files
docker-compose exec backend python manage.py collectstatic --noinput

# Run tests
docker-compose exec backend python manage.py test

Troubleshooting

Container Won't Start

# Check logs
docker-compose logs backend

# Rebuild image
docker-compose build --no-cache backend
docker-compose up -d

Database Connection Issues

# Check database is healthy
docker-compose exec database pg_isready -U packamal_db

# Verify environment variables
docker-compose exec backend env | grep POSTGRES

Port Already in Use

Change ports in docker-compose.yml:

  • Backend: "8001:8000" → "9001:8000"
  • Frontend: "8080:80" → "9080:80"
  • Database: "5433:5432" → "5434:5432"

Clean Start

# Remove all containers and volumes
docker-compose down -v

# Rebuild and start fresh
docker-compose build
docker-compose up -d

Inspecting Analysis Results Volume

# List files in volume
docker run --rm -v analysis_results:/data alpine ls -lah /data

# View a file
docker run --rm -v analysis_results:/data alpine cat /data/path/to/file.json

# Copy files to host
docker run --rm -v analysis_results:/data -v "$PWD":/host alpine sh -c 'cp -r /data/* /host/'

# Inspect volume path
docker volume inspect analysis_results

Testing API

curl -X POST "http://localhost:8080/api/v1/analyze/" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"purl": "pkg:npm/lodash@4.17.21"}'

2. Local Testing with Kubernetes (Minikube)

Run Pack-A-Mal on a local Kubernetes cluster using Minikube for testing Kubernetes-specific features.

Prerequisites

  • Minikube installed
  • kubectl installed
  • Docker installed

Quick Start

1. Start Minikube

minikube start --driver=docker --force-systemd=true --container-runtime=containerd

2. Build and Load Images

# Build local images
docker build -t packamal-backend:local ./backend
docker build -t packamal-frontend:local ./frontend
docker build -t packamal-go-worker-analysis:local -f ./worker/cmd/analyze/Dockerfile ./worker

# Load images into minikube
minikube image load packamal-backend:local 
minikube image load packamal-frontend:local
minikube image load packamal-go-worker-analysis:local

3. Apply Kubernetes Resources

# Apply all resources in correct order
./prd/k8s_minikube/apply-k8s.sh

This script will:

  • Create namespace "packamal"
  • Set up ConfigMaps and Secrets
  • Create PersistentVolumeClaims
  • Set up RBAC permissions for pod creation
  • Deploy all services (PostgreSQL, Redis, Backend, Frontend, Workers, etc.)
  • Configure Horizontal Pod Autoscaler (HPA)

4. Create Superuser

kubectl exec -it -n packamal deployment/backend -- python manage.py createsuperuser

5. Expose Services

# Port forward frontend service
./prd/k8s_minikube/port-forward-external.sh

This will expose the frontend on http://localhost:8080 (or another port if 8080 is busy).

Architecture

All components run in a single Kubernetes namespace packamal:

  • Frontend (listening on port 80)
  • Backend: Django (listening on port 8000)
  • Heavy Task (Worker): Pods are automatically created by the backend during requests using the image packamal-go-worker-analysis:local
  • Database: PostgreSQL (for persistent state)
  • Cache/Queue: Redis (for job queuing)

Communication Flow

  1. Task Creation: Backend receives an API request and creates an AnalysisTask with status received.

  2. Queueing: Task is pushed to Celery (Redis) and status changes to queued.

  3. Job Submission: Celery worker processes the job and:

    • Calls K8sService.run_analysis() to create a Kubernetes Job
    • On success, status changes to processing and job_name is stored
    • See details in backend/package_analysis/services/k8s_service.py (a simplified sketch follows this list)
  4. Analysis Execution: The Kubernetes Job pod "go analysis worker" performs the analysis:

    • Runs dynamic analysis in a sandboxed environment
    • Saves the analysis result to permanent PVC storage "analysis-results-pvc"
    • Sends a callback signal to "http://backend:8000/api/v1/internal/callback/done/" when completed
  5. Completion Handling (two paths):

    • Worker Callback Path: Backend receives callback from worker:
      • Reads analysis result from PVC storage
      • Generates and saves report to database
      • Creates professional report
      • Updates task status to completed
      • Triggers K8s job cleanup
    • Watcher Path: sync_k8s_job_status task (runs every 60s):
      • Monitors all tasks with status processing
      • Queries K8s API for job status
      • Updates status based on job conditions:
        • completed: Job succeeded (job_status.succeeded > 0)
        • failed: Job failed (non-timeout reasons)
        • timeout: Job killed by K8s (DeadlineExceeded condition)
      • Triggers cleanup for completed/failed/timeout tasks
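
As a rough illustration of step 3 (the real implementation lives in backend/package_analysis/services/k8s_service.py), a service could create the analysis Job with the official kubernetes Python client along the following lines. The job naming, resource shape, and the 30-minute deadline are assumptions based on examples elsewhere in this README.

# k8s_service_sketch.py - simplified illustration of K8sService.run_analysis()
import uuid
from kubernetes import client, config


def run_analysis(package_name: str, version: str, ecosystem: str) -> str:
    """Create a Kubernetes Job for one analysis and return its name."""
    config.load_incluster_config()  # the backend runs inside the cluster
    job_name = f"analysis-{package_name}-{uuid.uuid4().hex[:8]}"

    container = client.V1Container(
        name="analyze",
        image="packamal-go-worker-analysis:local",
        args=["-ecosystem", ecosystem, "-package", package_name,
              "-version", version, "-mode", "dynamic",
              "-dynamic-bucket", "file:///results/"],
        volume_mounts=[client.V1VolumeMount(name="results", mount_path="/results")],
    )
    spec = client.V1JobSpec(
        active_deadline_seconds=1800,  # assumed 30-minute deadline (timeout path)
        backoff_limit=0,
        template=client.V1PodTemplateSpec(
            spec=client.V1PodSpec(
                restart_policy="Never",
                containers=[container],
                volumes=[client.V1Volume(
                    name="results",
                    persistent_volume_claim=client.V1PersistentVolumeClaimVolumeSource(
                        claim_name="analysis-results-pvc"),
                )],
            )
        ),
    )
    job = client.V1Job(metadata=client.V1ObjectMeta(name=job_name), spec=spec)
    client.BatchV1Api().create_namespaced_job(namespace="packamal", body=job)
    return job_name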

Task Status Lifecycle

The system uses the following status values for AnalysisTask:

Status      Description                                                   Next Possible Statuses
received    Task record just created in the database                     queued
queued      Pushed to Celery (Redis), awaiting K8sService processing     processing, failed
processing  K8s API call succeeded (Job is running)                      completed, failed, timeout
completed   Worker reported success or the watcher saw the Job succeed   (final state)
failed      Application error, Go code error, or K8s job failure         (final state)
timeout     Killed by K8s after exceeding activeDeadlineSeconds          (final state)

Status Flow Diagram:

received → queued → processing → completed
                              ↘ failed
                              ↘ timeout

Status API Endpoints

1. Check Task Status

GET /api/v1/task/<task_id>/

Returns detailed task information including:

  • Current status
  • Queue position (if queued)
  • Remaining time (if processing)
  • Error details (if failed/timeout)
  • Download URL (if completed)

Example Response:

{
  "task_id": 123,
  "purl": "pkg:npm/lodash@4.17.21",
  "status": "processing",
  "package_name": "lodash",
  "package_version": "4.17.21",
  "ecosystem": "npm",
  "created_at": "2024-01-15T10:00:00Z",
  "started_at": "2024-01-15T10:00:05Z",
  "remaining_time_minutes": 25,
  "is_timed_out": false,
  "job_id": "analysis-lodash-914f70e8"
}

2. List Tasks

GET /api/v1/reports/?page=1&page_size=20&status=processing

Query parameters:

  • page: Page number (default: 1)
  • page_size: Items per page (default: 20, max: 100)
  • status: Filter by status (received, queued, processing, completed, failed, timeout)

3. Queue Status

GET /api/v1/queue/status/

Returns current queue status:

  • Number of queued tasks
  • Number of processing tasks
  • List of queued tasks with positions
  • List of processing tasks with job IDs

4. Task Queue Position

GET /api/v1/task/<task_id>/queue/

Returns queue position for a specific task:

  • Queue position (if queued)
  • Status information
  • Package details

5. Timeout Status

GET /api/v1/timeout/status/

Returns timeout information for all processing tasks:

  • Number of running tasks
  • Number of timed out tasks
  • Detailed timeout info for each task

6. Internal Callback (Worker)

POST /api/v1/internal/callback/done/
Authorization: Bearer <INTERNAL_API_TOKEN>
Content-Type: application/json

{
  "task_id": "123"
}

Called by Go worker when analysis completes. Requires internal API token.
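
On the backend side, the view behind this endpoint implements the worker-callback path from the communication flow above. A rough Django-style sketch is shown below; the AnalysisTask model name comes from this README, while the import path and the helper functions are hypothetical placeholders.

# callback_view_sketch.py - illustrative only; the real handler lives in the backend app
import json
import os

from django.http import HttpResponseForbidden, JsonResponse
from django.views.decorators.csrf import csrf_exempt
from django.views.decorators.http import require_POST

from package_analysis.models import AnalysisTask  # assumed import path


@csrf_exempt
@require_POST
def analysis_done_callback(request):
    # The Go worker authenticates with the internal token, not a user token.
    expected = f"Bearer {os.environ.get('INTERNAL_API_TOKEN', '')}"
    if request.headers.get("Authorization") != expected:
        return HttpResponseForbidden("invalid internal token")

    task_id = json.loads(request.body)["task_id"]
    task = AnalysisTask.objects.get(pk=task_id)

    # Worker-callback path (helper names below are hypothetical):
    result = read_result_from_pvc(task)   # read result saved to analysis-results-pvc
    generate_report(task, result)         # generate and persist the report
    task.status = "completed"
    task.save(update_fields=["status"])
    cleanup_k8s_job(task.job_name)        # trigger K8s job cleanup

    return JsonResponse({"status": "ok"})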

Status Monitoring

Automatic Monitoring

  • Watcher Task: sync_k8s_job_status runs every 60 seconds via Celery Beat
    • Monitors all tasks with status processing
    • Queries K8s API for job status
    • Updates task status based on job conditions
    • Triggers cleanup for completed tasks

Manual Monitoring

# Check task status via API
curl -X GET "http://localhost:8080/api/v1/task/123/" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

# Check queue status
curl -X GET "http://localhost:8080/api/v1/queue/status/" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

# Check timeout status
curl -X GET "http://localhost:8080/api/v1/timeout/status/"

Cleanup Process

When a task reaches a final state (completed, failed, or timeout):

  1. Log Retrieval: For failed/timeout tasks, pod logs are retrieved and saved to error_details
  2. Job Deletion: K8s job is deleted to free cluster resources
  3. Cleanup Trigger: Both watcher and worker callback trigger cleanup automatically

Cleanup also runs periodically via cleanup_old_tasks task (every hour) to remove old completed/failed/timeout tasks and their associated K8s jobs.
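
A simplified sketch of such a cleanup step, using the official kubernetes Python client, is shown below; the function name and the log-collection detail are illustrative.

# cleanup_sketch.py - illustrative only; cleanup is normally triggered by the
# watcher task and the worker callback as described above
from kubernetes import client, config


def cleanup_job(job_name: str, namespace: str = "packamal",
                collect_logs: bool = False) -> str | None:
    """Optionally grab pod logs for error_details, then delete the Job."""
    config.load_incluster_config()
    core = client.CoreV1Api()
    batch = client.BatchV1Api()

    logs = None
    if collect_logs:
        # Job pods carry a job-name label pointing back to their Job.
        pods = core.list_namespaced_pod(namespace, label_selector=f"job-name={job_name}")
        if pods.items:
            logs = core.read_namespaced_pod_log(pods.items[0].metadata.name, namespace)

    # Foreground propagation also removes the Job's pods.
    batch.delete_namespaced_job(
        job_name, namespace,
        body=client.V1DeleteOptions(propagation_policy="Foreground"))
    return logs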

Additional Components

  • A Horizontal Pod Autoscaler (HPA) for the Backend deployment
  • Automatic pod creation for analysis workers from image packamal-go-worker-analysis:local, based on requests from the Celery worker

Restarting

# Restart minikube and rebuild/load images
./prd/k8s_minikube/restart_containerd_minikube.sh

# Use -w flag to build and load go-heavy worker
./prd/k8s_minikube/restart_containerd_minikube.sh -w

Testing

You should log in and create an API key before testing.

curl -X POST "http://localhost:8080/api/v1/analyze/" \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"purl": "pkg:npm/lodash@4.17.21"}'

Troubleshooting

Podman Cgroup Errors

Check whether the Cgroup error is resolved:

kubectl exec -n packamal analysis-lodash-914f70e8-hdh4h -- sh -lc '
set -e
mkdir -p /sys/fs/cgroup/libpod_parent/test
echo $$ > /sys/fs/cgroup/libpod_parent/test/cgroup.procs
echo "OK: cgroup.procs write works"
'

Debugging Analysis Pods

For debugging, you can exec into an analysis pod:

kubectl exec -it -n packamal analysis-<analysis-pod> -- /bin/bash

Inside the pod, you can run:

analyze -ecosystem npm -package lodash -version 4.17.21 -mode dynamic -nopull -dynamic-bucket file:///results/

Useful Commands

# Check pod status
kubectl get pods -n packamal

# View logs
kubectl logs -f -n packamal deployment/backend
kubectl logs -f -n packamal deployment/celery-worker

# Check services
kubectl get svc -n packamal

# Check PVCs
kubectl get pvc -n packamal

# Check HPA
kubectl get hpa -n packamal

3. Production Deployment on Azure Kubernetes Service (AKS)

Deploy Pack-A-Mal to Azure Kubernetes Service (AKS) for production use.

Overview

This deployment uses Azure Kubernetes Service (AKS) with:

  • Azure Container Registry (ACR) for container images
  • Azure Disk for persistent storage
  • Azure Load Balancer for external access
  • Azure Monitor for logging and monitoring

Prerequisites

  • Azure subscription with appropriate permissions
  • Azure CLI installed and configured
  • kubectl installed
  • Docker installed (for building images)

Quick Start

1. Set Up Infrastructure

Follow the comprehensive guide in prd/aks/01-infrastructure-setup.md to:

  • Create AKS cluster
  • Set up networking
  • Configure storage classes

2. Set Up Container Registry

Follow prd/aks/02-container-registry.md to:

  • Create Azure Container Registry (ACR)
  • Build and push images
  • Configure AKS to pull from ACR

3. Configure Security and RBAC

Follow prd/aks/03-security-rbac.md to:

  • Set up RBAC
  • Configure Azure Key Vault
  • Set up network policies

4. Deploy Application

# Connect to AKS cluster
az aks get-credentials --resource-group packamal-rg --name packamal-aks

# Apply Kubernetes manifests
./prd/aks/apply-aks.sh

This script will:

  • Create namespace and ConfigMaps
  • Set up PersistentVolumeClaims
  • Configure RBAC
  • Deploy databases (PostgreSQL, Redis)
  • Deploy application components (Backend, Frontend, Workers)
  • Set up HPA and monitoring

Architecture

┌─────────────────────────────────────────────────────────┐
│                   Azure Load Balancer                    │
│                   (Ingress Controller)                   │
└───────────────────────────┬─────────────────────────────┘
                            │
            ┌───────────────┴───────────────┐
            │                               │
    ┌───────▼───────┐               ┌───────▼───────┐
    │   Frontend    │               │    Backend    │
    │    (Nginx)    │───────────────│   (Django)    │
    └───────────────┘               └───────┬───────┘
                                            │
                    ┌───────────────────────┼───────────────────────┐
                    │                       │                       │
             ┌──────▼──────┐         ┌──────▼──────┐         ┌──────▼──────┐
             │   Celery    │         │ PostgreSQL  │         │    Redis    │
             │   Worker    │         │             │         │             │
             └──────┬──────┘         └─────────────┘         └─────────────┘
                    │
             ┌──────▼──────┐
             │  Analysis   │
             │  Jobs (Go)  │
             │ (Ephemeral) │
             └─────────────┘

Key Components

  • Frontend: Nginx serving static files
  • Backend: Django/Gunicorn API server with HPA
  • Celery Worker: Processes analysis jobs, creates Kubernetes Jobs
  • Go Analysis Worker: Ephemeral pods for heavy analysis
  • PostgreSQL: Primary database with persistent storage
  • Redis: Message broker and cache with persistent storage

Migration from Minikube

For step-by-step migration instructions, see prd/aks/09-migration-guide.md.

Documentation

Comprehensive documentation is available in the prd/aks/ directory:

  1. 00-overview.md - High-level overview, architecture, and goals
  2. 01-infrastructure-setup.md - AKS cluster creation, networking, storage
  3. 02-container-registry.md - Azure Container Registry setup and image management
  4. 03-security-rbac.md - Security, RBAC, Key Vault, network policies
  5. 04-kubernetes-manifests/ - AKS-optimized Kubernetes manifests
  6. 05-cicd-pipeline.md - CI/CD pipeline setup
  7. 06-monitoring-logging.md - Monitoring, logging, and observability
  8. 07-backup-disaster-recovery.md - Backup strategies and DR procedures
  9. 08-cost-optimization.md - Cost optimization strategies
  10. 09-migration-guide.md - Step-by-step migration from Minikube to AKS

Estimated Costs

Initial Setup: ~$549/month

  • AKS Cluster (3 nodes): ~$450
  • Storage: ~$15
  • Load Balancer: ~$25
  • Monitoring: ~$50
  • Container Registry: ~$9

After Optimization: ~$80-240/month

Useful Commands

# Connect to AKS
az aks get-credentials --resource-group packamal-rg --name packamal-aks

# Check deployments
kubectl get deployments -n packamal

# Check pods
kubectl get pods -n packamal

# View logs
kubectl logs -f -n packamal deployment/backend

# Check services
kubectl get svc -n packamal

# Check HPA
kubectl get hpa -n packamal

# Get external IP
kubectl get svc frontend -n packamal

Troubleshooting

Image Pull Errors

# Ensure AKS has permission to pull from ACR
az aks update -n packamal-aks -g packamal-rg --attach-acr packamalacr

PVC Not Binding

# Check storage class
kubectl get storageclass

# Check PVC status
kubectl get pvc -n packamal
kubectl describe pvc <pvc-name> -n packamal

Analysis Jobs Failing

# Check analysis job pods
kubectl get pods -n packamal | grep analysis

# Check logs
kubectl logs -n packamal <analysis-pod-name>

# Verify privileged mode is enabled
kubectl get job -n packamal -o yaml | grep privileged

4. CI/CD with GitHub Actions

Automated CI/CD pipeline for building, pushing, and deploying Pack-A-Mal to AKS.

Overview

The CI/CD pipeline automatically:

  • Builds Docker images (Backend, Frontend, Go Worker)
  • Pushes images to Azure Container Registry (ACR)
  • Updates Kubernetes manifests with new image tags
  • Deploys to AKS cluster

Prerequisites

  • GitHub repository with Actions enabled
  • Azure Container Registry (ACR) created
  • AKS cluster deployed
  • Azure Service Principal with appropriate permissions

Quick Start

1. Configure GitHub Secrets

Follow prd/cicd/02-github-secrets.md to set up:

Required Secrets:

  • ACR_USERNAME - ACR admin username
  • ACR_PASSWORD - ACR admin password
  • AZURE_CREDENTIALS - Azure Service Principal JSON

Get ACR Credentials:

az acr credential show --name packamalacr

Create Service Principal:

az ad sp create-for-rbac \
  --name "packamal-github-actions" \
  --role contributor \
  --scopes /subscriptions/SUBSCRIPTION_ID/resourceGroups/packamal-rg \
  --sdk-auth

2. Workflow File

The workflow file is located at .github/workflows/aks-deploy.yml.

Trigger:

  • Automatic: Push to main branch
  • Manual: Workflow dispatch from GitHub Actions UI

3. Test Workflow

  1. Push code to main branch to trigger workflow automatically
  2. Or manually trigger from GitHub Actions tab
  3. View logs in GitHub Actions to monitor progress

Pipeline Architecture

┌─────────────────┐
│  Git Push to    │
│  main branch    │
└────────┬────────┘
         │
         ▼
┌─────────────────────────────────────┐
│  GitHub Actions Workflow Trigger    │
│  (.github/workflows/aks-deploy.yml) │
└────────┬────────────────────────────┘
         │
         ├─────────────────┬─────────────────┐
         ▼                 ▼                 ▼
┌──────────────┐  ┌──────────────┐  ┌──────────────┐
│ Build        │  │ Build        │  │ Build        │
│ Backend      │  │ Frontend     │  │ Go Worker    │
│ Image        │  │ Image        │  │ Image        │
└──────┬───────┘  └──────┬───────┘  └──────┬───────┘
       │                 │                 │
       └─────────────────┴─────────────────┘
                         │
                         ▼
              ┌──────────────────────┐
              │  Push to ACR         │
              │  packamalacr.        │
              │  azurecr.io          │
              └──────────┬───────────┘
                         │
                         ▼
              ┌──────────────────────┐
              │  Update Image Tags    │
              │  in K8s Manifests     │
              └──────────┬───────────┘
                         │
                         ▼
              ┌──────────────────────┐
              │  Connect to AKS      │
              │  packamal-aks        │
              └──────────┬───────────┘
                         │
                         ▼
              ┌──────────────────────┐
              │  Deploy Manifests    │
              │  to namespace        │
              │  packamal            │
              └──────────┬───────────┘
                         │
                         ▼
              ┌──────────────────────┐
              │  Verify Deployment   │
              │  & Health Check      │
              └──────────────────────┘

Workflow Jobs

Job 1: build-and-push

  • Builds Backend image from ./backend/Dockerfile
  • Builds Frontend image from ./frontend/Dockerfile
  • Builds Go Worker image from ./worker/cmd/analyze/Dockerfile
  • Pushes all images to packamalacr.azurecr.io
  • Tags with commit SHA and latest

Job 2: deploy

  • Logs into Azure using Service Principal
  • Connects to AKS cluster packamal-aks
  • Updates image tags in Kubernetes manifests
  • Deploys all manifests to namespace packamal
  • Verifies deployment and performs health checks

Images Built

Image      Dockerfile                       Registry Path
Backend    ./backend/Dockerfile             packamalacr.azurecr.io/packamal-backend
Frontend   ./frontend/Dockerfile            packamalacr.azurecr.io/packamal-frontend
Go Worker  ./worker/cmd/analyze/Dockerfile  packamalacr.azurecr.io/packamal-go-worker-analysis

Manifests Updated

The workflow automatically updates image tags in:

  1. 05-backend.yaml - Backend deployment (initContainer and main container)
  2. 07-frontend.yaml - Frontend deployment
  3. 06-worker.yaml - Celery worker deployment
  4. 08-worker-2.yaml - Celery worker 2 deployment
  5. 01-config.yaml - ANALYSIS_IMAGE config value
  6. 13-image-preloader.yaml - Image preloader deployment

Verify Deployment

# Connect to AKS
az aks get-credentials --resource-group packamal-rg --name packamal-aks

# Check deployments
kubectl get deployments -n packamal

# Check pods
kubectl get pods -n packamal

# Check image versions
kubectl get deployments -n packamal -o jsonpath='{range .items[*]}{.metadata.name}{":\t"}{.spec.template.spec.containers[*].image}{"\n"}{end}'

Rollback

If deployment fails:

# Rollback deployment
kubectl rollout undo deployment/backend -n packamal
kubectl rollout undo deployment/frontend -n packamal
kubectl rollout undo deployment/celery-worker -n packamal

# Or rollback to previous image by updating manifest with old commit SHA

Troubleshooting

Workflow Not Triggering

  • ✅ Check .github/workflows/aks-deploy.yml is committed
  • ✅ Check push is to main branch
  • ✅ Check GitHub Actions is enabled for repository

Build Failures

  • ✅ Check Dockerfile syntax
  • ✅ Check build context paths
  • ✅ Check ACR credentials

Deploy Failures

  • ✅ Check Azure credentials
  • ✅ Check AKS cluster status
  • ✅ Check Service Principal permissions
  • ✅ Check kubectl connection

Image Pull Failures

# Ensure AKS has permission to pull from ACR
az aks update -n packamal-aks -g packamal-rg --attach-acr packamalacr

Monitoring

GitHub Actions Logs

  • View logs in GitHub Actions tab
  • Each workflow run has detailed logs
  • Logs can be downloaded

Kubernetes Logs

# View deployment status
kubectl get deployments -n packamal

# View pod logs
kubectl logs -f deployment/backend -n packamal

# View rollout status
kubectl rollout status deployment/backend -n packamal

5. Task Status API and Lifecycle

Overview

Pack-A-Mal uses a comprehensive task status system to track analysis jobs from creation to completion. The system supports both worker callbacks and automatic watcher monitoring to ensure reliable status updates.

Task Status Values

Status      Description                                                   Next Possible Statuses
received    Task record just created in the database                     queued
queued      Pushed to Celery (Redis), awaiting K8sService processing     processing, failed
processing  K8s API call succeeded (Job is running)                      completed, failed, timeout
completed   Worker reported success or the watcher saw the Job succeed   (final state)
failed      Application error, Go code error, or K8s job failure         (final state)
timeout     Killed by K8s after exceeding activeDeadlineSeconds          (final state)

Status Flow

┌──────────┐
│ received │  Task created in database
└────┬─────┘
     │
     ▼
┌──────────┐
│  queued  │  Pushed to Celery broker (Redis)
└────┬─────┘
     │
     ▼
┌──────────────┐
│  processing  │  K8s job created successfully
└──────┬───────┘
       │
       ├──────────────┬──────────────┐
       ▼              ▼              ▼
┌─────────────┐ ┌──────────┐ ┌──────────┐
│  completed  │ │  failed  │ │ timeout  │
└─────────────┘ └──────────┘ └──────────┘
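
The same lifecycle can be expressed compactly in code. The sketch below mirrors the table and diagram above; it is illustrative, not the project's actual model code.

# status_sketch.py - status values and allowed transitions
from enum import Enum


class TaskStatus(str, Enum):
    RECEIVED = "received"
    QUEUED = "queued"
    PROCESSING = "processing"
    COMPLETED = "completed"
    FAILED = "failed"
    TIMEOUT = "timeout"


# completed, failed, and timeout are final states with no outgoing transitions.
TRANSITIONS = {
    TaskStatus.RECEIVED: {TaskStatus.QUEUED},
    TaskStatus.QUEUED: {TaskStatus.PROCESSING, TaskStatus.FAILED},
    TaskStatus.PROCESSING: {TaskStatus.COMPLETED, TaskStatus.FAILED, TaskStatus.TIMEOUT},
    TaskStatus.COMPLETED: set(),
    TaskStatus.FAILED: set(),
    TaskStatus.TIMEOUT: set(),
}


def can_transition(current: TaskStatus, new: TaskStatus) -> bool:
    return new in TRANSITIONS[current]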

Status API Endpoints

1. Check Task Status

GET /api/v1/task/<task_id>/
Authorization: Bearer <API_TOKEN>

Response:

{
  "task_id": 123,
  "purl": "pkg:npm/lodash@4.17.21",
  "status": "processing",
  "package_name": "lodash",
  "package_version": "4.17.21",
  "ecosystem": "npm",
  "created_at": "2024-01-15T10:00:00Z",
  "started_at": "2024-01-15T10:00:05Z",
  "remaining_time_minutes": 25,
  "is_timed_out": false,
  "job_id": "analysis-lodash-914f70e8",
  "queue_position": null,
  "timeout_minutes": 30
}

2. List Tasks

GET /api/v1/reports/?page=1&page_size=20&status=processing
Authorization: Bearer <API_TOKEN>

Query Parameters:

  • page: Page number (default: 1)
  • page_size: Items per page (default: 20, max: 100)
  • status: Filter by status (optional)

Response:

{
  "items": [
    {
      "task_id": 123,
      "purl": "pkg:npm/lodash@4.17.21",
      "status": "processing",
      "created_at": "2024-01-15T10:00:00Z",
      "package_name": "lodash",
      "package_version": "4.17.21",
      "ecosystem": "npm"
    }
  ],
  "page": 1,
  "page_size": 20,
  "total": 1
}
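
For example, a small client can page through this endpoint until every matching task has been fetched. The field names follow the response above; the base URL and token are placeholders.

# list_tasks_sketch.py - walk /api/v1/reports/ page by page
import requests

BASE_URL = "http://localhost:8080"
HEADERS = {"Authorization": "Bearer YOUR_API_TOKEN"}


def list_tasks(status=None, page_size=20):
    page = 1
    while True:
        params = {"page": page, "page_size": page_size}
        if status:
            params["status"] = status
        data = requests.get(f"{BASE_URL}/api/v1/reports/",
                            headers=HEADERS, params=params, timeout=30).json()
        yield from data["items"]
        if page * page_size >= data["total"]:
            break
        page += 1


for task in list_tasks(status="processing"):
    print(task["task_id"], task["purl"], task["status"])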

3. Queue Status

GET /api/v1/queue/status/

Response:

{
  "queue_length": 3,
  "processing_tasks": 1,
  "queued_tasks": [
    {
      "task_id": 120,
      "purl": "pkg:pypi/requests@2.31.0",
      "queue_position": 1,
      "priority": 0,
      "queued_at": "2024-01-15T10:00:00Z"
    }
  ],
  "processing_tasks": [
    {
      "task_id": 123,
      "purl": "pkg:npm/lodash@4.17.21",
      "job_id": "analysis-lodash-914f70e8",
      "started_at": "2024-01-15T10:00:05Z"
    }
  ]
}

4. Task Queue Position

GET /api/v1/task/<task_id>/queue/
Authorization: Bearer <API_TOKEN>

5. Timeout Status

GET /api/v1/timeout/status/

Response:

{
  "running_tasks": 1,
  "timed_out_tasks": 0,
  "tasks": [
    {
      "task_id": 123,
      "purl": "pkg:npm/lodash@4.17.21",
      "started_at": "2024-01-15T10:00:05Z",
      "timeout_minutes": 30,
      "remaining_minutes": 25,
      "is_timed_out": false
    }
  ]
}

Status Monitoring

Automatic Monitoring

Watcher Task (sync_k8s_job_status):

  • Runs every 60 seconds via Celery Beat
  • Monitors all tasks with status processing
  • Queries K8s API for job status
  • Updates task status based on job conditions:
    • completed: Job succeeded (job_status.succeeded > 0)
    • failed: Job failed (non-timeout reasons)
    • timeout: Job killed by K8s (DeadlineExceeded condition)
  • Triggers cleanup for completed/failed/timeout tasks
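
The condition-to-status mapping above could be implemented roughly as in the sketch below, using the official kubernetes Python client. This is an illustration; the real logic lives in the sync_k8s_job_status task.

# watcher_sketch.py - map a K8s Job's state to a task status
from kubernetes import client, config


def job_outcome(job_name: str, namespace: str = "packamal") -> str | None:
    """Return 'completed', 'timeout', 'failed', or None if still running."""
    config.load_incluster_config()
    job = client.BatchV1Api().read_namespaced_job_status(job_name, namespace)
    status = job.status

    if status.succeeded and status.succeeded > 0:
        return "completed"

    for cond in status.conditions or []:
        if cond.type == "Failed" and cond.status == "True":
            # Jobs killed by activeDeadlineSeconds report reason DeadlineExceeded.
            return "timeout" if cond.reason == "DeadlineExceeded" else "failed"

    return None  # still processing; leave the task untouched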

Worker Callback:

  • Go worker calls /api/v1/internal/callback/done/ when analysis completes
  • Backend processes results and marks task as completed
  • Handles race condition with watcher (if watcher marked as completed first, still processes results)

Manual Monitoring

# Check specific task status
curl -X GET "http://localhost:8080/api/v1/task/123/" \
  -H "Authorization: Bearer YOUR_API_TOKEN"

# Poll for completion
while true; do
  STATUS=$(curl -s -X GET "http://localhost:8080/api/v1/task/123/" \
    -H "Authorization: Bearer YOUR_API_TOKEN" | jq -r '.status')
  echo "Status: $STATUS"
  if [ "$STATUS" = "completed" ] || [ "$STATUS" = "failed" ] || [ "$STATUS" = "timeout" ]; then
    break
  fi
  sleep 5
done

Cleanup Process

When a task reaches a final state (completed, failed, or timeout):

  1. Log Retrieval: For failed/timeout tasks, pod logs are retrieved and saved to error_details
  2. Job Deletion: K8s job is deleted to free cluster resources
  3. Automatic Cleanup: Both watcher and worker callback trigger cleanup automatically

Periodic Cleanup:

  • cleanup_old_tasks task runs every hour
  • Removes old completed/failed/timeout tasks (older than 7 days)
  • Cleans up associated K8s jobs before deletion

Error Handling

Failed Tasks

  • Error Message: Human-readable error description
  • Error Category: Categorized error type (e.g., k8s_job_failed, timeout_error, results_not_found)
  • Error Details: JSON object with detailed information including pod logs (if available)

Timeout Tasks

  • Automatically detected when K8s job exceeds activeDeadlineSeconds
  • Error category: timeout_error
  • Error details include timeout reason and timestamp

Best Practices

  1. Polling: Poll task status every 5-10 seconds, not more frequently
  2. Timeout Handling: Check is_timed_out flag for processing tasks
  3. Error Handling: Always check error_message and error_category for failed tasks
  4. Queue Position: Use queue position to estimate wait time for queued tasks
  5. Status URLs: Use status_url from initial response for status checks
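
Putting these practices together, a client might look like the following sketch. The base URL and token are placeholders; the response field names come from the examples shown earlier, and the shape of the initial /api/v1/analyze/ response is an assumption.

# poll_sketch.py - submit one analysis and poll until a final state
import time
import requests

BASE_URL = "http://localhost:8080"
HEADERS = {"Authorization": "Bearer YOUR_API_TOKEN"}
FINAL_STATES = {"completed", "failed", "timeout"}


def analyze(purl: str) -> dict:
    resp = requests.post(f"{BASE_URL}/api/v1/analyze/", headers=HEADERS,
                         json={"purl": purl}, timeout=30)
    resp.raise_for_status()
    task = resp.json()  # assumed to contain task_id and status

    while task["status"] not in FINAL_STATES:
        time.sleep(10)  # poll every 5-10 seconds, not more frequently
        task = requests.get(f"{BASE_URL}/api/v1/task/{task['task_id']}/",
                            headers=HEADERS, timeout=30).json()
        if task["status"] == "queued":
            print("queue position:", task.get("queue_position"))
        elif task["status"] == "processing":
            print("remaining minutes:", task.get("remaining_time_minutes"))

    if task["status"] != "completed":
        # error_message and error_category explain failed and timeout tasks
        print("error:", task.get("error_category"), "-", task.get("error_message"))
    return task


if __name__ == "__main__":
    analyze("pkg:npm/lodash@4.17.21")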


Next Steps

  1. For Development: Start with Local Testing with Docker Compose
  2. For Kubernetes Testing: Follow Local Testing with Kubernetes (Minikube)
  3. For Production: Deploy to Azure Kubernetes Service (AKS)
  4. For Automation: Set up CI/CD with GitHub Actions
