Commit Graph

51 Commits

Author SHA1 Message Date
ZhenYi
894c3873a4 fix deploy repo volume permissions 2026-05-15 00:50:13 +08:00
ZhenYi
18b4864050 refactor(deploy): add SSH service annotations and externalTrafficPolicy support 2026-05-14 10:01:18 +08:00
ZhenYi
cac342bdc5 refactor(deploy): remove gingress controller, switch to nginx ingress
- Delete gingress templates (deployment, rbac, service)
- Remove gingress config block from values.yaml
- Switch ingress class to nginx with full annotations:
  - Unlimited body size for large file uploads
  - WebSocket support with 1h timeouts
  - Cookie-based session affinity
  - Real IP passthrough via X-Forwarded-For
2026-05-12 17:20:52 +08:00
ZhenYi
8ecd16868c feat(core): initialize project with access control and AI integration 2026-05-12 17:01:42 +08:00
ZhenYi
8be15cb81e fix(deploy): hardcode PVC name as shared-data in templates, remove pvcName Helm value
PVC name is now immutable — hardcoded in all 4 deployment templates instead
of being a configurable Helm value. Removed pvcName from values.yaml and
--set pvcName from deploy.sh. This ensures the PVC can never be renamed or
deleted by Helm operations, only manually via kubectl.
2026-05-12 16:36:13 +08:00
ZhenYi
4d5caffe0b fix(deploy): disable readOnlyRootFilesystem to prevent temp file write errors
Email worker and other pods fail with "Read-only file system" when
readOnlyRootFilesystem is true, since they need to write temp files.
Also adds debug print statements for database connection lifecycle.
2026-05-11 01:14:06 +08:00
ZhenYi
b560d9ea0f fix(db): use seconds for connection pool timeouts instead of milliseconds
ConfigMap values are in seconds (e.g. connection_timeout=30 means 30s),
but Duration::from_millis() interpreted them as ms (30ms), causing pool
timeout on startup. Changed to from_secs(). Also removed Namespace from
Helm chart to prevent cascade deletion of PVC/ConfigMap on uninstall.
2026-05-10 23:58:16 +08:00
ZhenYi
065c9e6aa5 fix(deploy): replace underscores with hyphens in container names and fix namespace Helm ownership
Kubernetes container names must follow RFC 1123 (no underscores).
Also update deploy.sh to label/annotate namespace with Helm ownership metadata.
2026-05-10 23:23:45 +08:00
ZhenYi
1f025ee957 fix(deploy): unify gingress namespace to app 2026-05-10 22:49:03 +08:00
ZhenYi
7148c8fd39 feat(gingress): add Git UA routing and convert gingress to Helm templates
- Route requests with git/JGit User-Agent directly to gitserver backend
- Parse gingress.io/git-backend annotation (format: namespace/name:port)
- Convert static gingress YAML to Helm templates under deploy/templates/gingress/
- Add gingress config block to values.yaml (namespace, replicas, ports, resources)
2026-05-10 22:47:18 +08:00
ZhenYi
670bcc8c06 feat(deploy): configure ingress with gingress, cert-manager TLS, and SSH LB
- Set primary domain gitdata.ai and static.gitdata.ai with cert-manager TLS
- Add LoadBalancer service for gitserver SSH (port 2222)
- Exclude .server.yaml from Helm packaging
2026-05-10 22:29:32 +08:00
ZhenYi
ba2490dab4 feat(core): initialize project with access control and AI integration 2026-05-10 21:01:21 +08:00
ZhenYi
14f6e1e500 feat(core): initialize project with access control and AI integration
- Add gitignore and prettier configuration files for project scaffolding
- Implement room access control service with project member verification
- Create user access key management with CRUD operations and activity logging
- Add accordion UI component for frontend expandable sections
- Implement room AI configuration with list, upsert, and delete operations
- Add AI event types for agent join/leave/status change tracking
- Create streaming AI processing services for mode and react patterns
- Build room AI service with model detection and idempotency handling
- Integrate chat service orchestration for AI message processing
- Add typing indicators and stream cancellation for AI interactions
- Implement mention parsing and context extraction for AI agents
2026-05-03 06:04:31 +08:00
ZhenYi
d1ade2c3c3 feat(deploy): add HPA autoscaling rules for all services except email
Some checks are pending
CI / Rust Lint & Check (push) Waiting to run
CI / Rust Tests (push) Waiting to run
CI / Frontend Lint & Type Check (push) Waiting to run
CI / Frontend Build (push) Blocked by required conditions
Add HorizontalPodAutoscaler (autoscaling/v2) using CPU and memory utilization
metrics to all deployment templates: app, static, gitserver, git-hook,
operator, adminrpc. Email-worker is excluded as requested.

- CPU target: 80% average utilization
- Memory target: 80% average utilization
- Each service has per-service min/max replicas in values.yaml
- Operator autoscaling defaults to disabled (enabled: false)
- Conditional via {{ if .Values.<service>.autoscaling.enabled }}
2026-04-28 13:42:37 +08:00
ZhenYi
fd232354cc fix(gitserver): correct health probe port path in k8s template 2026-04-26 00:11:48 +08:00
ZhenYi
10836730ed feat: add health endpoints and Prometheus metrics to git-hook and email-worker
Some checks are pending
CI / Rust Lint & Check (push) Waiting to run
CI / Rust Tests (push) Waiting to run
CI / Frontend Lint & Type Check (push) Waiting to run
CI / Frontend Build (push) Blocked by required conditions
Health monitoring:
- gitserver: /health endpoint on port 8021 (DB + Redis ping)
- git-hook: hyper health server on port 8083 with /health
- email-worker: hyper health server on port 8084 with /health
- K8s probes updated to httpGet for all three services

Metrics (via /metrics endpoint):
- git-hook: hook_tasks_total/success/failed/locked/retried/exhausted,
  hook_sync_branches/tags_changed_total
- email: email_queued/consumed/sent/failed_total,
  email_validation_skipped/build_errors/send_attempts_total
2026-04-25 23:45:48 +08:00
ZhenYi
38da729860 fix(adminrpc): expose HTTP port 9091 in k8s deployment and service
Some checks are pending
CI / Rust Lint & Check (push) Waiting to run
CI / Rust Tests (push) Waiting to run
CI / Frontend Lint & Type Check (push) Waiting to run
CI / Frontend Build (push) Blocked by required conditions
The adminrpc binary runs HTTP endpoints on port grpc_port+1 (9091),
but k8s deployment only exposed port 9090 (gRPC). The /api/admin/*
HTTP routes were unreachable from the admin dashboard frontend.

- Add http container port 9091 to Deployment
- Add http named port to k8s Service
- Point liveness/readiness probes to HTTP port 9091
- Add http_port: 9091 to Helm values.yaml
2026-04-22 23:56:38 +08:00
ZhenYi
bb127d4dbb feat(adminrpc): add Docker image, Helm chart, and build/deploy scripts
- docker/adminrpc.Dockerfile: minimal debian runtime image
- deploy/templates/adminrpc-deployment.yaml: Deployment (replicas=1,
  ClusterIP svc on 9090, tcpSocket readiness probe)
- deploy/values.yaml: adminrpc section with defaults
- scripts/build.js: add adminrpc to RUST_SERVICES
- scripts/deploy.js: add adminrpc to SERVICES + helm setValues
2026-04-21 23:14:46 +08:00
ZhenYi
00a5369fe1 feat(frontend): Discord layout + AI Studio theme + Room Settings
Some checks are pending
CI / Rust Lint & Check (push) Waiting to run
CI / Rust Tests (push) Waiting to run
CI / Frontend Lint & Type Check (push) Waiting to run
CI / Frontend Build (push) Blocked by required conditions
Frontend:
- Add Discord-style 3-column layout (server icons / channel sidebar / chat)
- AI Studio design system: new CSS token palette (--room-* vars)
- Replace all hardcoded Discord colors with CSS variable tokens
- Add RoomSettingsPanel (name, visibility, AI model management)
- Settings + Member list panels mutually exclusive (don't overlap)
- AI models shown at top of member list with green accent
- Fix TS errors: TipTap SuggestionOptions, unused imports, StarterKit options
- Remove MentionInput, MentionPopover, old room components (废弃代码清理)

Backend:
- RoomAiResponse returns model_name from agents.model JOIN
- room_ai_list and room_ai_upsert fetch model name for each config
- AiConfigData ws-protocol interface updated with model_name

Note: RoomSettingsPanel UI still uses shadcn defaults (未完全迁移到AI Studio)
2026-04-18 16:59:36 +08:00
ZhenYi
7be2f4eb61 fix(deploy): simplify ingress by removing static paths
Some checks are pending
CI / Rust Lint & Check (push) Waiting to run
CI / Rust Tests (push) Waiting to run
CI / Frontend Lint & Type Check (push) Waiting to run
CI / Frontend Build (push) Blocked by required conditions
The SPA assets are embedded in the app binary, so no separate
static file serving is needed.
2026-04-18 00:51:52 +08:00
ZhenYi
0ea6440ea3 fix(deploy): add proxy-http-version annotation for WebSocket support 2026-04-17 23:18:00 +08:00
ZhenYi
845aba443b fix(deploy): add APP_REPOS_ROOT env to git-hook deployment
Some checks are pending
CI / Rust Lint & Check (push) Waiting to run
CI / Rust Tests (push) Waiting to run
CI / Frontend Lint & Type Check (push) Waiting to run
CI / Frontend Build (push) Blocked by required conditions
Missing env var — git-hook worker needs APP_REPOS_ROOT to resolve repo
storage paths. Added explicit env block with APP_REPOS_ROOT=/data/repos,
consistent with gitserver deployment.
2026-04-17 14:21:15 +08:00
ZhenYi
3354055e6d fix(operator): mount /data PVC into git-hook deployment
GitHook controller was generating a Deployment without any persistent
storage — only a ConfigMap volume at /config. The worker needs /data to
access repo storage paths (APP_REPOS_ROOT defaults to /data/repos).

Changes:
- GitHookSpec: added storage_size field (default 10Gi), matching the
  pattern already used by GitServerSpec
- git_hook.rs reconcile(): now creates a PVC ({name}-data) before the
  Deployment, mounts it at /data, and sets APP_REPOS_ROOT=/data/repos
- git-hook-crd.yaml: synced storageSize field into the CRD schema
2026-04-17 14:15:38 +08:00
ZhenYi
e50c31daa6 fix(deploy): mount /data PVC into git-hook container
The git-hook worker needs /data to access repo storage paths, but the
deployment defined the shared-data PVC volume at pod level without
attaching it to the container via volumeMounts. Added the missing
volumeMounts block (name: shared-data, mountPath: /data) so the PVC is
properly mounted into the git-hook container, consistent with app and
static deployments.
2026-04-17 14:06:24 +08:00
ZhenYi
7e42139989 feat(frontend): embed SPA assets into app binary at compile time
Some checks are pending
CI / Rust Lint & Check (push) Waiting to run
CI / Rust Tests (push) Waiting to run
CI / Frontend Lint & Type Check (push) Waiting to run
CI / Frontend Build (push) Blocked by required conditions
- Add libs/frontend crate: build.rs runs pnpm build, copies dist/ to
  OUT_DIR/dist_blobs/, generates frontend.rs with lazy_static! map
- libs/api/dist.rs serves embedded assets via serve_frontend handler
- Register /{path:.*} SPA fallback in route.rs (after /api/*)
- Remove frontend container from deploy: docker/frontend.Dockerfile,
  deploy/templates/frontend-*.yaml, values.yaml frontend section
- Update ingress: gitdata.ai root now routes to app service
- Update scripts: build.js removes frontend step, deploy.js removes frontend
2026-04-17 12:04:34 +08:00
ZhenYi
5482283727 feat(seo): add useHead to all landing pages with Command as Service titles and descriptions
Some checks are pending
CI / Frontend Build (push) Blocked by required conditions
CI / Frontend Lint & Type Check (push) Waiting to run
CI / Rust Lint & Check (push) Waiting to run
CI / Rust Tests (push) Waiting to run
2026-04-16 19:12:06 +08:00
ZhenYi
431f40063f fix(ws): allow APP_DOMAIN_URL and APP_STATIC_DOMAIN origins
Some checks are pending
CI / Rust Lint & Check (push) Waiting to run
CI / Rust Tests (push) Waiting to run
CI / Frontend Lint & Type Check (push) Waiting to run
CI / Frontend Build (push) Blocked by required conditions
validate_origin() only allowed localhost origins by default, causing
production WebSocket connections to be rejected. Now it reads
APP_DOMAIN_URL and APP_STATIC_DOMAIN from env and automatically
adds their http/https/ws/wss variants to the allowed origins list.

Also add APP_DOMAIN_URL to the production configmap.
2026-04-16 18:51:52 +08:00
ZhenYi
fa091b9d22 deploy(k8s): add CRD definitions for operator 2026-04-16 17:24:25 +08:00
ZhenYi
f5084974b3 fix(k8s): add APP_SESSION_SECRET to ConfigMap to fix captcha errors with multi-pod
Without a shared cookie signing key, each pod generates a random key on
startup. Requests that hit different pods fail session validation, causing
CaptchaError when the captcha and login requests route to different pods.
2026-04-15 23:31:11 +08:00
ZhenYi
b6022e824d feat(k8s): enforce minimum 2 replicas for all services except email-worker 2026-04-15 23:08:25 +08:00
ZhenYi
e7cf0c544f fix(k8s): remove all health probes from gitserver 2026-04-15 23:07:17 +08:00
ZhenYi
dd4bbf3bb5 fix(k8s): add startupProbe to gitserver deployment template 2026-04-15 23:04:17 +08:00
ZhenYi
451e55596a fix(k8s): protect PVCs from deletion on helm uninstall 2026-04-15 23:00:55 +08:00
ZhenYi
c033cc3ff8 fix(k8s): add procps to worker images and fix probe commands
- Add procps to git-hook and email-worker Dockerfiles (provides pgrep)
- Change all exec probes from pgrep to kill -0 1 (more reliable, bash built-in)
- Add startupProbe to gitserver with 30 failure threshold (5min max startup time)
- Increase gitserver liveness initialDelay to 30s for slower SSH init
2026-04-15 22:13:16 +08:00
ZhenYi
0ce70eca7f fix(deploy): bind app to 0.0.0.0 for K8s Service connectivity 2026-04-15 14:26:42 +08:00
ZhenYi
d307c13878 fix(deploy): route /api and /ws to app, frontend as default on gitdata.ai 2026-04-15 14:19:25 +08:00
ZhenYi
6c3f5b49f8 feat(deploy): single unified Ingress with per-host routing
Replace multiple conflicting Ingress resources with one that routes:
- gitdata.ai         → frontend (port 80)
- api.gitdata.ai     → app (port 8080)
- git.gitdata.ai     → gitserver-http (port 8022)
- static.gitdata.ai  → static (port 8081)

Disable service-level ingress configs in values.yaml (they would
conflict on the same host/path). Single TLS secret covers all hosts.
2026-04-15 14:17:03 +08:00
ZhenYi
b9a9acbc75 fix(deploy): correct ConfigMap name from -secrets to -config
All deployments referenced the wrong ConfigMap name.
ConfigMap is named gitdata-config in the cluster.
2026-04-15 14:11:40 +08:00
ZhenYi
d4c9759a7e revert: use configMapRef instead of secretRef
ConfigMap is managed externally and contains all required env vars.
Keeping envFrom + configMapRef injection pattern.
2026-04-15 14:09:48 +08:00
ZhenYi
b7b56938aa fix(deploy): use secretRef instead of configMapRef in all deployments
ConfigMap was removed from Helm chart (managed externally).
Secret contains all required env vars (APP_DATABASE_URL, APP_REDIS_URL,
SMTP credentials, SSH domain/key, etc.) and is always rendered
(helm.sh/resource-policy: keep prevents deletion on upgrade).
Also add APP_SSH_PORT and APP_SSH_SERVER_PRIVATE_KEY to secret template.
2026-04-15 14:06:40 +08:00
ZhenYi
ad2cb7255f refactor(deploy): use envFrom with configMapRef for all deployments
Replace individual configMapKeyRef env vars with envFrom + configMapRef
to inject the full ConfigMap, plus minimal env blocks for static values.
Also remove resources block from operator deployment (K8s schema validation).
2026-04-15 13:48:13 +08:00
ZhenYi
b37425d3c7 fix: add SMTP env vars to email-worker deployment 2026-04-15 13:24:45 +08:00
ZhenYi
3bc381da45 feat: add SMTP env vars to app deployment 2026-04-15 11:51:29 +08:00
ZhenYi
2f31e7268b feat: add imagePullSecrets support to all deployment templates 2026-04-15 10:59:15 +08:00
ZhenYi
33c5501fc9 fix: remove resources from deployment templates to fix K8s schema validation 2026-04-15 10:56:22 +08:00
ZhenYi
53c63dad19 fix: simplify static deployment template, fix YAML parse error 2026-04-15 10:47:25 +08:00
ZhenYi
9112cb2dff fix: use scratch variable for port binding in static template 2026-04-15 10:42:53 +08:00
ZhenYi
9776c923c0 fix: use %v instead of %s+print for port in static template 2026-04-15 10:40:02 +08:00
ZhenYi
0ebbc6e29e fix: remove limits from resources to fix K8s schema validation
The cluster's API server rejects the limits field in Deployment containers.
Keep only requests (CPU/memory) for now.
2026-04-15 10:38:47 +08:00
ZhenYi
650243510c refactor: remove namespace and configmap from Helm chart
Namespace and ConfigMap are managed manually (not by Helm).
2026-04-15 10:30:02 +08:00