orchard

Author	SHA1	Message	Date
Mondo Diaz	8c6ba01a73	deps: add redis-py for caching layer	2026-02-04 09:11:12 -06:00
Mondo Diaz	196f3f957c	docs: add detailed implementation plan for PyPI proxy performance	2026-02-04 09:05:18 -06:00
Mondo Diaz	9cadfa3b1b	Add PyPI proxy performance & multi-protocol architecture design Comprehensive design for: - HTTP connection pooling with lifecycle management - Redis caching layer (TTL for discovery, permanent for immutable) - Abstract PackageProxyBase for multi-protocol support (npm, Maven) - Database query optimization with batch operations - Dependency resolution caching for ensure files - Observability via health endpoints Maintains hermetic build guarantees: artifact content and extracted metadata are immutable, only discovery data uses TTL-based caching.	2026-02-04 08:56:40 -06:00
Mondo Diaz	19e034ef56	Fix duplicate dependency extraction from PyPI wheel METADATA Wheel METADATA files can list the same dependency multiple times under different extras (e.g., bokeh appears under [docs] and [bokeh-tests]). This caused unique constraint violations when storing dependencies. Fix by deduplicating extracted deps before DB insertion.	2026-02-03 17:43:38 -06:00
Mondo Diaz	45a48cc1ee	Add inline migration for tag removal (024_remove_tags) Adds the tag removal migration to the inline migrations in database.py: - Drops tag-related triggers and functions - Removes tag_constraint column from artifact_dependencies - Makes version_constraint NOT NULL - Drops tags and tag_history tables - Renames uploads.tag_name to version	2026-02-03 17:22:40 -06:00
Mondo Diaz	7068f36cb5	Restore dependency extraction from PyPI packages Re-adds the dependency extraction that was accidentally removed with the proactive caching feature. Now when a PyPI package is cached: 1. Extract METADATA from wheel or PKG-INFO from sdist 2. Parse Requires-Dist lines for dependencies 3. Store in artifact_dependencies table This restores the dependency graph functionality for PyPI packages.	2026-02-03 17:18:54 -06:00
Mondo Diaz	e471202f2e	Fix SQLAlchemy subquery warning in artifact listing	2026-02-03 17:10:34 -06:00
Mondo Diaz	d12e4cdfc5	Add configurable PyPI download mode (redirect vs proxy) Adds ORCHARD_PYPI_DOWNLOAD_MODE setting (default: "redirect"): - "redirect": Redirect pip to S3 presigned URL - reduces pod bandwidth - "proxy": Stream through Orchard pod - for environments where clients can't reach S3 In redirect mode, Orchard only handles metadata requests and upstream fetches. All file transfers go directly from S3 to the client.	2026-02-03 17:09:05 -06:00
Mondo Diaz	1ffe17bf62	Fix artifact listing to include PyPI proxy cached packages The list_package_artifacts endpoint was only querying artifacts via the Upload table. PyPI proxy creates PackageVersion records but not Upload records, so cached packages would show stats (size, version count) but no artifacts in the listing. Now queries artifacts from both Upload and PackageVersion tables using a union, so PyPI-cached packages display their artifacts correctly.	2026-02-03 16:46:35 -06:00
Mondo Diaz	c21af708af	Fix PyPI proxy timeout by streaming from S3 instead of loading into memory Large packages like TensorFlow (~600MB) caused read timeouts because the entire file was loaded into memory before responding to the client. Now the file is stored to S3 first, then streamed back using StreamingResponse.	2026-02-03 16:42:30 -06:00
Mondo Diaz	1ae989249b	Fix PackageArtifactResponse missing sha256 and version fields - Add sha256 field to list_package_artifacts response (artifact ID is SHA256) - Add version field to PackageArtifactResponse schema - Add version field to frontend PackageArtifact type - Update getArtifactVersion to prefer direct version field	2026-02-03 16:24:31 -06:00
Mondo Diaz	c0c8603d05	Fix migrations 008 and 011 to handle removed tags table	2026-02-03 16:05:29 -06:00
Mondo Diaz	2501ba21d4	Fix migration 005 to not create indexes on removed tags table	2026-02-03 16:01:09 -06:00
Mondo Diaz	c94fe0389b	Fix tests for tag removal and version behavior - Fix upload response to return actual version (not requested version) when artifact already has a version in the package - Update ref_count tests to use multiple packages (one version per artifact per package design constraint) - Remove allow_public_internet references from upstream caching tests - Update consistency check test to not assert global system health - Add versions field to artifact schemas - Fix dependencies resolution to handle removed tag constraint	2026-02-03 15:35:44 -06:00
Mondo Diaz	9a95421064	Fix remaining tag references in tests - Update CacheRequest test to use version field - Fix upload_test_file calls that still used tag parameter - Update artifact history test to check versions instead of tags - Update artifact stats tests to check versions instead of tags - Fix garbage collection tests to delete versions instead of tags - Remove TestGlobalTags class (endpoint removed) - Update project/package stats tests to check version_count - Fix upload_test_file fixture in test_download_verification	2026-02-03 12:51:31 -06:00
Mondo Diaz	87f30ea898	Update tests for tag removal - Remove Tag/TagHistory model tests from unit tests - Update CacheSettings tests to remove allow_public_internet field - Replace tag= with version= in upload_test_file calls - Update test assertions to use versions instead of tags - Remove tests for tag: prefix downloads (now uses version:) - Update dependency tests for version-only schema	2026-02-03 12:45:44 -06:00
Mondo Diaz	106e30b533	Remove obsolete tag support test from DragDropUpload The tag functionality was removed in the previous commit, so this test that expected a 'tag' field in the upload FormData is no longer valid.	2026-02-03 12:32:11 -06:00
Mondo Diaz	c4c9c20763	Remove tag system, use versions only for artifact references Tags were mutable aliases that caused confusion alongside the immutable version system. This removes tags entirely, keeping only PackageVersion for artifact references. Changes: - Remove tags and tag_history tables (migration 012) - Remove Tag model, TagRepository, and 6 tag API endpoints - Update cache system to create versions instead of tags - Update frontend to display versions instead of tags - Remove tag-related schemas and types - Update artifact cleanup service for version-based ref_count	2026-02-03 12:18:19 -06:00
Mondo Diaz	62c709e368	Remove superuser-only session_replication_role from factory reset	2026-02-03 11:19:50 -06:00
Mondo Diaz	b6fb9e7546	Use same variable pattern as integration tests for reset job	2026-02-03 11:05:04 -06:00
Mondo Diaz	9db94d035d	Add shell-level debug for password variable	2026-02-03 11:01:01 -06:00
Mondo Diaz	6d9cd9d45d	Add debug to detect hidden characters in password	2026-02-03 10:59:00 -06:00
Mondo Diaz	f5b60468ce	Fix invalid sort field error on package artifact listing The artifacts endpoint only supports sorting by: created_at, size, original_name But the frontend was defaulting to 'name' (from the old tags endpoint). - Change default sort from 'name' to 'created_at' - Change default order from 'asc' to 'desc' (newest first) - Remove sortable flag from version/tags columns (not DB fields) - Add sortable flag to original_name and size columns	2026-02-03 10:55:00 -06:00
Mondo Diaz	f7643a5c13	Add debug output to reset_feature job for auth troubleshooting	2026-02-03 10:25:36 -06:00
Mondo Diaz	281474d72f	Fix self-dependency detection to strip PyPI extras brackets The circular dependency error '_pypi/psutil → _pypi/psutil' occurred because dependencies with extras like 'psutil[test]' weren't being recognized as self-dependencies. The comparison 'psutil[test] != psutil' failed. - Add _normalize_pypi_package_name() helper that strips extras brackets and normalizes separators per PEP 503 - Update _detect_package_cycle to use normalized names for cycle detection - Update check_circular_dependencies to use normalized initial path - Simplify self-dependency check in resolve_dependencies to use helper	2026-02-03 10:17:13 -06:00
Mondo Diaz	bb7c30b15c	Fix circular dependency resolution by switching to artifact-centric display - Add artifact: prefix handling in resolve_dependencies for direct artifact ID references, enabling dependency resolution for tagless artifacts - Refactor PackagePage from tag-based to artifact-based data display - Add PackageArtifact type with tags array for artifact-centric API responses - Update download URLs to use artifact:ID prefix when no tags exist - Conditionally show "View Ensure File" only when artifact has tags	2026-02-03 10:00:15 -06:00
Mondo Diaz	9587ed8f17	Fix progress bar CSS scoping conflict between upload and dashboard	2026-02-03 08:29:03 -06:00
Mondo Diaz	e86d974339	Add reset job after integration tests on feature branches	2026-02-03 08:24:22 -06:00
Mondo Diaz	bf2737b3a2	Fix self-dependency check to use case-insensitive PyPI name normalization	2026-02-03 08:23:39 -06:00
Mondo Diaz	17d3004058	Pass upstream policy errors through PyPI proxy to users - Add _parse_upstream_error() to extract policy messages from JFrog/Artifactory - Pass through 403 and other 4xx errors with detailed messages - Pin babel and electron-to-chromium to older versions for CI compatibility	2026-02-03 08:09:08 -06:00
Mondo Diaz	549c85900e	Pin lodash to 4.17.21 to avoid immature package policy block	2026-02-03 08:02:37 -06:00
Mondo Diaz	c60ed9ab21	Move Dashboard and Teams from navbar to user dropdown menu Cleaner navbar with just Projects and Docs links. Dashboard and Teams are now in the user menu dropdown.	2026-02-02 20:44:04 -06:00
Mondo Diaz	34ff9caa08	Fix circular dependency error message to show actual cycle path The error was hardcoding [pkg_key, pkg_key] regardless of actual cycle. Now tracks the path through dependencies to report the real cycle.	2026-02-02 20:43:05 -06:00
Mondo Diaz	ac3477ff22	Replace custom dependency graph with React Flow - Install reactflow and dagre for professional graph visualization - Use dagre for automatic tree layout (top-to-bottom) - Custom styled nodes with package name, version, and size - Built-in zoom/pan controls and minimap - Click nodes to navigate to package page - Cleaner, more professional appearance	2026-02-02 20:38:35 -06:00
Mondo Diaz	f87e5b4a51	Improve dependency UI: rename to DependGraph, hide empty Used By - Rename "Dependency Graph" modal title to "DependGraph" - Hide "Used By" section when no packages depend on this package	2026-02-02 20:34:32 -06:00
Mondo Diaz	01915bcb45	Fix circular dependency detection and hide empty graph modal - Add artifact-level self-dependency check (skip if dep resolves to same artifact) - Close dependency graph modal if package has no dependencies to show (only root package with no children and no missing deps)	2026-02-02 20:31:46 -06:00
Mondo Diaz	72952d84a1	Skip self-dependencies in dependency resolver PyPI packages can have self-referential dependencies for extras (e.g., pytest[testing] depends on pytest). These were incorrectly detected as circular dependencies. Now we skip them.	2026-02-02 19:45:34 -06:00
Mondo Diaz	e6d42d91cd	Fix [object Object] error when API returns structured error detail The backend returns detail as an object for some errors (circular dependency, conflicts, etc.). The API client now JSON.stringifies object details so they can be properly parsed by error handlers like DependencyGraph.	2026-02-02 18:33:55 -06:00
Mondo Diaz	b3ae3b03eb	Show missing dependencies in dependency graph instead of failing When dependencies are not cached on the server (common since we removed proactive caching), the dependency graph now: - Continues resolving what it can find - Shows missing dependencies in a separate section with amber styling - Displays the constraint and which package required them - Updates the header stats to show "X cached • Y not cached" This provides a better user experience than showing an error when some dependencies haven't been downloaded yet.	2026-02-02 16:29:37 -06:00
Mondo Diaz	ba0a658611	Fix dependency graph error for invalid version constraints When a dependency has an invalid version constraint like '>=' (without a version number), the resolver now treats it as a wildcard and returns the latest available version instead of failing with 'Dependency not found'. This handles malformed metadata that may have been stored from PyPI packages.	2026-02-02 16:26:18 -06:00
Mondo Diaz	081cc6df83	Remove proactive PyPI dependency caching feature The background task queue for proactively caching package dependencies was causing server instability and unnecessary growth. The PyPI proxy now only caches packages on-demand when users request them. Removed: - PyPI cache worker (background task queue and worker pool) - PyPICacheTask model and related database schema - Cache management API endpoints (/pypi/cache/*) - Background Jobs admin dashboard - Dependency extraction and queueing logic Kept: - On-demand package caching (still works when users request packages) - Async httpx for non-blocking downloads (prevents health check failures) - URL-based cache lookups for deduplication	2026-02-02 16:17:33 -06:00
Mondo Diaz	cf7bdccb3a	Center text in jobs table columns	2026-02-02 15:30:46 -06:00
Mondo Diaz	1329d380a4	Convert PyPI proxy from sync to async httpx to prevent event loop blocking The pypi_download_file, pypi_simple_index, and pypi_package_versions endpoints were using synchronous httpx.Client inside async functions. When upstream PyPI servers respond slowly, this blocked the entire FastAPI event loop, preventing health checks from responding. Kubernetes would then kill the pod after the liveness probe timed out. Changes: - httpx.Client → httpx.AsyncClient - client.get() → await client.get() - response.iter_bytes() → response.aiter_bytes() This ensures the event loop remains responsive during slow upstream downloads, allowing health checks to succeed even when downloads take 20+ seconds.	2026-02-02 15:26:24 -06:00
Mondo Diaz	361210a2bc	Add cancel job button and improve jobs table UI - Remove "All Jobs" title - Move Status column to front of table - Add Cancel button for in-progress jobs - Add cancel endpoint: POST /pypi/cache/cancel/{package_name} - Add btn-danger CSS styling	2026-02-02 15:18:59 -06:00
Mondo Diaz	415ad9a29a	Stream downloads to temp file to reduce memory usage - Download packages in 64KB chunks to temp file instead of loading into memory - Upload to S3 from temp file (streaming) - Clean up temp file after processing - Reduces memory footprint from 2x file size to 1x file size	2026-02-02 15:10:25 -06:00
Mondo Diaz	1667c5a416	Increase memory to 1Gi and reduce workers to 1 for stability	2026-02-02 15:08:00 -06:00
Mondo Diaz	1021e2b942	Add PyPI cache config and bump memory in values-prod.yaml	2026-02-02 14:38:47 -06:00
Mondo Diaz	d0e91658d7	Add PyPI cache config and bump memory in values-stage.yaml	2026-02-02 14:38:21 -06:00
Mondo Diaz	7b89f41704	Add PyPI cache config and bump memory in values-dev.yaml	2026-02-02 14:37:55 -06:00
Mondo Diaz	ba43110123	Add PyPI cache worker config and increase memory limit - Add orchard.pypiCache config section to helm values - Set default workers to 2 (reduced from 5 to limit memory) - Bump pod memory from 512Mi to 768Mi (request=limit) - Add ORCHARD_PYPI_CACHE_* env vars to deployment template	2026-02-02 14:37:27 -06:00

1 2 3 4 5

241 Commits