momentry_core

Author	SHA1	Message	Date
Accusys	5fcd5212d5	fix: FilesView merge logic and computed property Bug: When regFiles API fails, all files get 'unregistered' status even if they are actually registered. Also, computed property was using reference to files.value instead of a copy, which could cause mutation issues. Fix: - Fetch scan results FIRST (source of truth for files on disk) - Use scan API's is_registered field as fallback status - Only override with regFiles data if file exists in scan results - Computed property now uses [...files.value] to create a copy - Skip files from regFiles that don't exist on disk (deleted)	2026-07-04 23:42:42 +08:00
Accusys	53f28ac458	fix: FilesView filter logic - remove 'registered_scan' status Bug: Scan files were getting status 'registered_scan' which doesn't match any filter value (unregistered/pending/processing/completed/indexed/unindexed). When toggling filters on/off, files would disappear because their status didn't match any valid filter. Fix: - Removed 'registered_scan' status entirely - Fetch regFiles FIRST to get real statuses - Scan files default to 'unregistered' status - regFiles overlay with actual status (pending/processing/completed) - Increased regFiles page_size to 200 for larger libraries	2026-07-04 22:48:23 +08:00
Accusys	7fc4dcbddb	feat: add media type and indexed status filters to FilesView Frontend: - Add media type filter (全部/影片/照片) - Add indexed status filter (未入庫/已入庫) - Show media type column with icons - Fix status filter to handle indexed/unindexed correctly - Determine media type from file extension Backend: - Add total_chunks field to FileItem API response - Query chunk counts efficiently in batch with IN clause - Frontend uses total_chunks to determine is_indexed status	2026-07-04 22:41:51 +08:00
Accusys	96e13e40cb	fix: all_completed now checks ALL expected processors have results Bug: all_completed only checked existing results, not missing processors. If a processor (like pose) never created a result row, all_completed would still return true and mark the job as completed. Fix: all_completed now checks that every processor in job_processors has a corresponding completed result. Added logging for missing processors. Also fixed: - any_pending now checks all expected processors, not just existing results - Added missing_processors detection and logging	2026-07-04 22:09:38 +08:00
Accusys	4e8c0ea5b9	fix: skip empty ASRX segments in Rule 1, fix chunk_id numbering - Skip chunks where both ASRX text and OCR text are empty - Use count-based chunk_id instead of index to avoid gaps - This ensures PostgreSQL and Qdrant chunk counts match	2026-07-04 12:41:40 +08:00
Accusys	e4d6fbac50	feat: add search_by_appearance agent tool for clothing color search - New Python script: clothing_color_search.py - New agent tool: search_by_appearance (red, blue, green, etc.) - Uses appearance.json person bboxes + HSV color analysis - Returns matched frames with confidence scores	2026-07-02 22:22:07 +08:00
Accusys	78364afc51	fix: keyword search - add text_content field and CJK support - Added text_content field to SearchResult and SemanticSearchResult - Added get_chunk_by_id_no_embedding for keyword results without embedding requirement - Fixed search_bm25 to use position-based ranking for CJK/Korean content - Fixed sqlx column mapping with explicit alias - Skip text_match filter for keyword-only results - Use text_content as fallback when summary is empty	2026-07-02 21:16:38 +08:00
Accusys	5a9d4325d8	fix: face thumbnail crop using bbox parameters - Added bbox_x, bbox_y, bbox_w, bbox_h fields to ThumbQuery - face_thumbnail now uses bbox params for ffmpeg crop filter - Frontend passes bboxX/Y/Width/Height which maps to bbox_x/y/w/h	2026-07-02 18:31:54 +08:00
Accusys	3035c6db5d	fix: add /api/v1/face-thumbnail route for People view thumbnails - Frontend calls /api/v1/face-thumbnail?uuid=...&frame=... - Backend only had /api/v1/file/:file_uuid/thumbnail - Added compat route and uuid field to ThumbQuery	2026-07-02 18:13:38 +08:00
Accusys	28a4e9b1b8	fix: worker/processor.rs ASRX 使用正確的 start_frame - 使用 segment.start_frame 取代 i (sequential index) - data JSON 加入 start_frame, end_frame	2026-07-02 17:10:24 +08:00
Accusys	3943075a9b	fix: ASRX pre_chunks 使用正確的 start_frame - pipeline/mod.rs: 使用 segment.start_frame 取代 i (sequential index) - data JSON 加入 end_time, start_frame, end_frame 供 rule1_ingest 使用 - 確保 ASRX pre_chunks 有正確的 frame 資訊	2026-07-02 17:08:50 +08:00
Accusys	e2b3858b67	fix: job never completes - processor_results.file_uuid is NULL - ingestion_complete query used file_uuid column which is always NULL - Changed to JOIN processor_results with monitor_jobs on job_id - All stuck jobs now complete successfully	2026-07-02 16:42:24 +08:00
Accusys	bd6d108ade	fix: remove trace_chunks from API + fix OCR frame calculation - Removed trace_chunks field from PostgresStats struct - Removed trace_chunks query from get_file_stats and get_ingestion_status - Fixed OCR fetch_ocr_texts to compute frames from start_time*FPS - Updated scan.rs to use separate count_nodes/count_edges functions	2026-07-02 16:23:25 +08:00
Accusys	d4c26deae2	fix: pipeline progress computed from DB state instead of Redis - get_pipeline_progress_handler now queries actual DB counts - Fixed processor_results query (requires JOIN with monitor_jobs) - Card progress bar and right-click content now consistent	2026-07-02 15:11:25 +08:00
Accusys	619b056ada	fix: TKG stats API returning 0 - count_by_type used wrong column - tkg_nodes has no edge_type column, query was failing silently - Split into count_nodes(node_type) and count_edges(edge_type) - Fixed text_region → text_trace node type name - Also: OCR frame fix in rule1 (end_frame computed from end_time+FPS)	2026-07-02 14:53:47 +08:00
Accusys	6507766ea2	fix: Qdrant collection name + PipelineProgress accumulation - scan.rs: rule1 collection 'momentry_public_rule1_v2' → 'momentry_rule1' - progress.rs: publish_pipeline_progress now reads existing progress and merges stages	2026-07-02 13:44:45 +08:00
Accusys	64f29d614b	fix: Rule 1/TKG trigger conditions + essential_failed guard - Rule 1 trigger: has_asr_or_asrx → has_asrx (wait for ASRX pre_chunks) - P3/P4 triggers: has_asr_or_asrx → has_asrx (need ASRX data) - Add essential_failed check: job only fails if essential processor fails - P2/P3/P4 triggers: all_completed → has_face && has_asrx - Add publish_pipeline_progress calls at each pipeline stage	2026-07-02 13:31:38 +08:00
Accusys	3eabd45882	fix: ASRX duplication, TKG edges, trace ingest, and add pipeline progress publishing - ASRX handler no longer stores duplicate 'asr' pre_chunks - Pre_chunks storage made idempotent (delete-before-insert) - Rule 1 + trace_ingest changed to query 'asrx' not 'asr' - Trace chunks removed (dynamic from TKG/Qdrant) - TKG scroll_face_points fixed: trace_id >= 1 (not == 1) - TKG AsrxSegmentEntry: start/end -> start_time/end_time (match ASRX JSON) - Unregister error handling: log instead of silent discard - Add publish_pipeline_progress calls at each pipeline stage (processors, rule1, face_trace, identity_agent, TKG, rule2, completion)	2026-07-02 10:43:46 +08:00
Accusys	d791d138f2	fix: API endpoints for file_uuid filtering of pending identities - get_file_identities: UNION face_detections + file_identities - list_identities: add file_bindings from file_identities table - Add back /api/v1/traces/unassigned route - Total count query now includes file_identities Frontend can now: - Filter pending identities by file_uuid - Filter pending faces (unassigned traces) by file_uuid	2026-06-26 14:26:36 +08:00
Accusys	bd7d8c77bf	feat: add migrate_manual_file_identities.py Migrate identities.file_uuid to file_identities table for consistent structure	2026-06-26 13:55:10 +08:00
Accusys	6f1a560d06	fix: add script_dir() method to PythonExecutor	2026-06-26 13:46:23 +08:00
Accusys	67caf09732	feat: tmdb_agent now inserts identities and file_identities to DB - tmdb_agent.py: INSERT identities with status='pending' - tmdb_agent.py: INSERT file_identities (file_uuid → identity_id) - identity.json: file_bindings includes file_uuid, movie_id, character - backfill_file_identities.py: migrate existing TMDb identities - Tested: 27 Charade cast identities linked to file	2026-06-26 13:39:08 +08:00
Accusys	6cbc11efda	feat: add confirm_identity API endpoint - Add POST /api/v1/agents/identity/confirm endpoint - Calls confirm_identity.py to bind trace to identity - Updates TKG, Qdrant _faces, PG face_detections, _seeds - Optional Round 2 propagation after confirmation - Fix trace_id=0 check in confirm_identity.py (use 'is not None') - Document API endpoint in 08_identity_agent.md	2026-06-26 08:30:03 +08:00
Accusys	615f9da2df	fix: identity status - TMDb and user_defined identities start as 'pending' (確認制)	2026-06-26 02:16:46 +08:00
Accusys	a2f2b7918a	fix: add trace_id and status to face_track nodes, force update properties on rebuild	2026-06-26 00:19:00 +08:00
Accusys	0c3f385b1f	remove: skin_tone_trace node type - skin_tone is a person attribute (like height), not trace attribute - Remove build_skin_tone_trace_nodes function - Remove skin_tone_trace_nodes from TkgResult and API response - Remove skin_tone_trace from documentation tables	2026-06-25 19:21:05 +08:00
Accusys	fd2edd5736	fix: TKG rebuild type mismatch and face_track nodes - Fix trace_id type mismatch (INT4 vs i64) with explicit ::bigint cast - Change build_face_track_nodes to use from_pg version - Add skin_tone_trace_nodes to API response - Add #[derive(Serialize)] to TkgResult - Fix Unicode panic in text label truncation - Add push_existing_embeddings.py script	2026-06-25 11:23:53 +08:00
Accusys	ecb0e9c7d0	feat: add /api/v1/health public endpoint - Add public health routes at /api/v1/health, /api/v1/health/detailed, /api/v1/health/consistency - Make health functions and response types public - Public routes bypass auth middleware (unlike protected /api/v1/* routes)	2026-06-25 10:05:33 +08:00
Accusys	4273576612	feat: implement skin_tone_trace node builder and standardize TKG node naming - Add build_skin_tone_trace_nodes() to tkg.rs (Fitzpatrick I-VI classification) - Add skin_tone_trace_nodes field to TkgResult - Standardize node naming: _trace -> _track (text uses _region) - Add external_id format column to Node Types table - Add storage names to Edge Types table - Create TKG_FORMATION_V1.0.md with Phase 0-4 definition, flow diagram, queries - Add cross-reference from identity_agent_v4.0.md to TKG Formation - Update Python scripts to executable mode	2026-06-25 03:09:16 +08:00
Accusys	406b2d5524	docs: update TKG documentation for Identity Agent V4.0 - Add new file: 2026-06-25_identity_agent_v4.0.md (M4 workspace) - Complete architecture overview - All phases completed - Thresholds, components, test results - Update: API_WORKSPACE/modules/15_tkg.md - Correct node type: face_trace → face_track - Add text_region (replaces text_trace) - Add Identity Agent integration section - face_track status values (pending/suggested/confirmed/stranger) - Example face_track node with identity properties	2026-06-25 02:27:34 +08:00
Accusys	4b4d37b332	fix: qdrant_request empty body handling (use 'is not None' check) Fix qdrant_request() to properly handle empty dict {} as body. Python's 'if body' evaluates to False for empty dict, causing EOF error. Changed: - data = json.dumps(body).encode() if body is not None else None Also cleaned up count_seeds() to use consistent body passing.	2026-06-25 02:19:07 +08:00
Accusys	b19b1a8c46	fix: count_seeds empty body handling Fix count_seeds() to always pass valid JSON body to Qdrant count API. Empty dict {} was causing EOF error when no source filter provided.	2026-06-25 02:02:17 +08:00
Accusys	d20819b03b	feat: add manual_seed.py for user-selected face trace seed creation Implements: - create_identity(): Create PG identity (source='manual') - create_manual_seed(): Full flow from trace → seed → confirm - Get trace centroid embedding from Qdrant _faces - Create identity in PG - Push to Qdrant _seeds - Confirm trace binding (TKG + Qdrant + PG) - Auto-trigger Round 2 propagation - list_pending_traces(): List traces for user selection - run_propagation(): Auto propagation trigger Usage: # List pending traces python manual_seed.py --file-uuid <uuid> --list # Create seed from trace python manual_seed.py --file-uuid <uuid> --trace-id 1 --name 'John Doe' # Custom UUID python manual_seed.py --file-uuid <uuid> --trace-id 1 --name 'John Doe' --identity-uuid xxx # No propagation python manual_seed.py --file-uuid <uuid> --trace-id 1 --name 'John Doe' --no-propagate Flow: select trace → label → create identity → push seed → auto-bind → propagate	2026-06-25 01:49:53 +08:00
Accusys	b5e3adf5de	feat: add generate_seed_embeddings.py for TMDb profile extraction Implements: - get_tmdb_identities(): Query PG for TMDb identities with profile photos - download_tmdb_image(): Download profile image from TMDb (handles full URL or path) - extract_face_embedding(): CoreML FaceNet 512D embedding extraction - generate_seed_embeddings(): Full flow: download → extract → push to _seeds TMDb image handling: - Supports both full URL (https://...) and path (/xxx.jpg) - Uses 'original' size for better quality (replaces /w185) Usage: python generate_seed_embeddings.py # All TMDb identities python generate_seed_embeddings.py --limit 10 # Limit to 10 python generate_seed_embeddings.py --dry-run # Don't push to Qdrant Tested: 3 seeds successfully pushed (Cary Grant, Audrey Hepburn, Walter Matthau)	2026-06-25 01:45:48 +08:00
Accusys	4198a74002	feat: add confirm_identity.py for identity binding confirmation Implements: - confirm_single_trace(): Confirm identity binding for one trace - Update TKG face_track node: status='confirmed' - Update Qdrant _faces: identity_uuid for all points - Update PG face_detections: identity_id - Add trace centroid to _seeds (source='propagation') - Auto-trigger Round 2 matching - batch_confirm_from_json(): Batch confirm from suggestions file - Confirm multiple suggestions from identity_matcher output - Final propagation after all confirmations - run_round_2_propagation(): Auto propagation trigger - Get confirmed traces from TKG nodes - Build identity_map for propagation - Run identity_matcher.py Round 2 Usage: python confirm_identity.py --file-uuid <uuid> --trace-id 1 --identity-id 1 --identity-uuid xxx --name 'Tom Hanks' python confirm_identity.py --file-uuid <uuid> --json suggestions.json python confirm_identity.py --file-uuid <uuid> --json suggestions.json --no-propagate	2026-06-25 01:38:00 +08:00
Accusys	21b9f500d9	feat: add TKG node marking for Identity Agent suggestions TKG Helper (scripts/utils/tkg_helper.py): - mark_face_track_suggested(): Mark node as 'suggested' with pending identity info - mark_face_track_confirmed(): Mark node as 'confirmed' with identity_ref - mark_face_track_stranger(): Mark node as 'stranger' with stranger_ref - batch_mark_suggestions(): Batch mark multiple traces - batch_mark_strangers(): Batch mark stranger clusters - get_face_track_nodes(): Get all face_track nodes for a file - get_pending_face_tracks(): Get nodes with status='pending' - get_suggested_face_tracks(): Get nodes with status='suggested' Identity Matcher updates: - Add --mark-tkg flag to update TKG nodes after matching - Integrates with tkg_helper for batch operations Node properties schema: - status: pending \| suggested \| confirmed \| stranger - pending_identity_name/uuid/id: suggested identity info - suggested_by: tmdb \| propagation \| manual - confidence: matching score - identity_ref: confirmed identity reference	2026-06-25 01:11:05 +08:00
Accusys	6851cb4734	feat: add identity_matcher.py for multi-angle face matching Implements: - match_faces_round_1: TMDb seeds → traces (TH=0.55) - match_faces_round_2: Confirmed traces → pending (TH=0.55) - match_faces_round_3_plus: Propagation (TH=0.50) - cluster_strangers: Greedy merge unmatched traces (TH=0.40) - multi_angle_match: max(cosine(seed, rep)) across 3 representatives - cosine_similarity: Vector similarity calculation Usage: python identity_matcher.py --file-uuid <uuid> --round 1 python identity_matcher.py --file-uuid <uuid> --round 2 --confirmed-traces 1,2,3 python identity_matcher.py --file-uuid <uuid> --round 1 --stranger Output: JSON with suggestions {trace_id: {identity_id, uuid, name, score, suggested_by}}	2026-06-25 00:57:22 +08:00
Accusys	580c4b4017	feat: add _seeds collection helper functions for Identity Agent - Add ensure_seeds_collection(): create _seeds collection (512D, Cosine) - Add push_seed_embedding(): push identity seed with payload {identity_id, uuid, name, source, file_uuid, trace_id, tmdb_id} - Add get_seeds(): get all seeds (optional source filter) - Add search_seeds(): cosine search against seeds - Add delete_seed(): delete seed by identity_id - Add count_seeds(): count seeds (optional source filter) - Add get_trace_representatives(): get 3 representatives per trace for multi-angle matching - Add get_trace_centroid(): get centroid embedding for a trace - Add update_identity_in_faces(): update identity_id/uuid for all face points with trace_id Point ID strategy: identity_id directly as point_id for _seeds collection All functions tested successfully	2026-06-25 00:47:25 +08:00
Accusys	9fbb4f9b48	feat: add Qdrant _faces collection embedding push - Add qdrant_faces.py utility module for _faces collection operations - Modify face_processor.py to push embeddings to Qdrant (CoreML extraction re-enabled) - Modify store_traced_faces.py to update trace_id in Qdrant after face tracking - Collection schema: 512D vectors, Cosine distance, fixed name '_faces' - Payload: file_uuid, frame, trace_id, bbox, confidence, identity_id/uuid, stranger_id - Batch size: 100 (default), configurable via QDRANT_BATCH_SIZE env var - Error handling: face_processor.py exits with error if Qdrant push fails	2026-06-25 00:23:20 +08:00
Accusys	074cdcdbed	refactor: remove face embedding architecture - single Qdrant _faces collection - Delete FaceEmbeddingDb module (face_embedding_db.rs) - Stub match_faces_iterative, generate_seed_embeddings, tmdb_match_handler - Remove sync_trace_embeddings, populate_face_embeddings_to_qdrant - Remove embedding from face.json output (face_processor.py) - Remove embedding from PG UPDATE (store_traced_faces.py) - Remove workspace traces staging (checkin.rs, qdrant_workspace.rs) - Fix tests: add pose_angle to Face, hand_nodes to TkgResult Disabled functions (need reimplement with _faces): - match_faces_iterative (identity agent) - generate_seed_embeddings (TMDb seeds) - tmdb_match_handler (TMDb matching) - cluster_face_embeddings, search_similar_faces - merge_traces_within_cuts	2026-06-24 22:27:09 +08:00
Accusys	360cb991e1	feat: add queued status + FIFO queue ordering - Add Queued variant to VideoStatus enum - Trigger sets videos.status='queued' instead of staying 'pending' - Worker sets videos.status='processing' on pickup - list_monitor_jobs_by_status ORDER BY created_at ASC (FIFO) - queue_position counts both 'pending' and 'queued' jobs	2026-06-24 05:18:40 +08:00
Accusys	14e886cc08	feat: progressive multi-round face matching + pending person API - Identity agent: per-face max matching, multi-round with derived seeds from high-confidence faces, angle diversity filter (cosine sim < 0.90) - Pending person API: POST /file/:file_uuid/pending-person + GET /file/:file_uuid/pending-persons with status=pending, source=manual - Update API docs (07_identity.md)	2026-06-24 03:42:04 +08:00
Accusys	766a1d9a6d	feat: Swift Face Pose integration + TKG 方案 B Major Changes: - swift_face_pose: output pose angles (yaw/pitch/roll) in face.json - face_processor.py: call swift_face_pose (dual output: face.json + pose.json) - Face struct: add pose_angle field - TKG 方案 B: gaze/lip_track nodes from face.json (no face_detections dependency) - Chunk cleanup: delete old data before rebuild (avoid duplicate key) - Hand nodes: classify by hand_type + gesture (15 combinations) - HAND_OBJECT edges: bbox spatial matching (174 matches) Test Results: - Blake Jones: 8 faces, pose_angle ✓, 66 nodes, 174 edges - FilmRiot: 394 faces, pose_angle ✓, 35 nodes, 39 edges - Left hands: 132, Right hands: 2 Architecture: - All TKG nodes built from JSON files (face.json, hand.json, yolo.json) - Swift processors: sample_interval=3 (Face/Pose/Hand sync) - Cleanup functions: delete_tkg_nodes_by_uuid, delete_tkg_edges_by_uuid	2026-06-23 05:47:24 +08:00
Accusys	e1e2da2140	fix: processor-counts API + ASRX field name conversion - Fix processor-counts API to correctly read JSON counts: - YOLO: use frames.length (was returning null) - CUT: prioritize scenes.length over frame_count - Result: YOLO 1963 frames, CUT 25 scenes (correct) - Fix ASRX field name conversion: - Convert start_time/end_time → start/end for ASRX compatibility - Prefer frame-based positioning over time-based - Document issues in issues_2026-06-21.md: - Issue 6: ASRX field name mismatch - Issue 7: processor-counts API null values	2026-06-22 23:33:39 +08:00
Accusys	db8bb8fa95	fix(tkg): handle null identity_id + remove skin_tone nodes - Fix Phase 2.5 null handling in build_gaze/lip_track_nodes - Use query_scalar::<_, Option<i64>> + flatten() for nullable fields - Prevents 'unexpected null' decoding errors - Remove skin_tone_trace_nodes from TKG build - Delete build_skin_tone_trace_nodes function (110 lines) - Remove from TkgResult struct and API response - Skin tone should be independent function, not in TKG Result: TKG rebuild now completes successfully - Nodes: 40 (face_track, gaze_track, text_region, appearance) - Edges: 2967 (co_occurrence edges increased from 21 → 2964)	2026-06-22 16:39:47 +08:00
Accusys	70e849d3ae	refactor: remove Rule 3, Story, and Caption processors - Remove Rule 3 (Scene Chunking) from worker auto-trigger - Remove rule3_ingest.rs and related imports - Remove Story/Caption from playground module parsing - Clean up scan.rs Rule 3 display - Fix ASRX field name conversion (start_time -> start) Reason: Story/5W1H/Scene accuracy too poor - will redesign later	2026-06-22 15:34:02 +08:00
Accusys	22f13eca4b	fix(cut): change ffprobe output format to default=nk=0 - Problem: compact=p=0:nk=1 outputs pipe-delimited format without pts_time= - Fix: default=nk=0 outputs pts_time=XXX format that parser can match - Result: Charade scene detection from 1 scene -> 833 scenes (correct)	2026-06-22 13:25:16 +08:00
Accusys	30b252ac95	fix: pre_chunks schema + TMDb movie name extraction - pre_chunks: add chunk_type, text_content columns; drop NOT NULL on coordinate_type/coordinate_index (INSERT statements reference these columns but CREATE TABLE was missing them) - run_migrations: add ALTER TABLE for existing databases - extract_movie_name: filter noise words (youtube, fps, 24fps, 1080p, pure digits) so 'Charade_YouTube_24fps' → 'Charade' - run-server-3002.sh: add companion worker startup (matching 3003 script)	2026-06-22 11:55:12 +08:00
Accusys	f4de741d5b	fix: add appearance back to processor list, keep mediapipe/story filtered out	2026-06-22 09:20:16 +08:00
Accusys	c93b54efeb	fix: filter deprecated processors from trigger API requests	2026-06-22 09:15:02 +08:00

1 2 3 4 5 ...

527 Commits