gh-1SecondEveryday-image-analysis-eval

sjs/gh-1SecondEveryday-image-analysis-eval

mirror of https://github.com/1SecondEveryday/image-analysis-eval.git synced 2026-03-25 09:05:49 +00:00

Commit graph

Author

SHA1

Message

Date

Sami Samhuri

0848b43304

Enhance README with comprehensive testing history and insights

Documents the complete 7-round evaluation process, from initial 6-model testing through Gemma3:12b's breakthrough selfie detection. Adds historical context for removed experimental prompts (07-11), model evolution insights, and performance characteristics discovered through extensive testing.

Key additions:
- Complete testing history (Take 1-7 plus mini-tests)
- Model ranking evolution and breakthrough discoveries
- Experimental prompt history and removal rationale
- Technical insights from 768px optimization and repetition patterns
- Results archive documentation

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-07-09 13:35:46 -07:00

Sami Samhuri

357018ee7b

Add comprehensive README for image analysis evaluation framework

Documents the VLM evaluation system for extracting searchable tags from video diary snippets. Includes setup instructions, script documentation, prompt strategies, and performance insights from extensive testing.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-07-09 13:26:34 -07:00

2 commits