bookapp

Author	SHA1	Message	Date
thethreemagi	6684ec2bf5	feat: Improve book quality — stronger evaluator, more refinement attempts, quality-first model selection - Fix: chapter quality evaluation now uses model_logic (free Pro) instead of model_writer (Flash). The model that wrote the chapter was also scoring it, causing circular, lenient grading. - Increase max_attempts in write_chapter from 2 to 3 for more refinement passes per chapter. - Update auto model selection prompt (ai/setup.py) to prioritize quality over budget framing: free/preview/exp models preferred by capability (Pro > Flash, 2.5 > 2.0 > 1.5), not just cost. Writer role now allowed to use best free Flash/Pro preview — not restricted to basic Flash only. - Bump version to 3.0. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-22 21:28:49 -05:00
thethreemagi	3ba648ac5f	fix: Run DB migration for story_state/persona tables and missing run columns; fix defaults missing book_cost	2026-02-22 13:23:44 -05:00
thethreemagi	6f19808f15	fix: Clarify budget is text-only; Imagen cover cost (~$0.12 max) is separate	2026-02-22 10:43:08 -05:00
thethreemagi	f1d7fcbcb7	feat: Budget-aware model selection — book cost ceiling with per-role cost calculations	2026-02-22 10:41:22 -05:00
thethreemagi	c3724a6761	feat: Cost-aware Pro model selection — free Pro beats Flash, paid Pro loses to Flash	2026-02-22 10:38:57 -05:00
thethreemagi	74cc66eed3	feat: Prefer Flash models in auto-selection criteria for cost reduction	2026-02-22 10:33:38 -05:00
thethreemagi	353dc859d2	feat: Optimize AI model usage for cost reduction	2026-02-22 10:23:47 -05:00
thethreemagi	1f01fedf00	Auto-commit: v2.9 — Fix background task hangs (OAuth headless guard, SQLite timeouts, log touch) - ai/setup.py: Added threading import; OAuth block now detects background/headless threads and skips run_local_server to prevent indefinite blocking. Logs a clear warning and falls back to ADC for Vertex AI. Token file only written when creds are not None. - web/tasks.py: All sqlite3.connect() calls now use timeout=30, check_same_thread=False. OperationalError on the initial status update is caught and logged via utils.log. generate_book_task now touches initial_log immediately so the UI polling endpoint always finds an existing file even if the worker crashes on the next line. - ai_blueprint.md: Bumped to v2.9; Section 12.D sub-items 1-3 marked ✅; item 13 added to summary. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-21 10:50:00 -05:00
thethreemagi	1f799227d9	Auto-commit: Fix spinning logs — API timeouts + reliable Huey consumer start Root causes of indefinite spinning during book create/generate: 1. ai/models.py — ResilientModel.generate_content() had no timeout: a stalled Gemini API call would block the thread forever. Now injects request_options={"timeout": 180} into every call. Also removed the dangerous init_models(force=True) call inside the retry handler, which was making a second network call during an existing API failure. 2. ai/setup.py — genai.list_models() calls in get_optimal_model(), select_best_models(), and init_models() had no timeout. Added request_options={"timeout": 30} to all three calls so model init fails fast rather than hanging indefinitely. 3. web/app.py — Huey task consumer only started inside `if __name__ == "__main__":`, meaning tasks queued via flask run, gunicorn, or other WSGI runners were never executed (status stuck at "queued" forever). Moved consumer start to module level with a WERKZEUG_RUN_MAIN guard to prevent double-start under the reloader. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-21 02:16:39 -05:00
thethreemagi	db70ad81f7	Blueprint v1.0.4: Implemented AI Context Optimization & Token Management - core/utils.py: Added estimate_tokens(), truncate_to_tokens(), get_ai_cache(), set_ai_cache(), make_cache_key() utilities - story/writer.py: Applied truncate_to_tokens() to prev_content (2000 tokens) and prev_sum (600 tokens) context injections - story/editor.py: Applied truncate_to_tokens() to summary (1000t), last_chapter_text (800t), eval text (7500t), propagation contexts (2500t/3000t) - web/routes/persona.py: Added MD5-keyed in-memory cache for persona analyze endpoint; truncated sample_text to 750 tokens - ai/models.py: Added pre-dispatch payload size estimation with 30k-token warning threshold Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 23:30:39 -05:00
thethreemagi	f7099cc3e4	v2.0.0: Modularize project into single-responsibility packages Replaced monolithic modules/ package with a clean architecture: - core/ config.py, utils.py - ai/ models.py (ResilientModel), setup.py (init_models) - story/ planner.py, writer.py, editor.py, style_persona.py, bible_tracker.py - marketing/ cover.py, blurb.py, fonts.py, assets.py - export/ exporter.py - web/ app.py (Flask factory), db.py, helpers.py, tasks.py, routes/{auth,project,run,persona,admin}.py - cli/ engine.py (run_generation), wizard.py (BookWizard) Flask routes split into 5 Blueprints; all templates updated with blueprint-prefixed url_for() calls. Dockerfile and docker-compose updated to use web.app entry point and new package paths. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 22:20:53 -05:00
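
Commit db70ad81f7 names the helpers estimate_tokens(), truncate_to_tokens(), make_cache_key(), get_ai_cache() and set_ai_cache() but the log does not show their bodies. The sketch below is one plausible shape for them, not the project's code: the 4-characters-per-token ratio, the keep-the-head truncation policy, the argument lists, and the module-level dict are all assumptions made for illustration.

```python
import hashlib

# Assumption: a rough characters-per-token ratio; the real helper may differ.
CHARS_PER_TOKEN = 4

# Assumption: a plain in-process dict stands in for the in-memory cache.
_ai_cache = {}


def estimate_tokens(text):
    """Estimate the token count from the character count (rounded up)."""
    return (len(text) + CHARS_PER_TOKEN - 1) // CHARS_PER_TOKEN


def truncate_to_tokens(text, max_tokens):
    """Return text cut to roughly max_tokens tokens.

    Assumption: this sketch keeps the start of the text; the project may
    keep the end instead (for example, for previous-chapter context).
    """
    if max_tokens <= 0:
        return ""
    return text[: max_tokens * CHARS_PER_TOKEN]


def make_cache_key(*parts):
    """Build an MD5 hex digest from the given string parts."""
    joined = "\x1f".join(parts)  # unit separator keeps part boundaries distinct
    return hashlib.md5(joined.encode("utf-8")).hexdigest()


def get_ai_cache(key):
    """Return the cached value for key, or None when it is absent."""
    return _ai_cache.get(key)


def set_ai_cache(key, value):
    """Store value under key in the in-memory cache."""
    _ai_cache[key] = value
```

Under these assumptions, a caller would truncate a context string to its budget (say 2000 tokens for prev_content) before building the prompt, and would check get_ai_cache(make_cache_key(sample_text)) before issuing a repeat analysis request.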

11 Commits
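
The v2.9 commit (1f01fedf00) describes three defensive steps in web/tasks.py: touch the log file first, open SQLite with timeout=30 and check_same_thread=False, and catch OperationalError on the initial status update. A minimal sketch of that pattern follows. The function names, the runs table, and its columns are hypothetical, and the sketch writes to the log file directly where the commit says the project logs via utils.log.

```python
import sqlite3
from pathlib import Path


def open_db(db_path):
    """Open SQLite with a 30 s busy timeout, usable from worker threads."""
    return sqlite3.connect(db_path, timeout=30, check_same_thread=False)


def start_task(db_path, log_path, run_id):
    """Mark a run as running; return True on success, False on a DB error.

    Hypothetical helper: the table name `runs` and its columns are
    assumptions, not taken from the repository.
    """
    # Touch the log first so a polling endpoint always finds a file,
    # even if the worker fails on the very next statement.
    Path(log_path).touch(exist_ok=True)
    try:
        conn = open_db(db_path)
        try:
            with conn:  # commits on success, rolls back on error
                conn.execute(
                    "UPDATE runs SET status = ? WHERE id = ?",
                    ("running", run_id),
                )
        finally:
            conn.close()
        return True
    except sqlite3.OperationalError as exc:
        # Record the failure instead of letting the task die silently.
        with open(log_path, "a", encoding="utf-8") as fh:
            fh.write(f"status update failed: {exc}\n")
        return False
```

Touching the log before any database work is the step that keeps the UI from polling a missing file; the timeout only bounds how long a locked database can stall the worker.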