From 024034f708fec20268c8da34298f87405f17eec4 Mon Sep 17 00:00:00 2001
From: HP <2726519488@qq.com>
Date: Thu, 4 Jun 2026 19:49:26 +0800
Subject: [PATCH] systemflow and spec

---
 docs/system_flow_and_interface_spec.md | 1315 ++++++++++++++++++++++++
 1 file changed, 1315 insertions(+)
 create mode 100644 docs/system_flow_and_interface_spec.md

diff --git a/docs/system_flow_and_interface_spec.md b/docs/system_flow_and_interface_spec.md
new file mode 100644
index 0000000..f587bf2
--- /dev/null
+++ b/docs/system_flow_and_interface_spec.md
@@ -0,0 +1,1315 @@
+# 课前拍照打卡系统总流程与接口规范
+
+## 1. 文档目的
+
+本文档用于说明课前拍照打卡系统的整体实现流程、模块拆分、模块内部职责、模块之间的接口规范、数据结构、错误码、数据库读写约定和后续扩展方向。
+
+当前阶段系统按单机板端方案设计，优先实现两个基础能力：
+
+1. 大图人头数统计：输入课堂大图，检测人头或人脸位置，输出人数统计结果。
+2. 单张人脸匹配：输入单张清晰人脸，和本地已注册人脸库匹配，输出识别结果。
+
+完整课堂大图多人脸识别打卡属于后续扩展功能。当前文档会预留接口，但不会把第一阶段目标复杂化。
+
+## 2. 设计原则
+
+1. 单机优先：程序、模型、配置、数据库都部署在 IMX6ULL_ALPHA 板端。
+2. 先命令行，后界面：第一阶段使用命令行运行，方便调试、答辩和移植。
+3. 模块解耦：摄像头、模型推理、后处理、数据库、业务流程分离。
+4. 模型可替换：接口层不绑定具体模型，ncnn 和 TFLite 通过适配层切换。
+5. 数据可追溯：每次检测、匹配都保存输入图片路径、结果、阈值、模型版本和时间。
+6. 板端友好：避免重复加载模型，避免保存过多大文件，避免不必要的内存拷贝。
+
+## 3. 默认技术决策
+
+当前先采用以下默认方案，后续如果实测性能不满足，再做调整。
+
+| 项目 | 默认方案 | 原因 |
+| --- | --- | --- |
+| 开发语言 | C++17 | 适合嵌入式 Linux，便于调用 OpenCV、SQLite、ncnn |
+| 构建系统 | CMake | 方便 PC 编译和交叉编译 |
+| 图像处理 | OpenCV | 摄像头采集、图片读写、resize、画框方便 |
+| 推理框架 | ncnn 优先，TFLite 预留 | ncnn 对 ARM CPU 友好，部署文件简单 |
+| 本地数据库 | SQLite | 单机部署简单，不需要数据库服务 |
+| 配置格式 | JSON | 结构清晰，便于维护 |
+| 第一阶段运行方式 | 命令行工具 | 降低 UI 干扰，先跑通核心功能 |
+| 时间格式 | 本地时间字符串 `YYYY-MM-DD HH:MM:SS` | 便于查看数据库记录 |
+| 图片路径 | 相对项目运行目录保存 | 方便板端迁移 |
+
+## 4. 系统总体架构
+
+```text
+                 +-----------------------------+
+                 |        attendance_app       |
+                 |  command line entry point   |
+                 +--------------+--------------+
+                                |
+                                v
+                 +-----------------------------+
+                 |          AppContext         |
+                 | config / db / model paths   |
+                 +--+-----------+-----------+--+
+                    |           |           |
+        +-----------+           |           +----------------+
+        v                       v                            v
++---------------+       +---------------+            +----------------+
+| CameraCapture |       | HeadCounter   |            | FaceMatchFlow  |
+| V4L2/OpenCV   |       | large image   |            | register/match |
++-------+-------+       +-------+-------+            +--------+-------+
+        |                       |                             |
+        v                       v                             v
++---------------+       +---------------+            +----------------+
+| ImageUtils    |       | HeadDetector  |            | FaceDetector   |
+| read/save     |       | model infer   |            | single face    |
+| draw/resize   |       +-------+-------+            +--------+-------+
++---------------+               |                             |
+                                v                             v
+                        +---------------+            +----------------+
+                        | Postprocess   |            | FaceAligner    |
+                        | decode / NMS  |            | crop / align   |
+                        +-------+-------+            +--------+-------+
+                                |                             |
+                                v                             v
+                        +---------------+            +----------------+
+                        | AttendanceDB  |<-----------| FeatureExtract |
+                        | SQLite write  |            | embedding      |
+                        +---------------+            +--------+-------+
+                                                              |
+                                                              v
+                                                     +----------------+
+                                                     | FaceMatcher    |
+                                                     | cosine search  |
+                                                     +----------------+
+```
+
+## 5. 程序运行模式
+
+主程序 `attendance_app` 负责解析命令行参数，并根据 `--mode` 调用不同业务流程。
+
+| 模式 | 功能 | 当前阶段 |
+| --- | --- | --- |
+| `capture` | 摄像头拍照并保存图片 | 可选实现 |
+| `head_count` | 大图人头数统计 | 必须实现 |
+| `register_face` | 注册学生人脸特征 | 必须实现 |
+| `face_match` | 单张人脸匹配 | 必须实现 |
+| `init_db` | 初始化数据库 | 必须实现 |
+| `full_attendance` | 大图多人脸识别打卡 | 后续扩展 |
+
+### 5.1 命令行规范
+
+```bash
+./attendance_app --mode init_db
+./attendance_app --mode capture --output data/images/capture_001.jpg
+./attendance_app --mode head_count --image data/images/classroom.jpg
+./attendance_app --mode register_face --student-no 20240001 --name zhangsan --class-name class01 --image data/faces/zhangsan.jpg
+./attendance_app --mode face_match --image data/images/test_face.jpg
+```
+
+### 5.2 通用命令行参数
+
+| 参数 | 必填 | 说明 |
+| --- | --- | --- |
+| `--mode` | 是 | 运行模式 |
+| `--config` | 否 | 主配置文件路径，默认 `config/app_config.json` |
+| `--image` | 部分模式必填 | 输入图片路径 |
+| `--output` | 否 | 输出图片路径 |
+| `--save-result-image` | 否 | 是否保存带框结果图 |
+| `--verbose` | 否 | 输出详细日志 |
+
+## 6. 总流程设计
+
+## 6.1 程序启动流程
+
+所有模式共用启动流程。
+
+```text
+main()
+  -> parse command line
+  -> load app_config.json
+  -> load camera_config.json
+  -> load model_config.json
+  -> create AppContext
+  -> init log module
+  -> ensure data directories exist
+  -> open SQLite database
+  -> dispatch by mode
+```
+
+### 6.1.1 启动阶段检查项
+
+| 检查项 | 失败处理 |
+| --- | --- |
+| 配置文件存在且可解析 | 返回 `CONFIG_ERROR` |
+| 数据目录可写 | 返回 `FILE_IO_ERROR` |
+| 数据库可打开 | 返回 `DATABASE_ERROR` |
+| 模型文件存在 | 进入对应模式时返回 `MODEL_LOAD_ERROR` |
+| 输入图片存在 | 返回 `INVALID_ARGUMENT` 或 `FILE_IO_ERROR` |
+
+## 6.2 大图人头数统计流程
+
+### 6.2.1 流程说明
+
+```text
+输入图片路径或摄像头采集
+  -> 读取原图 cv::Mat
+  -> 检查图片是否为空
+  -> resize / letterbox 到模型输入尺寸
+  -> 模型推理
+  -> 解码模型输出
+  -> 置信度过滤
+  -> NMS 去重
+  -> 坐标映射回原图
+  -> 统计有效目标数量
+  -> 绘制检测框
+  -> 保存结果图片
+  -> 写入 head_count_records
+  -> 输出 JSON 或终端结果
+```
+
+### 6.2.2 顺序调用关系
+
+```text
+HeadCountFlow
+  -> ImageUtils::ReadImage()
+  -> HeadCounter::Count()
+     -> HeadDetector::Detect()
+        -> HeadPreprocessor::Preprocess()
+        -> ModelEngine::Forward()
+        -> DetectPostprocess::Decode()
+        -> DetectPostprocess::Nms()
+     -> ImageUtils::DrawDetections()
+  -> AttendanceDB::InsertHeadCountRecord()
+```
+
+### 6.2.3 输入
+
+| 字段 | 类型 | 说明 |
+| --- | --- | --- |
+| `image_path` | string | 待检测课堂大图 |
+| `save_result_image` | bool | 是否保存带框图片 |
+| `result_image_path` | string | 结果图片路径，可为空 |
+| `confidence_threshold` | float | 检测置信度阈值 |
+| `nms_threshold` | float | NMS 阈值 |
+
+### 6.2.4 输出
+
+| 字段 | 类型 | 说明 |
+| --- | --- | --- |
+| `head_count` | int | 最终统计人数 |
+| `detections` | array | 检测框列表 |
+| `result_image_path` | string | 带框图片路径 |
+| `elapsed_ms` | int | 处理耗时 |
+| `record_id` | int64 | 数据库记录 ID |
+
+## 6.3 单张人脸注册流程
+
+### 6.3.1 流程说明
+
+```text
+输入学生信息和人脸图片
+  -> 检查 student_no 是否已存在
+  -> 读取图片
+  -> 人脸检测
+  -> 如果多张人脸，选择最大人脸
+  -> 人脸关键点定位
+  -> 人脸对齐为固定尺寸
+  -> 特征提取
+  -> 特征向量 L2 归一化
+  -> 保存学生信息
+  -> 保存人脸图片和特征向量
+  -> 输出注册结果
+```
+
+### 6.3.2 顺序调用关系
+
+```text
+FaceRegisterFlow
+  -> FaceRepository::FindStudentByNo()
+  -> ImageUtils::ReadImage()
+  -> FaceDetector::Detect()
+  -> FaceAligner::Align()
+  -> FaceFeatureExtractor::Extract()
+  -> FaceMatcher::Normalize()
+  -> FaceRepository::InsertStudent()
+  -> FaceRepository::InsertFaceTemplate()
+```
+
+### 6.3.3 输入
+
+| 字段 | 类型 | 说明 |
+| --- | --- | --- |
+| `student_no` | string | 学号，唯一 |
+| `name` | string | 姓名 |
+| `class_name` | string | 班级 |
+| `image_path` | string | 注册人脸图片 |
+| `overwrite` | bool | 学号已存在时是否覆盖 |
+
+### 6.3.4 输出
+
+| 字段 | 类型 | 说明 |
+| --- | --- | --- |
+| `student_id` | int64 | 数据库学生 ID |
+| `template_id` | int64 | 人脸模板 ID |
+| `face_image_path` | string | 保存的人脸图片路径 |
+| `feature_dim` | int | 特征维度 |
+| `message` | string | 注册说明 |
+
+## 6.4 单张人脸匹配流程
+
+### 6.4.1 流程说明
+
+```text
+输入测试人脸图片
+  -> 读取图片
+  -> 人脸检测
+  -> 如果多张人脸，选择最大人脸
+  -> 人脸对齐
+  -> 特征提取
+  -> 特征向量归一化
+  -> 从数据库读取所有注册特征
+  -> 逐个计算余弦相似度
+  -> 选择最高分
+  -> 根据阈值判断 matched / unknown
+  -> 写入 face_match_records
+  -> 输出匹配结果
+```
+
+### 6.4.2 顺序调用关系
+
+```text
+FaceMatchFlow
+  -> ImageUtils::ReadImage()
+  -> FaceDetector::Detect()
+  -> FaceAligner::Align()
+  -> FaceFeatureExtractor::Extract()
+  -> FaceRepository::LoadAllTemplates()
+  -> FaceMatcher::Match()
+  -> FaceRepository::InsertFaceMatchRecord()
+```
+
+### 6.4.3 输入
+
+| 字段 | 类型 | 说明 |
+| --- | --- | --- |
+| `image_path` | string | 待匹配人脸图片 |
+| `threshold` | float | 匹配阈值 |
+| `save_aligned_face` | bool | 是否保存对齐后人脸 |
+
+### 6.4.4 输出
+
+| 字段 | 类型 | 说明 |
+| --- | --- | --- |
+| `matched` | bool | 是否匹配成功 |
+| `student_id` | int64 | 匹配到的学生 ID |
+| `student_no` | string | 学号 |
+| `name` | string | 姓名 |
+| `score` | float | 最高相似度 |
+| `threshold` | float | 使用阈值 |
+| `status` | string | `matched`、`unknown`、`failed` |
+| `record_id` | int64 | 识别记录 ID |
+
+## 6.5 后续完整打卡流程
+
+完整流程在第一阶段之后实现。
+
+```text
+课堂大图拍照
+  -> 大图人脸检测
+  -> 裁剪每个人脸
+  -> 对每个人脸做对齐和特征提取
+  -> 对每个人脸做数据库匹配
+  -> 同一学生结果去重
+  -> 生成一次课堂 session
+  -> 保存每个学生的到课状态
+  -> 保存未知人脸和低置信度人脸
+```
+
+为避免第一阶段过度复杂，当前仅预留 `full_attendance` 模式和数据库扩展方向。
+
+## 7. 模块详细设计
+
+## 7.1 App 模块
+
+### 7.1.1 职责
+
+| 文件 | 职责 |
+| --- | --- |
+| `include/app/app_config.h` | 配置结构定义 |
+| `src/app/app_config.cpp` | JSON 配置读取、默认值填充 |
+| `include/app/app_context.h` | 全局运行上下文 |
+| `src/app/app_context.cpp` | 初始化目录、数据库、公共资源 |
+| `src/main.cpp` | 命令行入口和模式分发 |
+
+### 7.1.2 AppContext 内容
+
+```cpp
+struct AppContext {
+    AppConfig app_config;
+    CameraConfig camera_config;
+    ModelConfig model_config;
+    std::shared_ptr<AttendanceDB> db;
+    std::shared_ptr<Logger> logger;
+};
+```
+
+### 7.1.3 约束
+
+1. `AppContext` 只保存公共依赖，不直接实现业务逻辑。
+2. 模型对象由具体业务模块懒加载或在流程开始时加载。
+3. 数据库连接在程序启动时打开，程序退出时关闭。
+
+## 7.2 Config 模块
+
+### 7.2.1 app_config.json
+
+```json
+{
+  "data_root": "data",
+  "database_path": "data/database/attendance.db",
+  "log_path": "data/logs/attendance.log",
+  "save_result_image": true,
+  "timezone": "Asia/Shanghai"
+}
+```
+
+### 7.2.2 camera_config.json
+
+```json
+{
+  "device": "/dev/video0",
+  "width": 1280,
+  "height": 720,
+  "fps": 15,
+  "format": "MJPEG",
+  "warmup_frames": 5
+}
+```
+
+### 7.2.3 model_config.json
+
+```json
+{
+  "engine": "ncnn",
+  "head_detect": {
+    "param": "models/head_detect/head_detect.param",
+    "bin": "models/head_detect/head_detect.bin",
+    "input_width": 416,
+    "input_height": 416,
+    "confidence_threshold": 0.4,
+    "nms_threshold": 0.45,
+    "model_version": "head_detect_v1"
+  },
+  "face_detect": {
+    "param": "models/face_recognition/face_detect.param",
+    "bin": "models/face_recognition/face_detect.bin",
+    "input_width": 320,
+    "input_height": 320,
+    "confidence_threshold": 0.6,
+    "nms_threshold": 0.4,
+    "model_version": "face_detect_v1"
+  },
+  "face_feature": {
+    "param": "models/face_recognition/face_feature.param",
+    "bin": "models/face_recognition/face_feature.bin",
+    "input_width": 112,
+    "input_height": 112,
+    "feature_dim": 128,
+    "match_threshold": 0.75,
+    "model_version": "face_feature_v1"
+  }
+}
+```
+
+### 7.2.4 配置读取规则
+
+1. 配置缺失时使用默认值，但模型路径、数据库路径不能缺失。
+2. 命令行参数优先级高于配置文件。
+3. 阈值必须在 `[0, 1]` 范围内，否则返回 `CONFIG_ERROR`。
+4. 文件路径统一使用 `/`，在 Windows 调试时由 C++ 标准库兼容处理。
+
+## 7.3 Camera 模块
+
+### 7.3.1 职责
+
+1. 打开摄像头设备。
+2. 设置分辨率、帧率、格式。
+3. 采集一帧图片。
+4. 保存拍照图片。
+5. 出错时返回明确错误码。
+
+### 7.3.2 接口定义
+
+```cpp
+struct CameraConfig {
+    std::string device;
+    int width;
+    int height;
+    int fps;
+    std::string format;
+    int warmup_frames;
+};
+
+struct CaptureResult {
+    cv::Mat image;
+    std::string saved_path;
+    int width;
+    int height;
+    std::string captured_at;
+};
+
+class CameraCapture {
+public:
+    Status Open(const CameraConfig& config);
+    Result<CaptureResult> CaptureOne(const std::string& save_path);
+    void Close();
+};
+```
+
+### 7.3.3 实现约定
+
+1. PC 调试阶段可使用 OpenCV `VideoCapture`。
+2. 板端如果 OpenCV 摄像头采集不稳定，再切换为 V4L2 实现。
+3. 摄像头打开后先丢弃 `warmup_frames` 帧，降低曝光不稳定影响。
+4. 保存图片默认使用 JPEG，质量参数建议 `85`。
+
+## 7.4 ImageUtils 模块
+
+### 7.4.1 职责
+
+1. 图片读取和保存。
+2. 图片尺寸检查。
+3. resize 和 letterbox。
+4. 检测框绘制。
+5. 图片路径生成。
+
+### 7.4.2 接口定义
+
+```cpp
+struct LetterboxInfo {
+    float scale;
+    int pad_x;
+    int pad_y;
+    int resized_width;
+    int resized_height;
+};
+
+Result<cv::Mat> ReadImage(const std::string& path);
+Status SaveImage(const std::string& path, const cv::Mat& image);
+Result<cv::Mat> ResizeWithLetterbox(
+    const cv::Mat& image,
+    int target_width,
+    int target_height,
+    LetterboxInfo* info
+);
+cv::Rect MapBoxToOriginal(
+    const cv::Rect2f& model_box,
+    const LetterboxInfo& info,
+    int original_width,
+    int original_height
+);
+void DrawDetections(cv::Mat* image, const std::vector<Detection>& detections);
+```
+
+### 7.4.3 图片坐标约定
+
+1. 原点在左上角。
+2. `x` 向右增大，`y` 向下增大。
+3. 检测框格式默认使用 `x, y, width, height`。
+4. 数据库存储整数坐标，模型内部可使用浮点坐标。
+
+## 7.5 ModelEngine 模块
+
+### 7.5.1 职责
+
+为 ncnn 或 TFLite 提供统一推理接口，让业务模块不直接依赖具体推理框架。
+
+### 7.5.2 接口定义
+
+```cpp
+struct Tensor {
+    std::vector<int> shape;
+    std::vector<float> data;
+};
+
+struct ModelInput {
+    std::string name;
+    Tensor tensor;
+};
+
+struct ModelOutput {
+    std::string name;
+    Tensor tensor;
+};
+
+class ModelEngine {
+public:
+    virtual ~ModelEngine() = default;
+    virtual Status Load(const ModelLoadOptions& options) = 0;
+    virtual Result<std::vector<ModelOutput>> Forward(
+        const std::vector<ModelInput>& inputs
+    ) = 0;
+};
+```
+
+### 7.5.3 第一阶段简化方案
+
+如果时间紧，可以先不抽象 `ModelEngine`，直接在 `HeadDetector`、`FaceDetector`、`FaceFeatureExtractor` 中调用 ncnn。  
+但头文件接口仍然建议保持稳定，避免后续切换模型时影响业务流程。
+
+## 7.6 HeadCount 模块
+
+### 7.6.1 文件职责
+
+| 文件 | 职责 |
+| --- | --- |
+| `include/head_count/head_detector.h` | 人头检测模型接口 |
+| `src/head_count/head_detector.cpp` | 模型加载、预处理、推理 |
+| `include/head_count/detect_postprocess.h` | 检测后处理接口 |
+| `src/head_count/detect_postprocess.cpp` | 输出解码、阈值过滤、NMS |
+| `include/head_count/head_counter.h` | 人头统计业务接口 |
+| `src/head_count/head_counter.cpp` | 串联检测、统计、保存结果 |
+
+### 7.6.2 核心数据结构
+
+```cpp
+struct Detection {
+    int class_id;
+    std::string class_name;
+    float score;
+    float x;
+    float y;
+    float width;
+    float height;
+};
+
+struct HeadCountRequest {
+    std::string image_path;
+    bool save_result_image;
+    std::string result_image_path;
+    float confidence_threshold;
+    float nms_threshold;
+};
+
+struct HeadCountResult {
+    int head_count;
+    std::vector<Detection> detections;
+    std::string image_path;
+    std::string result_image_path;
+    int elapsed_ms;
+    int64_t record_id;
+};
+```
+
+### 7.6.3 HeadDetector 接口
+
+```cpp
+class HeadDetector {
+public:
+    Status Load(const HeadDetectConfig& config);
+    Result<std::vector<Detection>> Detect(const cv::Mat& image);
+};
+```
+
+### 7.6.4 HeadCounter 接口
+
+```cpp
+class HeadCounter {
+public:
+    HeadCounter(HeadDetector* detector, AttendanceDB* db);
+    Result<HeadCountResult> Count(const HeadCountRequest& request);
+};
+```
+
+### 7.6.5 后处理规则
+
+1. 先按置信度过滤。
+2. 再按类别过滤，只保留 `head` 或 `face` 类别。
+3. 执行 NMS。
+4. 将检测框映射回原图。
+5. 裁剪超出图片边界的坐标。
+6. 最终 `head_count = detections.size()`。
+
+### 7.6.6 NMS 接口
+
+```cpp
+std::vector<Detection> Nms(
+    const std::vector<Detection>& input,
+    float iou_threshold
+);
+```
+
+## 7.7 FaceMatch 模块
+
+### 7.7.1 文件职责
+
+| 文件 | 职责 |
+| --- | --- |
+| `include/face_match/face_detector.h` | 单张图片人脸检测接口 |
+| `src/face_match/face_detector.cpp` | 人脸检测模型推理 |
+| `include/face_match/face_aligner.h` | 人脸对齐接口 |
+| `src/face_match/face_aligner.cpp` | 关键点对齐和裁剪 |
+| `include/face_match/face_feature_extractor.h` | 特征提取接口 |
+| `src/face_match/face_feature_extractor.cpp` | 人脸特征模型推理 |
+| `include/face_match/face_matcher.h` | 特征匹配接口 |
+| `src/face_match/face_matcher.cpp` | 余弦相似度匹配 |
+
+### 7.7.2 核心数据结构
+
+```cpp
+struct FaceLandmarks {
+    cv::Point2f left_eye;
+    cv::Point2f right_eye;
+    cv::Point2f nose;
+    cv::Point2f left_mouth;
+    cv::Point2f right_mouth;
+};
+
+struct FaceDetection {
+    float score;
+    cv::Rect2f box;
+    FaceLandmarks landmarks;
+};
+
+struct FaceFeature {
+    std::vector<float> values;
+    int dim;
+    std::string model_version;
+};
+
+struct Student {
+    int64_t id;
+    std::string student_no;
+    std::string name;
+    std::string class_name;
+};
+
+struct FaceTemplate {
+    int64_t id;
+    Student student;
+    FaceFeature feature;
+    std::string face_image_path;
+};
+```
+
+### 7.7.3 FaceDetector 接口
+
+```cpp
+class FaceDetector {
+public:
+    Status Load(const FaceDetectConfig& config);
+    Result<std::vector<FaceDetection>> Detect(const cv::Mat& image);
+};
+```
+
+### 7.7.4 FaceAligner 接口
+
+```cpp
+class FaceAligner {
+public:
+    Result<cv::Mat> Align(
+        const cv::Mat& image,
+        const FaceDetection& face,
+        int output_width,
+        int output_height
+    );
+};
+```
+
+### 7.7.5 FaceFeatureExtractor 接口
+
+```cpp
+class FaceFeatureExtractor {
+public:
+    Status Load(const FaceFeatureConfig& config);
+    Result<FaceFeature> Extract(const cv::Mat& aligned_face);
+};
+```
+
+### 7.7.6 FaceMatcher 接口
+
+```cpp
+struct FaceMatchCandidate {
+    int64_t student_id;
+    std::string student_no;
+    std::string name;
+    float score;
+};
+
+struct FaceMatchRequest {
+    std::string image_path;
+    float threshold;
+    bool save_aligned_face;
+};
+
+struct FaceMatchResult {
+    bool matched;
+    std::string status;
+    FaceMatchCandidate best;
+    float threshold;
+    std::string input_image_path;
+    std::string aligned_face_path;
+    int elapsed_ms;
+    int64_t record_id;
+};
+
+class FaceMatcher {
+public:
+    static void Normalize(FaceFeature* feature);
+    static float CosineSimilarity(
+        const std::vector<float>& a,
+        const std::vector<float>& b
+    );
+    Result<FaceMatchResult> Match(
+        const FaceFeature& query,
+        const std::vector<FaceTemplate>& templates,
+        float threshold
+    );
+};
+```
+
+### 7.7.7 多人脸处理规则
+
+单张人脸匹配功能原则上要求输入图片只有一个人脸。若检测到多张人脸，第一阶段默认选择面积最大的人脸。
+
+后续可以增加策略：
+
+1. 选择置信度最高的人脸。
+2. 选择离图片中心最近的人脸。
+3. 返回错误，要求重新拍摄。
+
+当前推荐：注册和单张匹配都选择面积最大人脸，并在日志中记录检测到的人脸数量。
+
+## 7.8 Database 模块
+
+### 7.8.1 文件职责
+
+| 文件 | 职责 |
+| --- | --- |
+| `include/database/attendance_db.h` | 数据库连接和表初始化 |
+| `src/database/attendance_db.cpp` | SQLite 执行 SQL、事务封装 |
+| `include/database/face_repository.h` | 学生、人脸模板、识别记录的数据访问 |
+| `src/database/face_repository.cpp` | 业务数据读写实现 |
+
+### 7.8.2 AttendanceDB 接口
+
+```cpp
+class AttendanceDB {
+public:
+    Status Open(const std::string& db_path);
+    Status InitSchema();
+    Status Execute(const std::string& sql);
+    sqlite3* RawHandle();
+    void Close();
+};
+```
+
+### 7.8.3 FaceRepository 接口
+
+```cpp
+class FaceRepository {
+public:
+    explicit FaceRepository(AttendanceDB* db);
+
+    Result<int64_t> InsertStudent(const Student& student);
+    Result<Student> FindStudentByNo(const std::string& student_no);
+    Result<int64_t> InsertFaceTemplate(
+        int64_t student_id,
+        const std::string& image_path,
+        const FaceFeature& feature
+    );
+    Result<std::vector<FaceTemplate>> LoadAllTemplates();
+    Result<int64_t> InsertFaceMatchRecord(const FaceMatchResult& result);
+    Result<int64_t> InsertHeadCountRecord(const HeadCountResult& result);
+};
+```
+
+### 7.8.4 事务约定
+
+1. 注册学生时，`students` 和 `face_templates` 必须在同一事务内写入。
+2. 写入检测记录时，如果结果图片保存失败，不写数据库记录。
+3. 数据库写失败时，业务流程返回失败，但不删除原始输入图片。
+4. 特征向量以 float32 二进制 BLOB 保存。
+
+## 8. 通用接口规范
+
+## 8.1 命名空间
+
+所有业务代码建议放入统一命名空间：
+
+```cpp
+namespace attendance {
+// ...
+}
+```
+
+## 8.2 返回值规范
+
+推荐使用 `Status` 和 `Result<T>` 统一表达成功、失败和错误信息。
+
+```cpp
+enum class StatusCode {
+    OK = 0,
+    INVALID_ARGUMENT,
+    CONFIG_ERROR,
+    FILE_IO_ERROR,
+    CAMERA_ERROR,
+    MODEL_LOAD_ERROR,
+    MODEL_INFER_ERROR,
+    IMAGE_PROCESS_ERROR,
+    FACE_NOT_FOUND,
+    MULTIPLE_FACES_FOUND,
+    FEATURE_ERROR,
+    DATABASE_ERROR,
+    NOT_FOUND,
+    UNKNOWN_ERROR
+};
+
+struct Status {
+    StatusCode code;
+    std::string message;
+
+    bool ok() const {
+        return code == StatusCode::OK;
+    }
+};
+
+template <typename T>
+struct Result {
+    Status status;
+    T value;
+
+    bool ok() const {
+        return status.ok();
+    }
+};
+```
+
+### 8.2.1 错误码说明
+
+| 错误码 | 说明 |
+| --- | --- |
+| `OK` | 成功 |
+| `INVALID_ARGUMENT` | 命令行参数或函数入参不合法 |
+| `CONFIG_ERROR` | 配置文件缺失、格式错误、字段非法 |
+| `FILE_IO_ERROR` | 图片、模型、目录读写失败 |
+| `CAMERA_ERROR` | 摄像头打开或采集失败 |
+| `MODEL_LOAD_ERROR` | 模型文件加载失败 |
+| `MODEL_INFER_ERROR` | 模型推理失败 |
+| `IMAGE_PROCESS_ERROR` | 图片 resize、转换、裁剪失败 |
+| `FACE_NOT_FOUND` | 未检测到人脸 |
+| `MULTIPLE_FACES_FOUND` | 检测到多张人脸且当前策略不允许 |
+| `FEATURE_ERROR` | 特征维度错误或特征提取失败 |
+| `DATABASE_ERROR` | SQLite 执行失败 |
+| `NOT_FOUND` | 数据不存在 |
+| `UNKNOWN_ERROR` | 未分类错误 |
+
+## 8.3 日志规范
+
+日志格式：
+
+```text
+[2026-06-04 09:00:00] [INFO] [HeadCounter] image=data/images/classroom.jpg head_count=36 elapsed_ms=2530
+```
+
+日志级别：
+
+| 级别 | 使用场景 |
+| --- | --- |
+| `DEBUG` | 模型输入尺寸、检测框数量、阈值等调试信息 |
+| `INFO` | 主要流程开始和结束 |
+| `WARN` | 非致命问题，例如检测到多张人脸但选择最大人脸 |
+| `ERROR` | 流程失败，例如模型加载失败、数据库写失败 |
+
+## 8.4 JSON 输出规范
+
+命令行工具建议支持 `--json` 参数，便于后续 UI 或脚本调用。
+
+### 8.4.1 head_count 输出
+
+```json
+{
+  "status": "ok",
+  "mode": "head_count",
+  "image_path": "data/images/classroom.jpg",
+  "result_image_path": "data/results/classroom_detected.jpg",
+  "head_count": 36,
+  "detections": [
+    {
+      "class_id": 0,
+      "class_name": "head",
+      "score": 0.91,
+      "x": 120,
+      "y": 85,
+      "width": 32,
+      "height": 36
+    }
+  ],
+  "elapsed_ms": 2530,
+  "record_id": 1
+}
+```
+
+### 8.4.2 face_match 输出
+
+```json
+{
+  "status": "matched",
+  "mode": "face_match",
+  "input_image_path": "data/images/test_face.jpg",
+  "matched": true,
+  "student_id": 1,
+  "student_no": "20240001",
+  "name": "zhangsan",
+  "score": 0.82,
+  "threshold": 0.75,
+  "elapsed_ms": 840,
+  "record_id": 12
+}
+```
+
+### 8.4.3 失败输出
+
+```json
+{
+  "status": "error",
+  "code": "FACE_NOT_FOUND",
+  "message": "no face detected in image",
+  "mode": "face_match"
+}
+```
+
+## 9. 数据库设计
+
+## 9.1 students
+
+```sql
+CREATE TABLE IF NOT EXISTS students (
+    id INTEGER PRIMARY KEY AUTOINCREMENT,
+    student_no TEXT NOT NULL UNIQUE,
+    name TEXT NOT NULL,
+    class_name TEXT,
+    created_at TEXT NOT NULL,
+    updated_at TEXT
+);
+```
+
+## 9.2 face_templates
+
+```sql
+CREATE TABLE IF NOT EXISTS face_templates (
+    id INTEGER PRIMARY KEY AUTOINCREMENT,
+    student_id INTEGER NOT NULL,
+    face_image_path TEXT NOT NULL,
+    feature BLOB NOT NULL,
+    feature_dim INTEGER NOT NULL,
+    model_version TEXT NOT NULL,
+    created_at TEXT NOT NULL,
+    FOREIGN KEY(student_id) REFERENCES students(id)
+);
+```
+
+## 9.3 head_count_records
+
+```sql
+CREATE TABLE IF NOT EXISTS head_count_records (
+    id INTEGER PRIMARY KEY AUTOINCREMENT,
+    image_path TEXT NOT NULL,
+    result_image_path TEXT,
+    head_count INTEGER NOT NULL,
+    detections_json TEXT NOT NULL,
+    confidence_threshold REAL NOT NULL,
+    nms_threshold REAL NOT NULL,
+    model_version TEXT NOT NULL,
+    elapsed_ms INTEGER,
+    created_at TEXT NOT NULL
+);
+```
+
+## 9.4 face_match_records
+
+```sql
+CREATE TABLE IF NOT EXISTS face_match_records (
+    id INTEGER PRIMARY KEY AUTOINCREMENT,
+    input_image_path TEXT NOT NULL,
+    aligned_face_path TEXT,
+    matched_student_id INTEGER,
+    score REAL,
+    threshold REAL NOT NULL,
+    status TEXT NOT NULL,
+    model_version TEXT NOT NULL,
+    elapsed_ms INTEGER,
+    created_at TEXT NOT NULL,
+    FOREIGN KEY(matched_student_id) REFERENCES students(id)
+);
+```
+
+## 9.5 后续扩展表
+
+完整课堂打卡时建议增加：
+
+```sql
+CREATE TABLE IF NOT EXISTS attendance_sessions (
+    id INTEGER PRIMARY KEY AUTOINCREMENT,
+    class_name TEXT,
+    course_name TEXT,
+    image_path TEXT NOT NULL,
+    result_image_path TEXT,
+    expected_count INTEGER,
+    detected_count INTEGER,
+    recognized_count INTEGER,
+    unknown_count INTEGER,
+    created_at TEXT NOT NULL
+);
+
+CREATE TABLE IF NOT EXISTS attendance_records (
+    id INTEGER PRIMARY KEY AUTOINCREMENT,
+    session_id INTEGER NOT NULL,
+    student_id INTEGER,
+    face_image_path TEXT,
+    bbox_json TEXT,
+    score REAL,
+    status TEXT NOT NULL,
+    created_at TEXT NOT NULL,
+    FOREIGN KEY(session_id) REFERENCES attendance_sessions(id),
+    FOREIGN KEY(student_id) REFERENCES students(id)
+);
+```
+
+## 10. 文件和目录规范
+
+## 10.1 目录用途
+
+| 目录 | 用途 |
+| --- | --- |
+| `config/` | 配置文件 |
+| `models/head_detect/` | 大图人头检测模型 |
+| `models/face_recognition/` | 人脸检测和特征模型 |
+| `data/images/` | 原始图片 |
+| `data/faces/` | 注册人脸、对齐人脸、裁剪人脸 |
+| `data/results/` | 带框结果图 |
+| `data/database/` | SQLite 数据库 |
+| `data/logs/` | 日志文件 |
+| `include/` | 头文件 |
+| `src/` | 源码 |
+| `tools/` | 独立工具程序 |
+| `tests/` | 单元测试 |
+| `docs/` | 设计文档 |
+
+## 10.2 文件命名
+
+拍照图片：
+
+```text
+data/images/capture_YYYYMMDD_HHMMSS.jpg
+```
+
+人头统计结果图：
+
+```text
+data/results/head_count_YYYYMMDD_HHMMSS.jpg
+```
+
+注册人脸图：
+
+```text
+data/faces/register_{student_no}_YYYYMMDD_HHMMSS.jpg
+```
+
+单张匹配对齐图：
+
+```text
+data/faces/match_aligned_YYYYMMDD_HHMMSS.jpg
+```
+
+## 11. 模块间数据流
+
+## 11.1 head_count 数据流
+
+| 阶段 | 输入 | 输出 |
+| --- | --- | --- |
+| 图片读取 | `image_path` | `cv::Mat original` |
+| 预处理 | `original` | `model_input`, `LetterboxInfo` |
+| 推理 | `model_input` | raw tensors |
+| 后处理 | raw tensors, `LetterboxInfo` | `vector<Detection>` |
+| 统计 | `vector<Detection>` | `head_count` |
+| 画图 | `original`, detections | result image |
+| 入库 | result object | record ID |
+
+## 11.2 face_match 数据流
+
+| 阶段 | 输入 | 输出 |
+| --- | --- | --- |
+| 图片读取 | `image_path` | `cv::Mat image` |
+| 人脸检测 | `image` | `vector<FaceDetection>` |
+| 人脸选择 | detections | `FaceDetection selected` |
+| 对齐 | image, selected | `cv::Mat aligned_face` |
+| 特征提取 | aligned_face | `FaceFeature query` |
+| 模板读取 | database | `vector<FaceTemplate>` |
+| 匹配 | query, templates | `FaceMatchResult` |
+| 入库 | result | record ID |
+
+## 12. 性能和资源约束
+
+IMX6ULL_ALPHA 板端性能有限，代码实现时需要遵守以下约束：
+
+1. 模型只加载一次，不能每张图片重新加载。
+2. 尽量使用较小输入尺寸，例如 `320x320`、`416x416`。
+3. 大图检测优先使用 letterbox，不直接拉伸变形。
+4. 数据库特征模板加载后可以缓存，注册新人后再刷新缓存。
+5. 图片保存默认 JPEG，避免保存 BMP/PNG 大文件。
+6. 日志不要频繁写大量检测框细节，调试时再打开 `DEBUG`。
+7. 避免在循环中频繁创建大 `cv::Mat`，可复用缓冲区。
+8. 第一阶段采用单线程顺序流程，确保稳定后再考虑多线程。
+
+## 13. 测试计划
+
+## 13.1 单元测试
+
+| 测试文件 | 测试内容 |
+| --- | --- |
+| `tests/test_head_postprocess.cpp` | NMS、坐标映射、置信度过滤 |
+| `tests/test_face_similarity.cpp` | 余弦相似度、特征归一化 |
+| `tests/test_config.cpp` | 配置读取和默认值 |
+| `tests/test_database.cpp` | 数据库初始化、插入、查询 |
+
+## 13.2 PC 端集成测试
+
+1. 使用本地图片测试 `head_count`。
+2. 注册 5 名学生。
+3. 用同一学生不同图片测试 `face_match`。
+4. 用未注册人员图片测试 `unknown`。
+5. 删除模型文件，确认返回 `MODEL_LOAD_ERROR`。
+6. 输入不存在图片，确认返回 `FILE_IO_ERROR`。
+
+## 13.3 板端测试
+
+1. 摄像头能正常拍照并保存。
+2. `head_count` 能在板端完成一次推理。
+3. `face_match` 能在板端完成一次推理。
+4. 连续运行 20 次不崩溃。
+5. 断电重启后数据库仍可读取。
+6. 存储空间不足时能返回明确错误。
+
+## 14. 第一阶段验收标准
+
+## 14.1 大图人头数统计
+
+1. 能读取本地课堂大图。
+2. 能输出检测到的人头数量。
+3. 能保存带框结果图。
+4. 能将检测记录写入 SQLite。
+5. 命令行能输出文本结果和 JSON 结果。
+
+## 14.2 单张人脸匹配
+
+1. 能注册至少 5 名学生。
+2. 能保存学生信息和特征向量。
+3. 能识别已注册学生。
+4. 能将未注册人员识别为 `unknown`。
+5. 能保存识别记录。
+
+## 15. 待确认决策项
+
+以下问题建议在开始编码核心模型模块前确认。
+
+| 编号 | 决策项 | 推荐默认值 | 影响 |
+| --- | --- | --- | --- |
+| D1 | 推理框架选 ncnn 还是 TFLite | ncnn | 影响模型转换和 C++ 调用方式 |
+| D2 | 大图统计检测对象是人头还是人脸 | 人头优先，脸部可备选 | 人头对低头、侧脸更稳；人脸方便后续识别 |
+| D3 | 摄像头类型 | 先 USB 摄像头 | OpenCV 采集更容易调试 |
+| D4 | 人脸特征维度 | 按模型确定，优先 128 或 512 | 影响数据库 BLOB 大小和匹配速度 |
+| D5 | 注册时一人保存几个模板 | 第一阶段 1 个 | 多模板准确率更高，但管理更复杂 |
+| D6 | 多人脸输入单张匹配时如何处理 | 选择最大人脸 | 简单可演示，但可能误选背景人脸 |
+| D7 | 匹配阈值 | 初始 0.75 | 需要通过测试集调参 |
+| D8 | 是否加入 LCD/Web 界面 | 第一阶段不加 | 先保证算法和数据链路稳定 |
+| D9 | 是否做完整大图多人脸识别 | 第二阶段做 | 第一阶段先验证两个基础能力 |
+
+## 16. 建议实施顺序
+
+1. 建立基础 CMake 工程。
+2. 实现 `Status`、`Result<T>`、日志、时间工具。
+3. 实现配置读取。
+4. 实现 SQLite 初始化和基础表结构。
+5. 实现图片读写、画框、letterbox、坐标映射。
+6. 实现 `head_count` 的后处理和假数据测试。
+7. 接入真实人头检测模型。
+8. 实现人脸特征相似度和数据库模板读写。
+9. 接入人脸检测、人脸对齐、特征提取模型。
+10. 打通 `register_face`。
+11. 打通 `face_match`。
+12. 在 PC 端完整测试。
+13. 交叉编译部署到 IMX6ULL_ALPHA。
+14. 根据板端耗时和准确率调整模型和阈值。
+
+## 17. 后续扩展接口预留
+
+完整课堂打卡可以新增以下业务接口。
+
+```cpp
+struct AttendanceSessionRequest {
+    std::string image_path;
+    std::string class_name;
+    std::string course_name;
+    bool save_result_image;
+};
+
+struct AttendanceRecord {
+    int64_t student_id;
+    std::string student_no;
+    std::string name;
+    Detection face_box;
+    float score;
+    std::string status;
+    std::string face_image_path;
+};
+
+struct AttendanceSessionResult {
+    int64_t session_id;
+    int detected_count;
+    int recognized_count;
+    int unknown_count;
+    std::vector<AttendanceRecord> records;
+    std::string image_path;
+    std::string result_image_path;
+};
+
+class AttendanceFlow {
+public:
+    Result<AttendanceSessionResult> Run(
+        const AttendanceSessionRequest& request
+    );
+};
+```
+
+后续完整打卡流程可以复用第一阶段已经实现的：
+
+1. 图片读取和保存。
+2. 人脸或人头检测。
+3. 人脸对齐。
+4. 特征提取。
+5. 特征匹配。
+6. SQLite 入库。
+
+## 18. 风险点和规避方案
+
+| 风险 | 表现 | 规避方案 |
+| --- | --- | --- |
+| 板端推理慢 | 一张图处理十几秒以上 | 降低输入尺寸、换更小模型、只做演示级检测 |
+| 后排人脸太小 | 漏检严重 | 使用更高分辨率拍照、调整摄像头位置、使用人头检测模型 |
+| 光照差 | 检测和识别不稳定 | 固定摄像头曝光，增加补光，加入模糊/亮度检查 |
+| 阈值不合适 | 误识别或大量 unknown | 用真实班级图片调参 |
+| 数据库特征模型版本混乱 | 新旧特征不可比 | `face_templates` 必须保存 `model_version` |
+| 中文路径或姓名乱码 | 数据显示异常 | 源码和数据库统一 UTF-8 |
+| 摄像头驱动不稳定 | 拍照失败 | 先用 USB 摄像头，必要时改 V4L2 |
+
+## 19. 当前推荐结论
+
+第一阶段按以下路线推进最稳：
+
+1. 使用 C++17 + OpenCV + SQLite + ncnn。
+2. 先实现命令行程序，不做界面。
+3. 大图人头数统计和单张人脸匹配分开实现。
+4. 人头统计先只输出人数，不做人脸身份识别。
+5. 单张人脸匹配先要求输入清晰单人脸。
+6. 所有结果都保存到 SQLite，便于答辩展示和后续扩展。
+