feat(repository): add language statistics analysis feature

- Remove data directory from gitignore to include language data
- Add build script to parse linguist languages.yml and generate static mappings
- Include serde and serde_yml dependencies for YAML parsing
- Add lang_stats module with language detection and statistics calculation
- Generate protobuf definitions for language statistics API endpoints
- Implement GetLanguageStats RPC endpoint in repository server
- Add comprehensive test suite for language statistics functionality
- Include extension and filename based language detection logic
- Implement binary file classification and group resolution features
This commit is contained in:
zhenyi
2026-06-10 13:06:59 +08:00
parent 9a0c26e5f6
commit 939931acad
10 changed files with 10202 additions and 1 deletions
+2
View File
@@ -46,3 +46,5 @@ path = "main.rs"
[build-dependencies]
tonic-prost-build = "0.14"
serde_yml = "0.0.12"
serde = { version = "1", features = ["derive"] }