Arn-Hai ToS Clause Classifier v1

This is a multilingual ToS / Privacy Policy clause classifier trained for the Arn-Hai browser extension prototype.

Task

Input: one ToS / Privacy Policy clause
Output: joint label in the format:

Category__RiskLevel
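
A minimal sketch of how a consumer might split the joint label back into its two parts, assuming the double underscore appears only as the separator. The helper `parse_joint_label`, the `ClauseLabel` type, and the risk level value `High` are hypothetical and only illustrate the format; they are not part of the released model.

```python
from typing import NamedTuple


class ClauseLabel(NamedTuple):
    category: str
    risk_level: str


def parse_joint_label(label: str) -> ClauseLabel:
    """Split a 'Category__RiskLevel' string into its two components."""
    # rpartition splits on the LAST '__', so the risk level is always the tail.
    category, sep, risk_level = label.rpartition("__")
    if not sep or not category or not risk_level:
        raise ValueError(f"not a joint label: {label!r}")
    return ClauseLabel(category, risk_level)


# Hypothetical label value, used only to show the shape of the output.
parsed = parse_joint_label("Unilateral Termination__High")
print(parsed.category, "|", parsed.risk_level)
```

Keeping the two parts separate downstream makes it possible to filter by risk level without string matching on the full label.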

Example:

v1 was trained primarily on English ToS / privacy datasets:

Thai support is experimental and should be improved with Thai PDPA / Thai privacy policy clauses.

Test:

Thai performance is still weak because v1 has little or no labeled Thai training data.
Predictions for rare classes such as Unilateral Termination may be unreliable.
Use the model together with a rule-based fallback and human review.
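
One way the fallback and review recommendation could be wired up is sketched below. Everything here is an assumption for illustration: the `classify` callable stands in for the model and is assumed to return a label and a confidence score, the 0.8 threshold is arbitrary, and the single keyword rule is a placeholder rather than a vetted legal pattern.

```python
import re
from dataclasses import dataclass
from typing import Callable, Optional, Tuple

# Hypothetical keyword rules; a real rule set would need legal review.
FALLBACK_RULES = [
    (
        re.compile(r"\bterminate\b.*\b(at any time|without notice)\b", re.I),
        "Unilateral Termination",
    ),
]


@dataclass
class Decision:
    label: Optional[str]   # joint label from the model, or category from a rule
    source: str            # "model", "rule", or "none"
    needs_review: bool     # True when a human should check the result


def decide(
    clause: str,
    classify: Callable[[str], Tuple[str, float]],
    min_confidence: float = 0.8,  # assumed threshold, tune on held-out data
) -> Decision:
    label, score = classify(clause)
    if score >= min_confidence:
        return Decision(label, "model", needs_review=False)
    # Low-confidence prediction: try the keyword rules before giving up.
    for pattern, category in FALLBACK_RULES:
        if pattern.search(clause):
            return Decision(category, "rule", needs_review=True)
    return Decision(None, "none", needs_review=True)


# Stub classifier standing in for the real model (always low confidence).
def low_confidence_stub(clause: str) -> Tuple[str, float]:
    return ("Other__Low", 0.3)


print(decide("We may terminate your account at any time.", low_confidence_stub))
```

In this sketch anything that did not come from a confident model prediction is flagged for human review, which errs on the side of caution for Thai clauses and rare classes.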

Safetensors

Model size: 0.3B params

Tensor type: F32