Authentication
Create an API key in the user center after sign-in. Requests support Authorization: Bearer <API_KEY>、Authorization: <API_KEY> or X-API-Key: <API_KEY>。
API Docs
The API currently supports API-key authentication and returns summary categories, public label codes, label names, and scores for each text.
Create an API key in the user center after sign-in. Requests support Authorization: Bearer <API_KEY>、Authorization: <API_KEY> or X-API-Key: <API_KEY>。
https://www.aimoderations.com/api/v1/moderation/check/
Each request consumes quota. Exact deduction rules are shown in the signed-in user center.
Returns the top-level risk category, primary label code, primary label name, public label list, summary categories, remaining quota, and current quota cost.
The website demo, user center, and API share the same public label schema, making results easier for product, operations, and customers to understand.
Suitable for comment moderation, community text moderation, live-chat interception, and LLM output risk control before release.
Available both as a public-cloud API and as a private deployment option for stricter business needs.
Supports multiple policy templates and also manual configuration by public label code, top-level category, and threshold.
New users receive 1000 free credits, and approved company verification grants another 50000 free credits.
curl -X POST 'https://www.aimoderations.com/api/v1/moderation/check/' \
-H 'Authorization: your_api_key' \
-H 'Content-Type: application/json' \
-d '{
"text": "Add me on WeChat and I will show you more explicit content"
}'
{
"ok": true,
"quota_remaining": 999,
"result": {
"text_length": 56,
"quota_cost": 1,
"primary_category": "Spam & Promotion",
"primary_label_code": "offsite_lead_gen",
"primary_label": "Off-platform Diversion",
"summary_categories": [
{
"name": "Spam & Promotion",
"score": 0.98,
"labels": [
{
"code": "offsite_lead_gen",
"label": "Off-platform Diversion",
"score": 0.98,
"definition": "The text tries to move users off the current platform through add-friend, DM, or diversion wording."
}
]
}
],
"predicted_labels": ["Off-platform Diversion", "Contact Info Promotion"]
}
}
| Field | Type | Description |
|---|---|---|
quota_remaining | integer | Remaining quota after the request. |
text_length | integer | Total character count of the submitted text. |
quota_cost | integer | Quota consumed by this request. |
primary_category | string | Top-level risk category for routing business actions. |
primary_label_code | string | Public label code, suitable for policy rules. |
primary_label | string | Public label name, suitable for operators and customers. |
summary_categories | array | Summary buckets grouped by top-level risk type, each with labels, scores, and definitions. |
predicted_scores | array | Public labels above threshold, each with code, label, score, and definition. |
top_scores | array | TopK public labels for advanced strategies; the website demo only shows items above threshold by default. |
primary_category and primary_label_code。quota_cost、primary_category and primary_label_code。Top-level Risk Categories
Detects explicit sexual content, suggestive flirting, solicitation, minor-related sexual risk, and adult content distribution.
Covers gambling, drugs, fraud, underground trade, privacy abuse, and circumvention-related violations.
Detects sensitive expressions related to public institutions, political figures, regional conflict, historical controversy, and ideology.
Detects personal attacks, discriminatory insults, hate speech, and hostile expressions.
Detects off-platform diversion, contact info drops, ecommerce promotion, financial marketing, and recruitment spam.
Detects violent harm, terrorism, self-harm risk, and weapons or dangerous goods related content.
Captures noisy text, prompt-abuse signals, and edge-case risks that need secondary review.
No clear violation is detected, or the text falls into neutral discussion and normal daily-life context.
Public Label Catalog
Below is a preview of the public label system exposed to customers. Only three representative labels are shown for each top-level category on the website. Contact support for the full catalog.
Three representative labels are shown here. Total labels: 7 Contact support for more details.
sexual_solicitation
The text contains prostitution, sexual service solicitation, or adult traffic diversion.
sexual_teasing
The text contains teasing, suggestive, or borderline sexual expressions.
explicit_sexual_content
The text directly describes sexual acts, sexual crimes, or explicit adult scenes.
Three representative labels are shown here. Total labels: 8 Contact support for more details.
gambling_and_betting
The text involves gambling platforms, betting activities, or abnormal wagering guidance.
drugs_and_contraband
The text involves drugs, prohibited medicine, or other restricted goods.
fraud_and_blackmarket
The text involves fraud, forgery, pyramid schemes, organized crime, or other black/grey market activities.
Three representative labels are shown here. Total labels: 5 Contact support for more details.
public_affairs_sensitive
The text involves governments, institutions, regulations, or sensitive public governance discussion.
political_figure_sensitive
The text involves political figures, major public figures, or related controversial expressions.
separatism_and_regional_conflict
The text involves sovereignty disputes, separatist rhetoric, or escalated regional conflict expressions.
Three representative labels are shown here. Total labels: 2 Contact support for more details.
abusive_attack
The text contains obvious personal attacks, verbal abuse, or malicious provocation.
discriminatory_hate
The text contains discriminatory insults toward groups based on region, gender, profession, appearance, or ethnicity.
Three representative labels are shown here. Total labels: 5 Contact support for more details.
offsite_lead_gen
The text tries to move users off the current platform through add-friend, DM, or diversion wording.
contact_exchange
The text contains WeChat, QQ, phone numbers, URLs, public accounts, or other contact entry points.
commerce_marketing
The text includes product promotion, ecommerce sales, or training-course marketing.
Three representative labels are shown here. Total labels: 4 Contact support for more details.
violent_harm
The text involves gore, physical harm, murder, or hired violence.
terror_extremism
The text involves terrorist organizations, terrorist incidents, or extremist content.
self_harm
The text involves suicide, self-harm, or expressions of harming oneself.
Three representative labels are shown here. Total labels: 2 Contact support for more details.
noisy_text
The text is dominated by gibberish, repetition, spammy filler, or low-information content.
other_risk_signal
Non-standard but noteworthy risk signals are detected and should be judged with additional business rules.
Three representative labels are shown here. Total labels: 3 Contact support for more details.
safe_pass
No clear policy violation is detected in the current text.
safe_context
The text contains neutral discussion, objective statements, or ordinary mentions that do not directly form a risk.
lifestyle_normal
The text is mainly about daily life, health, parenting, or ordinary social topics.
The platform includes built-in OpenAPI docs for field debugging, header validation, and quick response checks.
curl -X POST 'https://www.aimoderations.com/api/v1/moderation/check/' \
-H 'Authorization: your_api_key' \
-H 'Content-Type: application/json' \
-d '{
"text": "Add me on WeChat and I can share more resources"
}'
Built-in OpenAPI docs make it easy to debug fields and validate responses.
Open Swagger