corp_tag.json
Overview
corp_tag.json is a JSON data file that maps corporate or organizational names (primarily game companies, technology firms, and various enterprises) to an array of descriptive tags or categorizations. These tags classify the entities by attributes such as industry standing, specialization, or business characteristics.
The file serves as a reference or lookup table used in applications or systems that require labeling, categorizing, or filtering corporations based on pre-defined tags. The tags are often in Chinese, indicating classifications like "好游戏" (Good Games), "行业头部" (Industry Leader), "综合大厂" (Comprehensive Large Company), "猎" (Recruitment/Headhunting), "微众偏好" (WeBank Preference), "阅文偏好" (Yuewen Preference), and others.
This structured data can be used for analytics, reporting, recommendation systems, or filtering corporations in business intelligence or market analysis platforms.
Structure and Content
The file is organized as a single JSON object with:
Keys: Strings representing the name of a company or organization. These names are usually in English or Chinese, sometimes with additional qualifiers (e.g., location, subsidiaries).
Values: Arrays of strings, where each string is a tag describing the company.
Example Entry
"bilibili": [
"行业头部多元化",
"好游戏",
"阅文偏好"
]
Interpretation:
bilibiliis tagged as an "Industry Leader with diversification" (行业头部多元化), associated with "Good Games" (好游戏), and has a "Yuewen Preference" (阅文偏好).
Key Components
Entities (Companies/Organizations)
Represented as string keys.
Examples include:
Major gaming companies:
"腾讯","网易","育碧","动视".Technology giants:
"华为","字节跳动","阿里巴巴".Specialized studios:
"arkane","insomniac games".Others include service providers, software companies, investment firms, and more.
Tags (Values)
Each company is associated with one or more tags.
Tags provide metadata about the company's industry role, preferences, or business nature.
Common tags include:
"好游戏"— Good games (likely indicating quality or reputation in gaming)."行业头部"— Industry leader or top-tier."综合大厂"— Comprehensive large company."猎"— Related to recruitment, headhunting, or talent sourcing."微众偏好"— Preference by WeBank or related affiliation."阅文偏好"— Preference by Yuewen (a Chinese digital reading platform)."行业好公司"— Good company in the industry."软外"— Possibly refers to "software outsourcing"."行业头部多元化"— Industry leader with diversification.
Usage
The JSON file can be loaded into software systems for various purposes:
Tag Lookup: Given a company name, retrieve its tags for display or analytics.
Filtering: Select companies by tag, e.g., show all "行业头部" (industry leaders).
Recommendation: Suggest companies with similar tags.
Data Enrichment: Augment company profiles with tags for better classification.
Market Analysis: Understand the distribution of company types in a dataset.
Example Code Snippet (Python)
import json
# Load the JSON data
with open('corp_tag.json', 'r', encoding='utf-8') as f:
corp_tags = json.load(f)
# Retrieve tags for a company
company_name = "bilibili"
tags = corp_tags.get(company_name, [])
print(f"Tags for {company_name}: {tags}")
# Find all companies tagged as '行业头部' (Industry Leader)
industry_leaders = [corp for corp, tags in corp_tags.items() if "行业头部" in tags]
print(f"Industry leaders: {industry_leaders}")
Implementation Details
The file is a flat JSON object without nested structures beyond arrays of strings.
No algorithms or processing logic is embedded in the file; it is purely data.
The tags are mostly in Chinese with some English names mixed in.
Data appears curated with a wide range of companies and diverse tags reflecting multiple industries but focused heavily on gaming and technology.
Some company names have multiple aliases or variations listed separately (e.g.,
"腾讯","腾讯游戏","tencent").The file can be extended or updated with new companies or tags as needed.
Interaction with Other System Components
Data Source: This JSON file likely acts as an input or reference dataset for services or modules responsible for user interfaces, reporting dashboards, or recommendation engines.
Integration: Systems that process corporate data or user queries can use this file to augment information with tag-based metadata.
UI Components: Frontend applications might use this data to display company profiles with tags or to filter search results.
Analytics: Data analytics pipelines can reference this file to classify companies in datasets for segmentation or trend analysis.
APIs: Backend services might expose APIs that query this data, providing tag information to clients.
Visual Diagram
Since this file is a utility data file containing mappings between companies and their tags, a flowchart is appropriate to illustrate how the data is typically used.
flowchart TD
A[Load corp_tag.json] --> B{Query by Company Name}
B -- Exists --> C[Return Tags Array]
B -- Not Found --> D[Return Empty / Default]
C --> E[Display Tags in UI]
C --> F[Use Tags for Filtering]
F --> G[Filter Companies by Tag]
G --> H[Return Filtered List]
H --> E
Summary
File Type: JSON data file.
Purpose: Provides a mapping of company names to descriptive tags.
Content: Company names as keys; arrays of tags as values.
Tags: Mostly in Chinese; denote industry position, preferences, or company nature.
Usage: Reference for classification, filtering, and enrichment in software systems.
No executable code: Pure data file.
Integration: Supports UI display, analytics, and API services.
Visual: Flowchart showing data loading, querying, and usage in filtering/display.
This documentation should help developers, analysts, and system integrators understand the role and structure of corp_tag.json within the larger ecosystem.