β€œTranscendental-Programmer”
first commit
c3cc0a9

Dataset Plan for AI/ML Models

This document outlines the datasets required for implementing the AI/ML functionalities described in the ai_plan.md file. For each model, we'll provide the CSV structure and example records.

1. Intelligent Routing and Workflow Automation

1.1 Grievance Data (grievances.csv)

Fields:

  • grievance_id (string)
  • category (string)
  • submission_timestamp (datetime)
  • student_room_no (string)
  • hostel_name (string)
  • floor_number (integer)

Examples:

grievance_id,category,submission_timestamp,student_room_no,hostel_name,floor_number
G12345,plumber,2023-10-01T10:30:00Z,204,bh1,2
G12346,electricity,2023-10-02T08:15:00Z,101,bh2,1
G12347,internet,2023-10-03T14:20:00Z,305,bh3,3
G12348,water cooler,2023-10-04T09:45:00Z,102,ivh,1
G12349,sweeper,2023-10-05T11:00:00Z,201,gh,2
G12350,carpenter,2023-10-06T13:30:00Z,303,bh1,3

1.2 Staff Data (staff.csv)

Fields:

  • staff_id (string)
  • department (string)
  • current_workload (integer)
  • availability_status (string)
  • past_resolution_rate (float)

Examples:

staff_id,department,current_workload,availability_status,past_resolution_rate
S67890,plumber,5,Available,0.95
S67891,electricity,3,Available,0.92
S67892,internet,4,Unavailable,0.88
S67893,water cooler,2,Available,0.97
S67894,sweeper,6,Available,0.91
S67895,carpenter,1,Available,0.94

1.3 Historical Assignments (historical_assignments.csv)

Fields:

  • grievance_id (string)
  • staff_id (string)
  • assignment_timestamp (datetime)
  • resolution_time (string)
  • outcome (string)
  • floor_number (integer)

Examples:

grievance_id,staff_id,assignment_timestamp,resolution_time,outcome,floor_number
G12340,S67890,2023-09-25T09:00:00Z,2 hours,Resolved Successfully,1
G12341,S67891,2023-09-26T10:30:00Z,1 hour,Resolved Successfully,2
G12342,S67892,2023-09-27T14:15:00Z,3 hours,Unresolved,3
G12343,S67893,2023-09-28T11:45:00Z,30 minutes,Resolved Successfully,1
G12344,S67894,2023-09-29T08:30:00Z,1 hour,Resolved Successfully,2
G12345,S67895,2023-09-30T13:00:00Z,2 hours,Pending,3

1.4 Floor Metrics (floor_metrics.csv)

Fields:

  • hostel_name (string)
  • floor_number (integer)
  • date (date)
  • number_of_requests (integer)
  • total_delays (integer)

Examples:

hostel_name,floor_number,date,number_of_requests,total_delays
bh1,2,2023-10-01,20,2
bh2,1,2023-10-02,15,1
bh3,3,2023-10-03,18,3
ivh,1,2023-10-04,12,0
gh,2,2023-10-05,22,4

1.5 Availability Data (availability.csv)

Fields:

  • id (string)
  • type (string) # 'staff' or 'student'
  • date (date)
  • time_slot (string)
  • availability_status (string)

Examples:

id,type,date,time_slot,availability_status
S67890,staff,2023-10-02,09:00-12:00,Available
S67891,staff,2023-10-02,14:00-17:00,Unavailable
STU123,student,2023-10-02,10:00-12:00,Available
STU124,student,2023-10-02,14:00-16:00,Unavailable
S67892,staff,2023-10-03,08:00-11:00,Available
STU125,student,2023-10-03,09:00-11:00,Available

2. Advanced Sentiment and Emotional Intelligence Analysis

2.1 Grievance Sentiment Data (grievance_sentiment.csv)

Fields:

  • grievance_id (string)
  • text (string)
  • emotional_label (string)

Example:

grievance_id,text,emotional_label
G12347,"I am extremely frustrated with the constant power outages in my room.",Frustration
G12348,"Thank you for resolving my issue so quickly.",Satisfaction

3. Context-Aware Chatbots for Initial Grievance Handling

3.1 Conversation Dataset (conversations.csv)

Fields:

  • conversation_id (string)
  • user_message (string)
  • intent (string)
  • entities (string, JSON format)
  • bot_response (string)

Example:

conversation_id,user_message,intent,entities,bot_response
C12345,"I want to report a broken heater in my room.",submit_grievance,"{'grievance_type': 'Broken Heater', 'room_number': '305'}","I'm sorry to hear that. Could you please provide your room number?"

3.2 FAQs Dataset (faqs.csv)

Fields:

  • question_id (string)
  • question (string)
  • answer (string)

Example:

question_id,question,answer
Q12345,"How can I check the status of my grievance?","You can check the status by logging into your dashboard and navigating to the 'My Grievances' section."

4. Anomaly Detection in Grievance Patterns

4.1 Grievance Logs (grievance_logs.csv)

Fields:

  • grievance_id (string)
  • category (string)
  • timestamp (datetime)
  • description (string)

Example:

grievance_id,category,timestamp,description
G12350,Plumbing,2023-10-03T14:20:00Z,"Clogged drain in the kitchen area."

4.2 Anomaly Indicators (anomaly_indicators.csv)

Fields:

  • date (date)
  • category (string)
  • grievance_count (integer)
  • is_anomaly (boolean)

Example:

date,category,grievance_count,is_anomaly
2023-10-01,Electrical,50,false
2023-10-02,Electrical,150,true

5. Multilingual Translation in Chatroom

5.1 Translation Pairs (translation_pairs.csv)

Fields:

  • pair_id (string)
  • source_language (string)
  • target_language (string)
  • source_sentence (string)
  • target_sentence (string)

Example:

pair_id,source_language,target_language,source_sentence,target_sentence
TP12345,Hindi,English,"toilet me paani nahi aa rha hain","There is no water coming in the toilet."

5.2 Language Preferences (language_preferences.csv)

Fields:

  • user_id (string)
  • preferred_language (string)

Example:

user_id,preferred_language
U12345,Hindi
W67890,English

6. Worker Job Recommendation

6.1 Job Requests (job_requests.csv)

Fields:

  • job_id (string)
  • type (string)
  • description (string)
  • urgency_level (string)
  • submission_timestamp (datetime)
  • hostel_name (string)
  • floor_number (integer)
  • room_number (string)

Example:

job_id,type,description,urgency_level,submission_timestamp,hostel_name,floor_number,room_number
J12345,Electrical,"Fan not working in room 204.",High,2023-10-02T08:15:00Z,Hostel A,2,204

6.2 Worker Profiles (worker_profiles.csv)

Fields:

  • worker_id (string)
  • department (string)
  • current_workload (integer)
  • availability_status (string)
  • job_success_rate (float)

Example:

worker_id,department,current_workload,availability_status,job_success_rate
W67890,Electrical,3,Available,0.95

6.3 Historical Job Assignments (historical_job_assignments.csv)

Fields:

  • job_id (string)
  • worker_id (string)
  • assignment_timestamp (datetime)
  • resolution_time (string)
  • outcome (string)
  • hostel_name (string)
  • floor_number (integer)
  • room_number (string)

Example:

job_id,worker_id,assignment_timestamp,resolution_time,outcome,hostel_name,floor_number,room_number
J12340,W67885,2023-09-25T09:00:00Z,2 hours,Resolved Successfully,Hostel A,1,101

6.4 Worker Locations (worker_locations.csv)

Fields:

  • worker_id (string)
  • timestamp (datetime)
  • hostel_name (string)
  • floor_number (integer)
  • room_number (string)

Example:

worker_id,timestamp,hostel_name,floor_number,room_number
W67890,2023-10-02T08:00:00Z,Hostel A,2,210
data/
β”‚
β”œβ”€β”€ intelligent_routing/
β”‚   β”œβ”€β”€ grievances.csv
β”‚   β”œβ”€β”€ staff.csv
β”‚   β”œβ”€β”€ historical_assignments.csv
β”‚   β”œβ”€β”€ floor_metrics.csv
β”‚   └── availability.csv
β”‚
β”œβ”€β”€ sentiment_analysis/
β”‚   └── grievance_sentiment.csv
β”‚
β”œβ”€β”€ chatbot/
β”‚   β”œβ”€β”€ conversations.csv
β”‚   └── faqs.csv
β”‚
β”œβ”€β”€ anomaly_detection/
β”‚   β”œβ”€β”€ grievance_logs.csv
β”‚   └── anomaly_indicators.csv
β”‚
β”œβ”€β”€ multilingual_translation/
β”‚   β”œβ”€β”€ translation_pairs.csv
β”‚   └── language_preferences.csv
β”‚
└── job_recommendation/
    β”œβ”€β”€ job_requests.csv
    β”œβ”€β”€ worker_profiles.csv
    β”œβ”€β”€ historical_job_assignments.csv
    └── worker_locations.csv