The datasets contain data on the use of the virtual telephony service CallMeMaybe. Its clients are organizations that need to distribute large numbers of incoming calls among various operators, or make outgoing calls through their operators. Operators can also make internal calls to communicate with one another. These calls go through CallMeMaybe's network.
Give supervisors information on the least effective operators.
Identify operators that have:
- A large number of missed incoming calls (internal and external).
- A long waiting time for incoming calls.
- A small number of attempts to make a call.
- Data preprocessing
- EDA
- Define effective & ineffective operators
- Give a rank for each operator based on something like RFM analysis:
- Number of missed incoming calls
- Waiting time for incoming calls
- Number of outgoing calls
- Give a rank for each operator based on something like RFM analysis:
- Test statistical hypotheses
The dataset telecom_dataset_us.csv
contains the following columns:
user_id
— client account IDdate
— date the statistics were retrieveddirection
— call direction (out
for outgoing,in
for incoming)internal
— whether the call was internal (between a client's operators)operator_id
— operator identifieris_missed_call
— whether the call was missedcalls_count
— number of callscall_duration
— call duration (excluding waiting time)total_call_duration
— call duration (including waiting time)
The dataset telecom_clients_us.csv
has the following columns:
user_id
tariff_plan
— client's current plandate_start
— client's registration date