BSC Analytics

Deploying the Churn Prediction Model in Production

By Todd Bernson
Chief Technical Officer BSC Analytics

20 Nov 2024

Overview

Deploying a churn prediction model into production is the final step in leveraging machine learning insights to drive business decisions. BigQuery ML simplifies this process by enabling both batch and real-time predictions using SQL queries. This article outlines the deployment steps, compares batch and real-time prediction approaches, and discusses cost management strategies to ensure efficient operations.

Batch Predictions vs. Real-Time Predictions

Batch Predictions

Batch predictions are suitable for periodic updates where predictions don’t need to be generated instantly. This approach is ideal for churn prediction in scenarios like:

Weekly or monthly churn reports.
Customer retention campaigns.

Advantages:

Lower costs as predictions are run less frequently.
Simpler implementation, as they rely on scheduled queries.

Disadvantages:

Not suitable for use cases requiring immediate insights.

Real-Time Predictions

Real-time predictions are triggered for specific events, such as when a customer interacts with a service. This approach is ideal for:

Dynamic churn prediction during customer interactions.
Triggering personalized offers or interventions.

Advantages:

Instant insights allow for immediate action.
Enhances customer experience with timely responses.

Disadvantages:

Higher costs due to constant computation.
Requires more complex infrastructure, such as event-driven architectures.

Prediction Queries in BigQuery

Batch Prediction Query

The following query predicts churn for all customers and writes the results to a BigQuery table for downstream analysis:

CREATE OR REPLACE TABLE `<DATASET_NAME>.predictions` AS
SELECT
  customerID,
  predicted_label AS churn_prediction,
  predicted_probability AS churn_probability,
  *
FROM
  ML.PREDICT(MODEL `<DATASET_NAME>.<MODEL_NAME>`, 
             TABLE `<DATASET_NAME>.<INPUT_TABLE_NAME>`);

Explanation:

ML.PREDICT: Generates predictions using the specified model.
predicted_label: Indicates whether a customer is predicted to churn.
predicted_probability: Provides the confidence score for the prediction.

Real-Time Prediction Query

For real-time predictions, a single customer’s data can be used as input:

SELECT
  customerID,
  predicted_label AS churn_prediction,
  predicted_probability AS churn_probability
FROM
  ML.PREDICT(MODEL `<DATASET_NAME>.<MODEL_NAME>`, 
             (SELECT * FROM `<DATASET_NAME>.<INPUT_TABLE_NAME>` WHERE customerID = 'CUST123'));

Explanation:

Filters the input table to focus on a specific customer (CUST123).
Returns immediate churn predictions for personalized actions.

Integration with Cloud Functions

To enable real-time predictions, you can use Cloud Functions and Pub/Sub to automate the process. For example, trigger a Cloud Function whenever a customer event is logged.

Cloud Function Code Example

import os
from google.cloud import bigquery

def predict_churn(event, context):
    client = bigquery.Client()
    dataset_name = os.environ.get("DATASET_NAME")
    model_name = os.environ.get("MODEL_NAME")
    customer_id = event['attributes']['customerID']

    query = f"""
    SELECT
        predicted_label AS churn_prediction,
        predicted_probability AS churn_probability
    FROM
        ML.PREDICT(MODEL `{dataset_name}.{model_name}`, 
                   (SELECT * FROM `{dataset_name}.customers` WHERE customerID = '{customer_id}'))
    """
    
    results = client.query(query).result()
    for row in results:
        print(f"CustomerID: {customer_id}, Churn Prediction: {row.churn_prediction}, Probability: {row.churn_probability}")

Explanation:

Event-Driven Architecture: The function is triggered by a Pub/Sub event containing the customerID.
BigQuery Integration: Executes a query to predict churn for the specified customer.

Cost Management

Running predictions at scale can incur significant costs. Here are some strategies to optimize spending:

Batch Predictions: Schedule predictions during non-peak hours to leverage GCP’s cost-saving features.
Selective Real-Time Predictions: Limit real-time predictions to high-value customers or critical interactions.
BigQuery Slot Reservations: For frequent predictions, consider purchasing BigQuery slots to reduce on-demand query costs.

Code Snippets for Batch and Real-Time Predictions

Batch Prediction Scheduling with Cloud Scheduler

gcloud scheduler jobs create query churn-batch-predictions \
--schedule="0 2 * * *" \
--statement="CREATE OR REPLACE TABLE `<DATASET_NAME>.predictions` AS SELECT * FROM ML.PREDICT(MODEL `<DATASET_NAME>.<MODEL_NAME>`, TABLE `<DATASET_NAME>.<INPUT_TABLE_NAME>`);" \
--time-zone="UTC" \
--bigquery-use-default-project

Explanation:

Runs the batch prediction query nightly at 2 AM UTC.
Stores results in the predictions table for analysis.

Deploying a churn prediction model involves careful consideration of use cases, cost management, and infrastructure complexity. By leveraging BigQuery ML’s integration with Cloud Functions and Pub/Sub, you can implement both batch and real-time predictions tailored to business needs.

Google Cloud, BigQuery, Data Analytics, Machine Learning

Deploying the Churn Prediction Model in Production

Overview

Batch Predictions vs. Real-Time Predictions

Batch Predictions

Real-Time Predictions

Prediction Queries in BigQuery

Batch Prediction Query

Real-Time Prediction Query

Integration with Cloud Functions

Cloud Function Code Example

Cost Management

Code Snippets for Batch and Real-Time Predictions

Batch Prediction Scheduling with Cloud Scheduler

Related Posts

Related Articles

Inter-Region WireGuard VPN in AWS

Making PDFs Searchable Using AWS Textract and CloudSearch

Slack AI Bot with AWS Bedrock Part 2

Contact Us