Django Strategies for processing large datasets from an external API without a local replica table

I am building an SMS gateway system in Django that acts as an interface between university data sources (e.g., student information systems) and an SMS provider.

The Architecture: To ensure data freshness and avoid redundancy, we decided not to maintain a local Recipient table (mirroring the external database). Instead, we use a Stateless/Proxy architecture:

  1. User selects filters (e.g., "Faculty: Engineering") in the frontend.

  2. Backend fetches the student list in real-time from the external API.

  3. We iterate through the results and create a "Snapshot" log (SmsDispatch model) for history/reporting (a rough model sketch follows this list).

  4. We send the SMS.
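For reference, the snapshot model is roughly this shape (a sketch only; the field names here are placeholders rather than the real schema):

```python
# models.py -- rough sketch of the snapshot log; field names are placeholders
from django.db import models


class SmsDispatch(models.Model):
    campaign_id = models.UUIDField(db_index=True)            # groups one send run
    student_external_id = models.CharField(max_length=64)    # ID from the external API
    phone_number = models.CharField(max_length=20)
    message = models.TextField()
    status = models.CharField(max_length=20, default="pending")  # pending/sent/failed
    created_at = models.DateTimeField(auto_now_add=True)

    class Meta:
        # One row per recipient per campaign; also useful for idempotent retries.
        constraints = [
            models.UniqueConstraint(
                fields=["campaign_id", "student_external_id"],
                name="uniq_dispatch_per_campaign",
            )
        ]
```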

The Problem: The external API might return large datasets (e.g., 20,000+ students). I am concerned about:

  1. Memory (OOM): Loading 20k objects into a Python list before saving to the DB.

  2. Timeouts: The external API being slow, causing the HTTP request or the Celery task to hang.

The AI-suggested approach: use Celery with Python generators and Django's bulk_create to stream the data from the external API and write it to the database in batches.
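Roughly what that would look like (a minimal sketch only; the API URL, pagination scheme, and response/field names are assumptions on my part):

```python
# tasks.py -- sketch of the "generator + batch insert" idea.
# The API URL, pagination parameters, and response shape are assumptions.
import requests
from celery import shared_task

from .models import SmsDispatch

BATCH_SIZE = 500


def iter_students(filters, page_size=500):
    """Lazily yield students one page at a time from the external API."""
    page = 1
    while True:
        resp = requests.get(
            "https://university.example.com/api/students",  # placeholder URL
            params={**filters, "page": page, "page_size": page_size},
            timeout=30,  # keep a slow upstream from hanging the task forever
        )
        resp.raise_for_status()
        results = resp.json()["results"]
        if not results:
            return
        yield from results
        page += 1


@shared_task
def dispatch_campaign(campaign_id, filters, message):
    """Write SmsDispatch rows in fixed-size batches instead of one 20k list."""
    batch = []
    for student in iter_students(filters):
        batch.append(
            SmsDispatch(
                campaign_id=campaign_id,
                phone_number=student["phone"],
                student_external_id=student["id"],
                message=message,
            )
        )
        if len(batch) >= BATCH_SIZE:
            SmsDispatch.objects.bulk_create(batch, ignore_conflicts=True)
            batch.clear()
    if batch:
        SmsDispatch.objects.bulk_create(batch, ignore_conflicts=True)
```

The intent is that only BATCH_SIZE objects are ever held in memory at once, and ignore_conflicts together with the unique constraint above is meant to keep re-runs from inserting duplicate snapshot rows.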

My questions:

  1. Is the choice not to store the data locally a sound approach?

  2. Is this "Generator + Batch Insert" pattern the standard way to handle this in Django/Celery to prevent OOM errors?

  3. How should I handle partial failures? If the external API fails on page 5 of 10, how do I resume without duplicating the SMS for the first 4 pages? (A sketch of what I'm considering follows this list.)

  4. Is the decision to NOT save recipients locally a viable strategy for this scale (20k+ users), or is the reliance on an external API for every campaign considered an anti-pattern in production SMS systems?
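For question 3, this is the kind of per-page checkpointing I was considering (purely a sketch; CampaignCheckpoint, fetch_student_page, and the retry policy are my assumptions, not existing code):

```python
# Hypothetical sketch: persist a per-campaign checkpoint so a retried task
# skips pages that were already written. Model and helper names are assumptions.
import requests
from celery import shared_task
from django.db import models

from .models import SmsDispatch


class CampaignCheckpoint(models.Model):
    # Would live in models.py; shown inline for completeness.
    campaign_id = models.UUIDField(unique=True)
    last_completed_page = models.IntegerField(default=0)


def fetch_student_page(filters, page, page_size=500):
    """Hypothetical helper: fetch a single page from the external API."""
    resp = requests.get(
        "https://university.example.com/api/students",  # placeholder URL
        params={**filters, "page": page, "page_size": page_size},
        timeout=30,
    )
    resp.raise_for_status()
    return resp.json()["results"]


@shared_task(bind=True, max_retries=5, default_retry_delay=60)
def dispatch_campaign_resumable(self, campaign_id, filters, message):
    checkpoint, _ = CampaignCheckpoint.objects.get_or_create(campaign_id=campaign_id)
    page = checkpoint.last_completed_page + 1
    while True:
        try:
            students = fetch_student_page(filters, page)
        except requests.RequestException as exc:
            # Retry the whole task; pages already recorded are skipped next time.
            raise self.retry(exc=exc)
        if not students:
            break
        SmsDispatch.objects.bulk_create(
            [
                SmsDispatch(
                    campaign_id=campaign_id,
                    phone_number=s["phone"],
                    student_external_id=s["id"],
                    message=message,
                )
                for s in students
            ],
            ignore_conflicts=True,  # unique constraint keeps re-runs idempotent
        )
        # Advance the checkpoint only after the page is fully persisted.
        checkpoint.last_completed_page = page
        checkpoint.save(update_fields=["last_completed_page"])
        page += 1
```

The actual SMS sending would then work off SmsDispatch rows still marked "pending", so a resumed run should not re-send messages that already went out.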
