Automating API Information Storage with AWS - Technical Deep Dive into Automated API Information Storage System
Managing API information is a big challenge, demanding streamlined solutions for efficiency and reliability. My introduction to automating API informati...

Todd Bernson
2024-06-26

Managing API information is a big challenge, demanding streamlined solutions for efficiency and reliability. My introduction to automating API information storage showed the need for a sophisticated, automated approach to manage the deluge of API data.

Leveraging AWS's robustness, Docker's agility, and Terraform's precision, I proposed a system to simplify API documentation and discovery.
Here is the repo.
The Role of api_info.json
The automated system lies the api_info.json file, a standardized document that is pivotal in harmonizing API information across repositories. This JSON structure is designed to capture standard details of each API, including its description, version, lifecycle state, and security measures, ensuring a comprehensive database. For instance, the API_version field tracks versioning, enabling developers to access historical and current data seamlessly while API_Security outlines authentication and encryption methods, bolstering security protocols.
{
"Custody Activity Monitoring": {
"API_description": "Enhance operational efficiencies with frequent access to data. Monitor financial transactions and account activity with real-time updates.",
"API_version": "vX",
"API_Provider_APM_number": "APMXXXXXX",
"API_Consumers_APM_numbers": [
"APMXXXXX1",
"APMXXXXX2",
"APMXXXXX3"
],
"Lifecycle_State": "Production",
"Environment": "Production",
"Visibility": "Public",
"Intended_Consumers": "External",
"API_Design": "REST",
"Request_data_format": "JSON",
"Response_data_format": [
"JSON",
"XML"
],
"API_Owner_LAN_ID": "OwnerIDXXX",
"Backend_URL": "https://genericdomain.com/api/version/activity",
"Frontend_URL": "https://apigateway.domain.com/api/version/activity",
"API_Operations": [
"GET retrieve activity",
"POST add activity",
"PUT update activity"
],
"Dependency_APIs_or_Services": [
"APMXXXX100",
"APMXXXX200",
"APMXXXX300"
],
"API_Gateway_Location": "Location",
"API_Gateway_instance_info": "API Gateway Instance Details",
"API_Security": {
"Authentication": "Authentication Method",
"Authorization": "Authorization Method",
"Encryption": [
"Encryption Method 1",
"Encryption Method 2"
]
}
}
}
Setting up the Infrastructure with Terraform
Transitioning to infrastructure, Terraform is the cornerstone of deploying our AWS resources, including the DynamoDB table. The benefits of using Terraform extend beyond automation; it provides a repeatable, error-free deployment process, ensuring consistency across environments. Since this was a PoC, there is very little Terraform:
module "dynamo" {
source = "terraform-aws-modules/dynamodb-table/aws"
name = var.environment
server_side_encryption_enabled = false
deletion_protection_enabled = false
hash_key = "API_NAME"
table_class = "STANDARD"
ttl_enabled = false
attributes = [{
name = "API_NAME"
type = "S"
}]
tags = var.tags
}
Dockerizing the Python Application
Dockerization encapsulates the environment required to run our Python automation scripts, guaranteeing consistency and reliability. This segment explores the Dockerfile, detailing the process of building a Docker image that houses our Python application. The advantages of Docker are manifold, offering an isolated environment that mirrors production settings, thereby enhancing testing accuracy and deployment efficiency. In production, this would likely be run off a webhook from the git repository or in the CI/CD pipeline.
FROM python:3.12
WORKDIR /usr/src/app
COPY ./app /usr/src/app/
COPY api_info.json /usr/src/app
RUN pip install --no-cache-dir -r requirements.txt
ARG TABLE_NAME
ENV TABLE_NAME ${TABLE_NAME}
CMD ["python", "./app.py"]
Querying and Using the API Information
A significant advantage of our system is the ease with which developers can query the DynamoDB table for API information. This part showcases practical examples of how developers can leverage this data in application development and API discovery, facilitating a more efficient and informed development process.
For the PoC, I'm using AWS CLI. However, this could sit behind a static frontend, API gateway, and lambda to gain the proper information.
I've shown the integral role each component plays in the overarching system. From the standardization brought about by api_info.json, to the deployment efficiency of Terraform, and Docker and the seamless integration with CI/CD pipelines, each element contributes to a streamlined, efficient management of API documentation. As I refine and enhance this system, I invite feedback and suggestions from the developer community, fostering an environment of continuous improvement and innovation in API management.
Visit my website here.
Read More
View all posts
AI/ML
Why Enterprise AI Must Be Application-Led, Not Agent-Led
A deep dive by Todd Bernson, CTO and Chief AI Officer, on why enterprise AI systems should be architected as application-led, deterministic platforms with embedded agentic AI—not fully autonomous agents. This article explains how API-first, governed, multi-channel architectures deliver higher reliability, compliance, scalability, and business value in real-world Fortune-500 environments.

Todd Bernson
2025-12-02

AI/ML
Application-First Agentic AI
Application-first agentic AI is emerging as the only reliable path to real enterprise ROI. In this in-depth analysis, Todd Bernson, CTO & CAIO, breaks down why most generative AI initiatives stall in production—and how disciplined enterprise architecture, deterministic workflows, and narrowly scoped AI agents can finally unlock repeatable business value. Using a real sprint-intelligence system as a case study, the article shows how organizations can combine serverless engineering, structured orchestration, and constrained LLM reasoning to reduce reporting effort, increase trust, eliminate hallucinations, and deliver actionable insights across engineering, operations, compliance, and customer experience.

Todd Bernson
2025-11-28
AI/ML
Why 95% of AI Projects Fail and How to Be Among the 5% That Succeed

Lee Hylton
2025-08-22