Home Page - My Skillset
Executive Summary
I’m a full-stack developer and systems builder who connects data, automation, and practical AI to deliver
measurable outcomes. Since 2014, I’ve worked across e-commerce intelligence (Amazon/OA), high-volume web
scraping, Google Sheets/App Script control planes, serverless backends on Google Cloud, Chrome extensions,
and retrieval-augmented generation (RAG) flows using OpenAI, Gemini, or local (Ollama) models. My approach:
design reliable pipelines, preserve data quality (barcodes, prices, URLs), add observability, and keep the
UX fast and simple. I ship solutions that scale without drama—and are easy for teams to run daily.
Core Engineering Principles
- Reliability by design: resilient fetchers, retries/backoff, saved progress, resumable runs.
- Data hygiene first: preserve leading zeros, validate GTIN check-digits, normalize URLs/hosts.
- Small, composable services: REST/GraphQL APIs with clear contracts and structured logging.
- Security & secrets: least-privilege, token rotation, Secret Manager/KMS, CSP and SSRF awareness.
- Observability: Cloud Logging/Monitoring, error classes, latency budgets, actionable alerts.
- Human-in-the-loop: Sheets dashboards, color-coded states, concise summaries for quick review.
My Skills – API Integrations
- REST/GraphQL design with versioning, pagination, idempotency, and typed responses.
- Auth: OAuth 2.0, JWT, app-level tokens, key rotation on 429s with cooldown windows.
- Google Custom Search (CSE): query builders, date filters (e.g.,
dateRestrict), results cleansing, key/cx rotation.
- YouTube Data API: channel/video analytics, captions/comments processing, lead discovery.
- Shopify/Storefront/Admin: product, price, variants, inventory reads; webhooks and throttling.
- Stripe, Slack, Jira/Linear/ClickUp, Notion, Airtable, HubSpot: integrations for payments, alerts, and ops.
- OpenAI/Gemini/Ollama: embeddings, chat, function/tool calling for RAG and agents.
- Facebook/WhatsApp Business, Twilio: messaging flows with templates and opt-in compliance.
My Skills – Frontend
- HTML5/Accessibility: semantic structure, keyboard nav, ARIA patterns.
- CSS3: responsive layout, utility-first (Tailwind) or Bootstrap; component theming.
- JavaScript (ES6+): modular code, fetch/XHR, caching, client-side templating.
- WordPress: custom themes/plugins with performance and secure options pages.
- Perf: lazy images, async decoding, code-split, critical path CSS, CLS-safe UI.
My Skills – Backend
- Node.js (Express/Fastify) and Python (Flask/FastAPI) microservices.
- PHP for CMS glue and lightweight APIs when needed.
- AuthN/AuthZ: JWT/OAuth, signed URLs, HMAC checks, RBAC-ish guards.
- Resilience: connection pools, retries/jitter, circuit breakers, deadline/timeout discipline.
- File/CSV: streaming parsers, large merges, safe encodings (UTF-8), memory-bounded ops.
My Skills – Data & Storage
- Relational: Cloud SQL (Postgres/MySQL), migrations, indexes, EXPLAIN analysis.
- NoSQL: Firestore (document), Bigtable (wide-column) for hot KV or time-series access.
- Warehouse: BigQuery for aggregation/analytics, scheduled queries, cost controls.
- Caching: Memorystore (Redis/Memcached), cache keys, TTL strategies, stampede protection.
- Object storage: Cloud Storage for artifacts, signed downloads, lifecycle rules.
My Skills – Web Scraping & Automation
- Stacks: Python (requests/httpx, BeautifulSoup, lxml), concurrency (ThreadPool/multiprocessing).
- Anti-bot survival: user-agents, session reuse, proxy rotation, randomized pacing.
- Error handling: SSL quirks, 413/429/5xx backoff, resume-from-last progress markers.
- Extraction: JSON/JSON-LD parsing, robust CSS/XPath selectors, schema fallbacks.
- Outputs: CSVs with strict headers (
Barcode, price_extracted, source_url), direct Sheets writes.
My Skills – E-Commerce Intelligence (Amazon / OA)
- Matching: retailer product → Amazon listing via UPC/EAN/GTIN, title/variant heuristics, embedded JSON-LD.
- Signals: Keepa metrics (e.g., BSR thresholds), ROI%, profit margin, brand-listing risk, stock/price deltas.
- Trust checks: website HQ, payment methods, CC acceptance, brand vs third-party listing risk.
- Workflows: per-retailer collections scrape, page limits/time caps, batch zip outputs, auto-download.
- Sheets ops: status color-coding, dedupe, URL cleaning, barcode check digits, PH-timezone stamps.
My Skills – Google Apps Script & Spreadsheet Ops
- CSE pipelines: date filters, key/cx rotation with 24h cooldown, 429 logging, per-row progress colors.
- Data shaping: column mapping/renames, delete empty cols, preserve leading zeros, URL normalizers.
- Utilities: auto-increment barcodes, dedupe by domain/slug, dynamic column widths, bold/wrap headers.
- Integrations: Drive/Sheets read-write, on-edit/on-trigger jobs, structured error reporting.
My Skills – Google Colab & Python Notebooks
- Scrape/ETL notebooks: batch URL fetch, progress bars, rotating proxies, safe timeouts.
- CSV merging: typed joins (e.g.,
Barcode), large file handling, PH-timezone naming.
- Artifacts: zip bundles, auto-download links, Drive uploads; reproducible environment cells.
My Skills – AI, RAG & Lightweight Agents
- Embeddings/RAG: chunking CSV catalogs, hybrid search, prompt-safe summaries, factual grounding.
- Models: OpenAI, Gemini, and local (Ollama) when privacy/cost/latency requires.
- Agents: tool chaining (search → fetch → match → verify), failure surfaces, human-readable rationales.
- Evaluation: precision/recall sets for product matching and dedupe, regression dashboards.
My Skills – Chrome Extension Development
- MV3 service workers, content scripts, cross-tab messaging, storage sync/local.
- In-page extraction: parse dynamic product pages, emphasize JSON-LD and API calls.
- Secure backends: signed requests, throttled endpoints, granular permissions.
My Skills – Google Cloud Platform
I deploy practical, cost-aware architectures on GCP: Cloud Run for containerized HTTP services, Cloud Functions
for light triggers, Cloud Scheduler/Tasks for orchestration, Pub/Sub for decoupled events, Firestore/Cloud SQL
for app data, BigQuery for analytics, Bigtable/Redis for hot reads, Secret Manager/KMS for secrets/keys, and
Cloud Build/Artifact Registry for CI/CD. Logging/Monitoring/Error Reporting keep services observable and calm.
- Cloud Run, Cloud Functions, App Engine, Cloud Build, Artifact Registry
- Cloud Scheduler, Cloud Tasks, Pub/Sub, Eventarc, Workflows
- Cloud SQL, Firestore, BigQuery, Bigtable, Memorystore (Redis/Memcached)
- Cloud Storage, Cloud CDN, Cloud Load Balancing, Cloud DNS, VPC/Firewall
- Secret Manager, Cloud KMS, Cloud Logging/Monitoring/Error Reporting
- Vertex AI (embeddings/hosting when suitable)
Selected Projects (Snapshots)
- Shopify Collections Scraper: batch collections with time caps, concurrent workers with safe backoff, outputs normalized CSVs (Barcode, price_extracted, source_url), auto-zip and auto-download.
- Sheets-Driven CSE Discovery: key/cx rotation, one-week filter, per-row color states, and error classes for quick triage.
- Amazon Matching Assistant: UPC/variant/title matching with RAG-style evidence, highlighting brand vs third-party risk and estimated ROI/BSR signals.
- Chrome Extension (MV3): page augmentation to surface JSON-LD/structured data, one-click export to Sheet/API.
Roles & Collaboration
I act as a full-stack engineer, data plumber, and ops enabler. I document assumptions, keep handoffs crisp,
and make systems runnable by non-engineers through Sheets dashboards and clear logs. I’m timezone-aware and
comfortable coordinating across teams and tools (Slack/Jira/Linear/ClickUp).
- Languages: JavaScript/TypeScript, Python, PHP
- Frameworks: Express/Fastify, Flask/FastAPI, WordPress
- Data: Postgres/MySQL, Firestore, BigQuery, Bigtable, Redis
- Infra: Cloud Run, Cloud Functions, Scheduler/Tasks, Pub/Sub, Secret Manager, KMS
- Scraping: requests/httpx, BS4/lxml; proxy rotation; concurrency controls
- AI: OpenAI/Gemini, Ollama; embeddings, RAG, agents
- DX: Cloud Build, Artifact Registry, lint/test, structured logs, error reporting
What You Can Expect
- Clear problem framing, minimal viable pipeline, and measured iterations.
- Strong data guarantees: consistent headers, preserved barcodes, normalized URLs.
- Operational clarity: dashboards, logs, and lightweight runbooks.
- Security hygiene: secrets isolation, least-privilege credentials, and regular key rotation.
- Scalable results: from single-run notebooks to scheduled, observable services.
My Skills – API Integrations (Legacy Summary)
- RESTful APIs: Designing and consuming RESTful APIs for seamless data exchange between applications.
- GraphQL: Implementing and querying data using GraphQL, focusing on efficient data fetching and reduced over-fetching.
- API Authentication and Authorization: Implementing secure authentication mechanisms (e.g., OAuth 2.0, JWT) and managing authorization policies.
- Data Serialization and Deserialization: Working with JSON and XML for API communication.
- API Testing and Documentation: Postman/Swagger UI and contract documentation.
My Skills – Frontend (Legacy Summary)
- HTML5: For structuring web content.
- CSS3: Responsive design with Bootstrap/Tailwind.
- JavaScript: ES6+ features and dynamic UI.
- WordPress: Theme and plugin development.
- Bootstrap: Rapid prototyping with components.
My Skills – Backend (Legacy Summary)
- Node.js: Scalable server-side apps.
- Apps Script: Google Workspace automations.
- Python: Django/Flask/FastAPI backends.
- Automation and AI: Practical agents and pipelines.
- API Development: REST and GraphQL.