§ projects

Open-source systems & research artifacts.

A selection of open-source systems and research artifacts by Ademar Tutor: GLAMLI (Master's dissertation), the WEKA API Server powering its tool layer, LexiTau and LexExtract (agentic Text-to-SQL over financial documents), an LLM-as-a-Judge evaluator, and SafeBeautyLedger. Most started as research questions and ended up as code — from papers, from production problems, or from a weekend's curiosity. Each links to a write-up, the repo, or both.

indexjump to

GLAMLI (Dissertation) · in progress
WEKA API Server · REST layer over WEKA
LexiTau · agentic Text-to-SQL
LLM-as-a-Judge Evaluator · evaluation methodology
LexExtract · OCR → Text-to-SQL
SafeBeautyLedger · blockchain traceability

§ 01research & OSS

GLAMLI (Dissertation)
2025– · in progress

Guided LLM-Augmented Machine Learning Interface — an agent that walks non-experts through a real ML workflow (load → profile → train → evaluate → deploy).
- Planner-with-revision orchestration topology
- Typed tool registry (data primitives) the agents call into
- Sandboxed code execution for generated Python
- Structured observability traces for trust calibration and failure analysis
repo link when public
WEKA API Server
2026 · rest layer over weka

A small REST API wrapping the open-source WEKA machine learning library — dataset management, classifier training, prediction, evaluation, EDA, leakage-safe preprocessing, and post-training diagnostics, all over plain JSON. Built as the tool layer GLAMLI's agents call into.
- Allowlisted classnames (weka.classifiers.* / weka.filters.*) and input validation around the WEKA classpath
- Diagnostics endpoints: ROC, margin, cost curves, calibration plots, per-instance error visualisation
- Filesystem persistence for datasets and serialized models via Docker bind mounts
- Stack: Java 17 · Javalin · WEKA · Jackson · Maven · Docker Compose
docs github · iamademar/weka-api
LexiTau
2025 · agentic text-to-sql

Agentic Text-to-SQL pipeline for financial document querying. End-to-end: profiling → metadata → schema linking → SQL draft → safety checks → execution → narrative explanation.
- RAG-based schema context retrieval (Jaccard over metadata, cosine on queries)
- Multi-tenant isolation: per-tenant embeddings, scoped auth, row-level policies
- OCR ingestion (Azure Form Recognizer) normalised into tenant-scoped tables
- Stack: FastAPI · PostgreSQL+pgvector · SQLAlchemy · Celery/Redis · Next.js · Docker Compose
Multi-tenant RAG with real isolation is rare in public artifacts.

github · iamademar/LexiTau
LLM-as-a-Judge Evaluator
2025 · evaluation methodology

Statistical-rigor evaluation framework for LLM-judged outputs. Seven SPIN-based scoring dimensions, evaluated with QWK, Pearson r, and ±1 accuracy — three metrics that disagree about which judge is best, which is the point.
- Pydantic schema enforcement; prompt-level contracts for deterministic, auditable outputs
- Tenant-scoped prompt versioning with audit trails (Postgres + SQLAlchemy)
- Score-once / evaluate-many workflow that cuts LLM cost without losing fidelity
- Multi-LLM provider integration (OpenAI · Anthropic · Google)
- Stack: FastAPI · PostgreSQL · SQLAlchemy · LangChain · Docker · Azure Container Apps
github write-up: pipeline evaluation prompts deploy
LexExtract
2024 · ocr → text-to-sql

OCR-to-Text-to-SQL platform for financial document querying. Natural-language queries over PDF-extracted financial data.
- Full pipeline: PaddleOCR → LLM-based Text-to-SQL with memory-efficient processing and progressive fallback
- Explainable query outputs through structured SQL generation and validation layers
- Stack: FastAPI · PostgreSQL · PaddleOCR · Mistral 7B · Docker
github · iamademar/LexExtract
SafeBeautyLedger
2024 · blockchain · gdip

Blockchain traceability for beauty-industry product provenance. Immutable history, public verification, on-chain product registry.
- Solidity smart contracts (Hardhat) with versioned product records, event-driven history retrieval
- On-chain / off-chain data flow integration
- Stack: Solidity (Hardhat) · Node.js + TypeScript · PostgreSQL · Ethers.js · Next.js
Outside the multi-agent work, but a useful counterpoint — different problem class, same engineering discipline.

github · iamademar/safebeautyledger

2026– · in progress

GLAMLI/h3>
Guided LLM-Augmented Machine Learning Interface — an agent that walks non-experts through a real ML workflow (load → profile → train → evaluate → deploy). Planner-with-revision over a typed tool registry, sandboxed execution, full prompt/tool-call traces
2026 · rest layer over weka
WEKA API Server

REST API wrapping the open-source WEKA ML library — dataset management, training, prediction, EDA, leakage-safe preprocessing, and diagnostics over plain JSON. The tool layer GLAMLI's agents call into.
2025 · agentic text-to-sql
LexiTau

Agentic Text-to-SQL for financial documents. RAG schema retrieval, multi-tenant isolation, OCR ingestion.
2025 · evaluation
LLM-as-a-Judge Evaluator

Statistical-rigor evaluation: QWK, Pearson r, ±1 accuracy across seven SPIN-based dimensions.
2024 · ocr
LexExtract

PaddleOCR → LLM-based Text-to-SQL with memory-efficient processing and progressive fallback.
2024 · blockchain
SafeBeautyLedger

On-chain product registry with versioned records and event-driven history retrieval.

§ 02get in touch

Open to conversations about applied multi-agent research.

hey@ademartutor.com

GLAMLI (Dissertation) →

WEKA API Server →

LexiTau →

LLM-as-a-Judge Evaluator →

LexExtract →

SafeBeautyLedger →

Open to conversations about applied multi-agent research.

GLAMLI (Dissertation)

WEKA API Server

LexiTau

LLM-as-a-Judge Evaluator

LexExtract

SafeBeautyLedger