Publications
Research
Hill Research publishes at top-tier venues in AI, machine learning, and medical informatics. Our research is the foundation that TriClick is built on — spanning LLM reasoning, AI agents, retrieval-augmented generation, knowledge graphs, and translational biology.
9 papers at premier venues in 2025–2026 including ACL, AAAI, SIGMETRICS, MLSys, and JAMIA.
Real-Time Clinical Analytics at Scale: A Platform Built on Large Language Models-Powered Knowledge Graphs
Hill Research · 2026
Describes ClinicalMind, the knowledge graph layer underneath TriClick. The graph is initialized from 300 curated authoritative sources, cutting LLM invocation costs by 70%, and spans 110,000 clinical documents and 60,000 EMRs distilled into 1.5M core concepts and 3M relationships.
- 110,000 clinical documents + 60,000 EMRs processed
- 1.5M core concepts, 3M primary relationships
- Average query latency: 1.7 seconds
- BLEU: 0.85, ROUGE: 0.92
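The cost reduction comes from answering against the curated graph before an LLM is ever invoked. A minimal sketch of that graph-first pattern (names and structures here are illustrative, not the ClinicalMind API):

```python
def answer(query, kg, llm, stats):
    """Graph-first lookup: serve from the curated knowledge graph when
    possible and fall back to the LLM only on a miss, trimming LLM calls."""
    hit = kg.get(query)
    if hit is not None:
        stats["kg_hits"] += 1
        return hit

    stats["llm_calls"] += 1
    result = llm(query)
    kg[query] = result  # fold the answer back into the graph for next time
    return result
```

Only misses pay the LLM cost, so a graph seeded from curated sources absorbs most of the query volume.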
From Trajectories to Graphs: Contract-Checked Editing for Verifier-Guided LLM Reasoning
Dr. Jack Li et al. · 2026
Introduces contract-checked graph editing for LLM reasoning. Represents LLM outputs as typed reasoning graphs and runs a deterministic structural gate before the expensive verifier, filtering structurally invalid candidates immediately.
- Verifier-runnable recombination: 41.2% → 92.8%
- Accuracy: +6.1 on MATH, +9.1 on MATH Level 5
- 42% fewer verifier calls
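In spirit, the gate works like this (an illustrative sketch; the node types and contracts below are hypothetical, not the paper's exact formalism):

```python
def structural_gate(graph):
    """Deterministic contract checks on a typed reasoning graph.

    `graph` maps node id -> {"type": str, "deps": [node ids]}.
    Contracts: every dependency exists, the graph is acyclic, and
    exactly one node is typed "answer"."""
    if sum(1 for n in graph.values() if n["type"] == "answer") != 1:
        return False
    for node in graph.values():
        if any(dep not in graph for dep in node["deps"]):
            return False  # dangling dependency
    state = {}
    def acyclic(n):
        if state.get(n) == "done":
            return True
        if state.get(n) == "visiting":
            return False  # back edge -> cycle
        state[n] = "visiting"
        ok = all(acyclic(d) for d in graph[n]["deps"])
        state[n] = "done"
        return ok
    return all(acyclic(n) for n in graph)

def filter_candidates(candidates, verifier):
    """Cheap gate first; only structurally valid graphs reach the verifier."""
    return [g for g in candidates if structural_gate(g) and verifier(g)]
```

Because the gate is deterministic and cheap, every candidate it rejects is a verifier call saved.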
HyperWorld: Hybrid World Models for Grounded Language Agents
Dr. Jack Li et al. · 2026
A hybrid world model combining SSM-based dynamics with entity-centric episodic memory and critic-guided rollout planning. Enables AI agents to simulate the impact of their decisions before committing to action.
- Outperformed GPT-4+ReAct by 11-14 points on ALFWorld, WebShop, SciWorld
- 53% reduction in constraint violations
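The planning loop can be sketched in a few lines: simulate each candidate action forward under the world model, score the rollout with the critic, and only then commit. This toy version replaces the SSM dynamics and episodic memory with plain callables, so it is an assumption-laden illustration, not HyperWorld itself:

```python
def plan(state, actions, model, critic, horizon=3):
    """Critic-guided rollout: simulate each candidate action `horizon`
    steps ahead with the world model and commit to the first action
    whose rollout the critic scores highest."""
    best_action, best_score = None, float("-inf")
    for first in actions:
        s = model(state, first)
        for _ in range(horizon - 1):
            # greedy continuation under the learned dynamics model
            s = model(s, max(actions, key=lambda a: critic(model(s, a))))
        if critic(s) > best_score:
            best_action, best_score = first, critic(s)
    return best_action
```

Constraint violations drop because a rollout that would break a constraint is scored down by the critic before any real action is taken.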
EviDex: Provenance-Weighted Evidence-Path Indexing for Fresh and Auditable Retrieval under Continuous Updates
Dr. Jack Li · 2026
Replaces periodic RAG refresh with log-structured online compaction over intent-partitioned evidence-path buckets. Every retrieved path carries its provenance for regulatory audit trails.
- Evidence-set violation at 15 min: 1.3% (vs 2.4% baseline)
- Cost: $0.68 per 1k queries — 42% cheaper than adaptive TTL
- 10M docs / 16 nodes: 1,856 queries/sec, p99 latency 2.14s
- Clinical correctness: 0.884 on 800-question physician-rated test
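The provenance-weighted part can be illustrated with a small ranking function (field names and the freshness-decay form are assumptions for the sketch, not the EviDex scoring function):

```python
import math
import time

def rank_paths(paths, source_weights, now=None, half_life=3600.0):
    """Rank evidence paths by relevance * provenance weight * freshness.

    Each path dict: {"text", "score", "source", "ts"}; every returned
    hit keeps (source, ts) so the audit trail survives retrieval."""
    now = time.time() if now is None else now
    def weighted(p):
        freshness = math.exp(-(now - p["ts"]) / half_life)
        return p["score"] * source_weights.get(p["source"], 0.1) * freshness
    ranked = sorted(paths, key=weighted, reverse=True)
    return [(p["text"], p["source"], p["ts"]) for p in ranked]
```

A high-similarity hit from a low-trust source can thus rank below a slightly weaker hit from an authoritative one, and the (source, timestamp) pair travels with every result for the regulator.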
Ontology-Guided Long-Term Memory for Conversational RAG
Dr. Jack Li · 2026
Extracts durable user facts into a lightweight ontology memory graph and routes between graph-first and dense-first retrieval with a budget-aware learnable router. Addresses the failure mode where dense retrieval degrades over long multi-session conversations.
- Recall@10: 0.70 (vs 0.58 for dense-only)
- 47% reduction in cross-modality disagreement
- 81% cost reduction vs long-context methods
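A budget-aware router can be sketched as a linear scorer over cheap query features with a hard budget override. The features and weights below are illustrative stand-ins, not the learned router from the paper:

```python
def route(features, budget, weights=None):
    """Budget-aware routing between graph-first and dense-first retrieval.

    features: {"entity_overlap": fraction of query entities present in
    the ontology memory graph, "session_age": turns elapsed}. A trained
    router would learn `weights`; the defaults here are illustrative."""
    w = weights or {"entity_overlap": 2.0, "session_age": 0.05, "bias": -1.0}
    if budget < 1.0:
        return "graph"  # tight budget: graph lookups are the cheap path
    score = (w["entity_overlap"] * features["entity_overlap"]
             + w["session_age"] * features["session_age"] + w["bias"])
    return "graph" if score > 0 else "dense"
```

Queries about facts already captured in the graph (high entity overlap, old sessions) go graph-first; fresh open-ended queries fall through to dense retrieval.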
Med-ICE: Multi-Agent Consensus Framework for Trustworthy Medical AI
Zhiyuan Chen et al. · 2026
A multi-agent LLM framework for high-stakes medical tasks. Multiple agents generate diverse reasoning chains, a semantic consensus module aligns reasoning patterns, and iterative refinement continues until convergence — like a panel of specialists debating a diagnosis.
- 5-8% improvement in factual accuracy
- Fewer unsupported claims across agents
- Transparent disagreement surfacing for clinicians
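The iterate-until-convergence loop can be sketched as follows (a minimal illustration with string answers; the semantic consensus module in the paper aligns full reasoning chains, which this sketch does not attempt):

```python
def consensus(agents, question, max_rounds=5):
    """Iterate agent answers until they converge; otherwise return the
    majority answer plus the surfaced disagreement.

    agents: callables (question, peer_answers) -> answer string."""
    peers = [None] * len(agents)
    answers = []
    for _ in range(max_rounds):
        answers = [agent(question, peers) for agent in agents]
        if len(set(answers)) == 1:
            return answers[0], []  # converged: nothing left to debate
        peers = answers  # next round, each agent sees its peers' answers
    majority = max(set(answers), key=answers.count)
    dissent = sorted(a for a in set(answers) if a != majority)
    return majority, dissent
```

Crucially, non-convergence is not hidden: the dissenting answers are returned alongside the majority so a clinician sees exactly where the panel disagreed.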
CSLAN: Cross-Species Latent Alignment Network
Dr. Rui Wu et al. · 2026
A transfer learning framework that bridges mouse and human single-cell datasets. Uses species-invariant genomic features to identify human trauma-related immune cells with minimal human samples.
- 96.67% accuracy with only 240 human samples
- Bridges cross-species data scarcity
- Scalable to disease modeling and drug development
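The core move, restricting both species to shared (ortholog) features before transferring a classifier, can be sketched with a toy nearest-centroid model standing in for the learned latent alignment. All gene names and data here are invented for illustration:

```python
def project(sample, feature_names, shared):
    """Keep only the species-invariant (shared ortholog) features."""
    index = {f: i for i, f in enumerate(feature_names)}
    return [sample[index[f]] for f in shared]

def fit_centroids(samples, labels):
    """Per-class mean vector: a toy stand-in for the learned alignment."""
    groups = {}
    for x, y in zip(samples, labels):
        groups.setdefault(y, []).append(x)
    return {y: [sum(col) / len(xs) for col in zip(*xs)]
            for y, xs in groups.items()}

def classify(centroids, x):
    dist = lambda a, b: sum((p - q) ** 2 for p, q in zip(a, b))
    return min(centroids, key=lambda y: dist(centroids[y], x))
```

Training happens on abundant mouse data projected into the shared feature space; human samples are projected into the same space at test time, which is why so few human samples are needed.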
Dynamic Consistency Index for Gene Expression Dynamics
Dr. Rui Wu · 2026
A new metric for measuring how gene expression evolves across immune cell types over time. Enables modeling of gene expression dynamics with high accuracy while estimating uncertainty.
- High-accuracy temporal gene expression modeling
- Built-in uncertainty estimation
- Supports robust biological insights
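A toy version of such an index, mean pairwise correlation of one gene's trajectory across cell types, gives the flavor (this is an illustrative proxy, not the paper's definition; a bootstrap over cells would supply the uncertainty estimate):

```python
from statistics import fmean

def dynamic_consistency_index(trajectories):
    """Mean pairwise Pearson correlation of a gene's expression
    trajectory across cell types: 1.0 means identical dynamics,
    -1.0 means opposite dynamics.

    trajectories: list of equal-length expression time series,
    one per cell type."""
    def pearson(x, y):
        mx, my = fmean(x), fmean(y)
        num = sum((a - mx) * (b - my) for a, b in zip(x, y))
        den = (sum((a - mx) ** 2 for a in x)
               * sum((b - my) ** 2 for b in y)) ** 0.5
        return num / den
    n = len(trajectories)
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    return fmean(pearson(trajectories[i], trajectories[j]) for i, j in pairs)
```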
Consensus-Based Framework for Reducing LLM Hallucinations in Clinical AI
Zhiyuan Chen · 2026
Models from different families challenge each other in a consensus-based framework, improving accuracy and advancing toward transparent, audit-ready clinical AI systems.
- Improved accuracy over single-judge models
- Cross-family model consensus validation
- Designed for audit-ready clinical settings
Interested in Our Research?
We're always looking for collaborators in clinical AI, NLP, and biostatistics. Get in touch.