Item detail

jd-opensource/JoyAI-VL-Interaction

JoyAI-VL-Interaction is JD.com's Apache-2.0 open-source release of an 8B real-time video-language interaction model: the model continuously watches a live video stream and only responds when the moment warrants it, instead of the turn-based question/answer pattern of every other open video-LLM. The repo ships the technical report (JoyAI-VL-Interaction-Reportv1.pdf, 5MB), the time-aligned interacti

Score7.7
Popularity70.0
Risknone
TierSilver
Score breakdown
Usefulness7.0
Novelty9.0
Momentum7.0
Maturity6.7
Open-source/build8.4
Evidence7.2
Workflow potential8.1
Setup ease4.2

Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.

Why it matters

Useful for ML researchers and video-AI teams who want to study or build on a real-time video-language model from a major Chinese tech company: clone the repo now, read the 5MB technical report for the architecture and training recipe, and prepare to clone the model weights the moment the open-source release lands on June 20.

Who should use it

ML researchers studying real-time or event-driven video-language modelsvideo-AI teams who want a public training recipe + technical report to ground their own designdevelopers planning to clone the model weights the moment the 2026-06-20 open-source release landsmultimodal-AI practitioners who need a presence-in-the-moment assistant pattern instead of turn-based Q&A

Who should skip it

Skip for now if you need a low-setup, non-technical tool today.

Risk explanation

No inherent user-impacting risk is flagged from the captured evidence.

Evidence links

Closest alternatives / related signals

video-languagereal-timemultimodal8b-modelopen-source-releasejd-comapache-2.0