Blog
2 days ago
TalkNet-ASD: The “Who’s Talking?” Model for Any Video
TalkNet-ASD is an audio-visual active speaker detection model that labels speaking faces and outputs JSON speaker tracks for real-world footage.
Source: HackerNoon