What an analysis looks like

From your recording we produce an annotated video plus charts so you can trace eye contact, body posture, speech rate and more.

What is VideoAnalytics?

VideoAnalytics is a research tool developed by the Centre for University Teaching at the University of Bayreuth. It enables lecturers and students to have video recordings of presentations or simulated exam dialogues analysed automatically.

The analysis includes body language, gestures, eye contact, speech behaviour and optionally AI-based emotion recognition. The goal is to receive constructive feedback for improving your own presentation skills.

Record directly in the browser

Record your webcam and optionally your screen, then upload for analysis.

No software installation required. The compatibility check verifies in advance whether camera, microphone and browser audio are working.

Start recording
Upload video file

Submit an existing recording (MP4, WebM, MOV) for analysis.

Upload an existing video file. The analysis starts automatically and you will receive your feedback by email.

Upload video

How the analysis works

Recording

Record your presentation via webcam or upload an existing video. Optionally, you can also record your screen at the same time.

Analysis

Automatic evaluation of posture, gestures, gaze direction, speech pauses, volume and optionally facial expressions/emotions. Processing takes a few minutes.

Feedback

You will receive an email with a link to your results including interactive charts, a transcribed text version and optionally an AI coaching report.

What your feedback looks like

After the analysis you receive an interactive feedback dashboard. Here are examples of the different analysis sections:

Annotated Video
Annotated Video

Skeleton overlay shows posture, gestures and gaze direction in real time

Timeline Charts
Timeline Charts

Interactive charts with zoom, video cursor and calibration

Rhetoric Analysis
Rhetoric Analysis

Evidence-based assessment following scientific standards

AI Rhetoric Check
AI Rhetoric Check

Colour-coded improvement suggestions directly in the transcript

Analysis Results
Analysis Results

Detailed metrics with benchmark comparison

AI Coaching
AI Coaching

Personalised coaching feedback from Claude (Anthropic)

Confidence Analysis (multimodal)

Combined assessment from voice, gaze, speech, gesture and facial expression — with moment highlights for confident passages and improvement areas

Commented Video

Video with text overlays at key moments — shows when confidence rises or drops

Eye Contact Calibration

Label 10 diverse frames so the software accurately detects your individual gaze direction

Video Trimming at Upload

Set start and end time so only the relevant section is analysed — no distorted data from walking in

User Feedback

Per analysis section: thumbs up/down + comments. Your feedback helps us improve the software

More Impressions

Gesture activity (left/right hand)
Gesture activity (left/right hand)
Eye contact percentage over time
Eye contact percentage over time
Voice analysis (pitch, loudness, jitter, shimmer, HNR)
Voice analysis (pitch, loudness, jitter, shimmer, HNR)
Speech tempo (words per minute)
Speech tempo (words per minute)
Facial expression (smile, surprise, etc.)
Facial expression (smile, surprise, etc.)
Emotion analysis (happiness, neutral, sadness, anger)
Emotion analysis (happiness, neutral, sadness, anger)
Data deletion / privacy controls
Data deletion / privacy controls

Emotion Analysis

The optional emotion analysis uses a neural network (HSEmotion) to detect emotional states such as joy, surprise, concentration or tension from facial expressions. This data is presented exclusively in statistically aggregated form and serves for self-reflection.

Important: Emotion recognition is an approximation method. It does not capture inner emotional states but interprets visible facial expressions. The results should be understood as guidance, not as a psychological diagnosis.

Legal classification (EU/GDPR)

Regulation (EU) 2024/1689 (AI Act) classifies emotion recognition systems in the workplace and educational institutions as particularly sensitive (Art. 5 para. 1 lit. f). Their use is only permitted under strict conditions.

The following safeguards apply within this project:

  • Voluntary: Emotion analysis is disabled by default and must be consciously activated by the user. There is no obligation to use it.
  • Consent: By actively ticking the option, you give your informed consent pursuant to Art. 6 para. 1 lit. a GDPR.
  • No biometric identification: No identification or categorisation of persons takes place. The analysis exclusively evaluates facial expressions within a single video.
  • Purpose limitation: The data is used exclusively for individual feedback on presentation skills.
  • Transparency: All analysis results are accessible to you. No automated decision-making takes place.

Data Protection and Voluntariness

Completely voluntary

The use of this service is entirely voluntary. There is no obligation to record or upload videos for analysis. All optional features (emotion analysis, AI coaching) must be actively enabled.

Automatic deletion

All uploaded videos and analysis results are automatically deleted after 14 days. After that, neither the video nor the analysis data will be accessible. Early deletion can be requested at any time via the admin panel.

Processing and storage

Processing takes place on a dedicated server at the University of Bayreuth. No data is shared with third parties. The AI coaching feature (optional) uses the Claude API from Anthropic; only the transcribed text is transmitted, no video recordings.

No tracking

This website does not use cookies, tracking or external analytics services. No personal data is collected beyond the email address required for the upload.

Technologies Used

VideoAnalytics combines specialised AI models and open-source libraries for comprehensive analysis:

Category Technology Purpose
Body Pose & Gesture MediaPipe (Google) Body pose (skeleton landmarks), face mesh (468 points), iris tracking
Speech Recognition faster-whisper (Systran) Speech-to-text (transcript), word-based filler detection
Emotion Recognition HSEmotion (HSE) Facial emotion classification (joy, neutral, sadness, anger, etc.)
Gaze Direction L2CS-Net (ResNet-34) Gaze angle estimation (yaw = horizontal rotation, pitch = vertical tilt) from the face crop
Facial Expression (Action Units) py-feat (Cosanlab) Action Units = smallest visible facial-muscle movements defined by the Facial Action Coding System (FACS); py-feat detects them automatically
Voice Analysis openSMILE (audEERING) Acoustic features following the eGeMAPS standard (extended Geneva Minimalistic Acoustic Parameter Set): pitch, loudness, jitter = pitch fluctuation, shimmer = loudness fluctuation, HNR = Harmonics-to-Noise Ratio (ratio of harmonic content to noise)
Voice Quality Parselmouth/Praat Voice quality metrics based on Praat (the de-facto standard tool of phonetics research)
Filler Words (audio-based) Eigenes Verfahren (ZHL UBT, auf eGeMAPS-Basis) Custom method: detects “uh” / “um” from acoustic features (eGeMAPS) instead of from the transcript — more reliable than text-only detection
Emphasis & Three-Channel Coherence Eigenes Verfahren (ZHL UBT) Custom method: measures whether vocal emphasis, gesture and pause align while speaking (three-channel coherence)
Orientation Toward Presentation Eigenes Verfahren (ZHL UBT) Custom method: infers from hand, gaze and body direction when the speaker is facing the audience vs. the presentation (slides/board)
AI Coaching Claude (Anthropic) Speech quality analysis, personalised coaching — under a Zero-Data-Retention agreement: content is not stored and not used for AI training
Video Processing OpenCV + FFmpeg Per-frame processing, video encoding
Charts Chart.js Interactive timeline visualisation
GPU Acceleration (graphics-card computing) NVIDIA CUDA 12.1 AI-inference acceleration on NVIDIA graphics cards
Framework FastAPI + Python Backend interface (API = Application Programming Interface)

Legal Notice

Organisation

University of Bayreuth

Centre for University Teaching (ZHL)

Universitätsstraße 30

95447 Bayreuth

Contact

paul.doelle@uni-bayreuth.de

www.zhl.uni-bayreuth.de

Responsible for content

Centre for University Teaching (ZHL)
University of Bayreuth

Legal form

The University of Bayreuth is a public corporation. It is legally represented by the President.

Supervisory authority

Bavarian State Ministry of Science and the Arts