Alljoined - A Dataset For EEG-to-Image Decoding
Alljoined - A Dataset For EEG-to-Image Decoding
Alljoined - A Dataset For EEG-to-Image Decoding
8 Blocks repeated x2
+ 24 oddballs
= 264 images
Trial 1 0-50ms
Trial 2 Trial 264
jitter
Figure 2. Schematic overview of the structure of trials, blocks, and sessions. Each of the 120 block-specific NSD images is presented
twice within each block, and each of the 8 session-specific blocks is presented twice within each session. Each participant performed two
sessions on different days. Each of the 10 sessions thus consists of 960 NSD images repeated four times within and across blocks, totaling
9600 unique NSD images per participant.
Figure 4. EEG topographic maps and corresponding signals at all 64 electrodes averaged over a) 3823 events for the fifth participant (left)
and b) across all sessions for all participants (43070 events) in the Alljoined1 dataset (right), highlighting individual and common brain
activity patterns associated with image presentation.
4.3. Discussion
Figure 5. ERPs averaged over occipital and parietal electrodes The ERP and topography analyses, as well as our analy-
for all participants and sessions. Shaded areas around the grand sis of the SNR reveal and reinforce several benefits of the
average ERP indicate standard deviations at all timepoints. acquired dataset, with regard to the stimulus design and tim-
ing.
1. Stimulus Duration and Stimulus Onset Asynchrony:
mined by calculating the standard deviation of the aggre- A 300 ms presentation window as well as a subsequent
gated waveform average for each event type across all trials 300 ms rest period allows the capture of both early and
and then dividing this by the square root of the event type’s late cognitive processes, as evidenced by the single sub-
occurrence count. The SNR is subsequently derived by di- ject peaks at around 262 ms up to 479 ms in Figure 6
viding the mean signal values by their corresponding SME. a), and the averaged peaks at 293 ms and 521 ms in Fig-
Figure 6 compares the average SNR across all events in a ure 6 b), respectively. The duration of 300 ms for image
single session for participant 5 with the average SNR across presentation is sufficient for the brain to engage in both
all events for both sessions concatenated. We see that the perceptual encoding and initial stages of memory pro-
SNR is noticeably lower in the multi session graph. This is cessing, which may not be as effectively captured with
due to the increased number of repetitions for a given event shorter presentation times. The subsequent 300ms rest
at different timepoints. This attributes to a disproportion- period provides a window to measure the brain’s higher-
ately higher standard deviation value and consequently a level visual and semantic response to the stimuli. The
higher SME and lower SNR. However, that is not to say that whole ERP thus not only reflects the initial feed-forward
the quality of the data is worse. It actually reflects more ac- transfer of sensory information to visual cortical areas
curate SNR values as there are more data points, distributed but also the subsequent recurrent interactions involved
across different sessions. It is also observed that the single in attention and semantic analysis, that unfold over hun-
sessions graph is more volatile across time, demonstrating a dreds of milliseconds after stimulus onset. The relevance
greater variance in SNR values which are captured by have of longer presentation times and longer stimulus-onset
a less accurate metric for noise with less trials to average asynchrony is additionally supported by the sustained
between. This fluctuation underscores the limited accuracy ERP activation presented in Figure 5, as well as the la-
of noise metrics derived from fewer trials, thus highlighting tency of SNR increase and peak in Figure 6.
Figure 6. Signal to Noise Rate (SNR) averaged across each session, across each block, and within each block for participant 5. Left: SNR
for only the first session 1, Right: SNR for all sessions.