This repo is the implementation of a research project aimed at enhancing Acoustic Side-Channel Attacks (ASCAs) using a novel combination of Vision Transformers (VTs) and Large Language Models (LLMs).
Sound is a crucial signifier which contains rich high-level semantic environmental information. Consequently, comput-erised audio classification, aiming to recognise a various of sound patterns, has ...