This image is from Left temporal lobe structural and functional abnormality underlying auditory hallucinations in schizophrenia (Frontiers in Neuroscience, 2009)
A new paper from clinical psychologist and neuropsychologist Vaughan Bell (King's College London, United Kingdom) examines auditory hallucinations from the perspective of social cognitive and social neurocognitive processes. From this view, he proposes that auditory hallucinations may represent problems with the internalization of social models.
This is an interesting and important theory for those of us who work with clients afflicted with these hallucinations, especially when the voices are those of family members or friends. The evidence seems to suggest that people who experienced childhood sexual and emotional abuse—but not physical abuse—are more likely to have auditory verbal hallucinations in adulthood, suggesting a specific link with early relationship trauma.
by John Hewitt
Inner Speech Loops. Credit: faculty.washington.edu
(Medical Xpress) Perhaps the most controversial book ever written in the field of psychology was Julian Jaynes' mid-seventies classic, The Origin of Consciousness in the Breakdown of the Bicameral Mind. In it, Jaynes reaches the stunning conclusion that the seemingly all-pervasive and demanding gods of the ancients were not just whimsical personifications of inanimate objects like the sun or moon, nor anthropomorphizations of the various beasts, real and mythical, but rather the culturally-barren inner voices of bilaterally-symmetric brains not yet fully connected, nor conscious, in the way we are today.
In his view, all people of the day would have "heard voices", similar to the schizophrenic. The voices would have been experienced as hallucinations of sorts, coming from outside themselves as the unignorable voices of gods, rather than as commands originating from the other side of the brain. After a long hiatus, the study of the inner voice, and of the larger mental baggage that comes along with having one, has returned to the fore. Vaughan Bell, a researcher from King's College London, recently published an insightful call to arms in PLoS Biology for psychologists and neurobiologists to create a new understanding of these phenomena.
A coherent inner narrative in sync with our actions is something most of us take for granted. Yet not everyone can take such possession. The congenitally deaf, for example, may later acquire auditory and communicative function through the use of cochlear implants. However, their inner experience of the sound-powered word, which they acquire through the reattribution of percepts of a previous gestural or visual nature, is something not typically shared or appreciated by the larger public. A similar lack of comprehension exists in the research community regarding those with physically intact senses but with some other mental process gone awry. We may note with familiarity the shuffling and muttering of a homeless schizophrenic, yet have no systematic way to comprehend their intuitions, no matter how deluded they may appear.
Bell notes that current neurocognitive theories tend to ignore how those who hear voices first acquire what he describes as "internalized social actors." In addition to live social interactions, "offline" social interaction with an internal model of those individuals holding significant power in our lives would seem like a handy feature to have. We can readily imagine entirely non-pathological situations where such a model would be of benefit. A young child cut from a school basketball team that they worked hard to make may be temporarily devastated, but hardly traumatized. If they renew their efforts to make the team the next year and practice each day in their backyard, they might imagine the coach who cut them watching their every shot with a critical eye. While this hallucinated guidance would be entirely benign, if the person they imagine is instead an abusive parent or classmate, the internal model might eventually take on a more sinister nature.
It would seem that, at least in some individuals, the internal model is able to get the upper hand, particularly when that hand is forced. We might imagine a school child tasked with the tedium of a seemingly endless recitation: saying the rosary, for example, in the Catholic school days of yore. The familiar "Hail Mary, full of Grace…" might, after so many repetitions, transform in the mind into something else, despite the earnestness of the professor of faith. "Hail Mary, full of…" might instead be completed with a different choice word that intrudes from another collective in the brain despite the alarmed child's efforts to suppress it. In the situation where this is vocalized externally, completely out of control as in full-blown Tourette's syndrome, the child now has a problem.
The idea that separate voices represent separate hemispheres may be a good starting point, but it can readily be dispatched as far as being the whole story. Auditory hallucinations can take the form of multiple social actors, clearly outnumbering our hemispheres, and all with different tones, personalities, and persistence of identity. Attempts have been made to localize brain activity to a particular narrative using EEG recording, or to elicit a hallucination using magnetic stimulation. While the occasional insightful anecdote may be gleaned from these kinds of investigations, we should not expect much fine detail to ever be had from them. The cortical area known as the temporoparietal junction routinely emerges as a favorite among brain imagers because of its geometric location at the pinnacle of the major fold in the brain. Unfortunately, until there exists a large-scale, minimally damaging recording technology, we are probably going to have to content ourselves with looking more closely at what subjects have to say about their own auditory hallucinations than at what their brains might have to say.
As children we learn to talk by talking to ourselves. Unless marooned on an island, we tend to abandon this behavior in polite company for fear of stigmatization, among other things. If the line between normalcy and pathology for hearing voices, or even talking to them (so long as they do not command undesirable physical actions), is drawn with a greater acceptance for normalcy, a clearer understanding of the inner voice might be sooner in hand.
Bell, V. (2013, Dec 3). A Community of One: Social Cognition and Auditory Verbal Hallucinations. PLoS Biol; 11(12): e1001723. DOI: 10.1371/journal.pbio.1001723
Auditory verbal hallucinations have attracted a great deal of scientific interest, but despite the fact that they are fundamentally a social experience—in essence, a form of hallucinated communication—current theories remain firmly rooted in an individualistic account and have largely avoided engagement with social cognition. Nevertheless, there is mounting evidence for the role of social cognitive and social neurocognitive processes in auditory verbal hallucinations, and, consequently, it is proposed that problems with the internalisation of social models may be key to the experience.
Auditory verbal hallucinations, the experience of “hearing voices”, present us with an interesting paradox: the experiences are generated from within a single individual but are typically experienced as a social phenomenon—that is, a form of communication from another speaker.
Current theories attempt to explain auditory verbal hallucinations as alterations to individualistic information processing, namely, misattributions of internal thoughts as external phenomena due to biases in cognitive monitoring. The fact that voices stem from an internal source is, of course, clear, but the typical experience of “hearing voices” is not that thoughts seem to be “spoken aloud” but that hallucinated voices have a social identity with clear interpersonal relevance. In other words, voices are as much hallucinated social identities as they are hallucinated words or sounds.
Nevertheless, neurocognitive theories have largely ignored how people who “hear voices” acquire what amounts to internalised social actors. To illustrate the extent of this neglect, a recent consensus statement that described an integrated cognitive model of auditory verbal hallucinations included only vague mentions of “perceptual expectations”, “top down influences”, and “emotion” to address how voices become distinguishable as social identities, without any specific suggestions for how these experiences take a social form.
This clear omission is all the more surprising given the significant advances in the field of social cognitive neuroscience. Although early theories in this field focussed largely on the development of abilities (like “theory of mind” and “mentalising”), more recent work highlights the importance of developing internalised models of social actors for both “inner” social reasoning and “live” social interaction. Here it is argued that alongside the well-established difficulties with source and intention monitoring that lead to misattribution, auditory verbal hallucinations may also involve a change in the neurocognitive processes that support our internal social models of people and how they behave.
The Social Cognition of Hearing Voices
Although, at its root, all speech is a social phenomenon, hallucinatory voices have clear interpersonal characteristics above and beyond the social basis on which they rest. First, hallucinated voices are usually experienced as having identities and making coherent communicative speech acts (see example in Box 1), and, second, they are primarily experienced as social actors the hearers can relate to and interact with.
Box 1. Hallucinated Social Identities
Phenomenological studies have reported that between 30% and 69% of people who hear voices experience them as having specific personal identities (e.g., members of the family, God, celebrities). The biggest study to date reported that 31% of 199 voice hearers with a psychiatric diagnosis did not experience anonymous voices, 32% experienced a mix of known and unknown voices, and only 37% experienced purely anonymous voices. However, these figures are likely to be a minimum estimate of whether voices are experienced as having social identities, in terms of being able to reliably distinguish them from other voices by personal characteristics, because voices may have social identities but still be anonymous (e.g., identified only as an “unknown old woman” or “a man with a deep voice”, as in Leudar et al., who name them “incognito” voices). There is evidence to support this from studies that have specifically investigated this aspect of the experience. Nayani and David reported that 61% of psychiatric voice hearers knew the identity of their voices, but an additional 15% had voices that were familiar but unknown. McCarthy-Jones et al. found that 70% of voice hearers reported voices that, regardless of specific identity, were “like” people they had spoken to in the past. In a qualitative study of 50 psychiatric voice hearers, Beavan reported that characterising voice identity, regardless of citing a specific personal identity, was a major theme of the experience of hallucinating voices. In other words, regardless of whether a voice is associated with a personal identity in the outside world, the process of recognising it as a distinct social identity is likely to be a key experience for many voice hearers.
“I mean, there are two voices, Simon and Jeremy. Simon's…um…like a demon really. He's very demonic and he…says people read my mind and they know I'm evil…. I've got a year to live, that if I don't do as I'm told the horrible horseman of the apocalypse will come and get me and kill me and Armageddon will come and the world will be destroyed. And then Jeremy, he's just a little boy, he's just full of fun, you know, he'll tell me things like, um, ‘Move the food from the cupboard and put it in mum's chest of drawers.’ Just stupid things like that. It's funny but it's… it is annoying really.”
—Lucy from Knudson and Coyle 
Similarly, numerous studies have now found that voice hearers understand their connection with the voices in terms of relationships and interact with their voices in ways that “share many properties with interpersonal relationships within the social world”. Most obvious in this regard is the fact that over 80% of people who experience auditory verbal hallucinations have reported that they are able to engage in interactive conversations with their voices. Judgements about the identity of hallucinated voices rely on perceptual features similar to those required to judge identity when listening to the voices of other speakers, with perceived identity being an important mediator of distress.
With regard to the effect of social environment on voices, relationships with the hallucinated voices are experienced in terms of social power, while voice hearers' perception of the external social world is mirrored in their relationship with hallucinated voices, which is similarly a significant mediator of distress. Studies on psychiatric voice hearers have indicated that the social environment plays a role not only in the formation of hallucinations but also in the maintenance of the experience. Although the risk of psychosis is raised after the experience of trauma in general, childhood sexual and emotional abuse (but not physical abuse) predict the presence of auditory verbal hallucinations in adulthood, suggesting a specific link with early relationship trauma. In adult life, the death of someone in a long-term, loving relationship will commonly lead to hallucinations of that specific person, and hallucinated voices are among the most common experiences. In line with phenomenological evidence that voices are typically associated with known people (either directly through identity or through resemblance), it seems likely that intense emotion within a relationship is associated with both the formation and content of hallucinated voices.
These results suggest that the experience of auditory verbal hallucinations is, for most voice hearers, primarily a social one, with social environment through the lifespan having a specific effect on the presence and form of voices. The fact that most voices are perceived as having social identities with which the hearer interacts in ways verifiably similar to external social relationships suggests that voices often function as internal models of social actors.
Auditory Verbal Hallucinations and Social Cognitive Neuroscience
The majority of studies on the cognitive neuroscience of auditory verbal hallucinations have not looked at “voices” as specifically social phenomena, but there is evidence that the neural networks involved in supporting these experiences have significant overlap with areas that play a key role in social neurocognition.
Neuroimaging studies have linked auditory hallucinations to functional and structural differences in speech and language areas, most notably the superior and middle temporal gyri and the inferior frontal gyrus (Broca's area). In addition, a wider network of non-sensory areas is implicated. These include areas typically described in the auditory verbal hallucination literature as linked to cognitive monitoring (namely, the dorsolateral prefrontal cortex, anterior cingulate, and cerebellum) and areas typically linked to emotion and affect regulation (namely, the anterior insula, hippocampal and parahippocampal regions, and the orbitofrontal cortex). However, these areas typically described as monitoring and emotion areas are also key components in social neurocognitive networks that make up the “social brain”.
Of particular interest in the network commonly identified in neuroimaging studies of auditory verbal hallucinations is the temporoparietal junction, now clearly identified as having a key role in verbal working memory and social cognition and, according to Saxe, a central role in representing others' mental states. Using electroencephalography, this area has been found to be active in the second before the onset of auditory verbal hallucinations, and 1-Hz repetitive transcranial magnetic stimulation (used to dampen down neural excitability) applied to this area reduces hallucination intensity. In the only study that has directly stimulated the temporoparietal junction (increasing neural excitability in the area), Arzy et al. induced a clear experience of social imagery in non-voice-hearing, non-psychiatric participants in the form of a “sensed presence”.
Neuroimaging studies that have directly compared the experience of auditory hallucinations to imagined inner voices have found remarkably similar brain activity, indicating that voices are not likely to be just “misidentified thoughts” but specifically “misidentified voice images”, potentially experienced as unintentional because of an altered sequence of activation in the supplementary motor area and auditory perceptual areas. Such voice images have been found to have dissociable neural substrates for spatial and social identity properties.
Towards a Social Cognitive Approach to Voices
There is good evidence that auditory verbal hallucinations are usually experienced as social entities with which voice hearers have dynamic relationships and which are associated with specific changes to brain networks used to support social cognition. It is surprising, therefore, that cognitive and neurocognitive theories of hallucinated voices have been almost entirely individualistic in their approach.
Perhaps one exception is Fernyhough's developmental theory that draws on Vygotsky's account of inner speech. According to Vygotsky, children learn language through interaction and dialogue with others. The dialogue format is then used, first out loud, to problem-solve (children literally talk to themselves), and later becomes internal as we learn to internalise speech as thoughts, with thoughts retaining dialogic qualities and having elements of “inner self-talk”. Fernyhough uses this to explain how the “misattribution of thought” model of auditory verbal hallucinations could produce speech-like experiences. However, this account still does not explain why hallucinated voices are typically experienced as distinct social identities with which the hearer has a relationship. In essence, it still strips the social phenomenology (and, therefore, it could be argued, key aspects of social cognitive processing) from the experience.
Considering the evidence that auditory verbal hallucinations involve a misattribution of internal phenomena as external due to biases in cognitive monitoring, we need to be clear about the nature of the internal phenomena that are being misattributed. It would be most parsimonious to assume that these phenomena stem from our normal ability to internalise models of people we know and their voices, rather than auditory hallucinations involving a de novo generation of persistent and internally vocal social identities. Accounts including internalised models of social actors suggest that we internalise others' voices and personalities so that we can predict what someone would say or do in any given situation. These internal models can be for specific people, so I can imagine how my spouse might respond in a hypothetical conversation, or for generic stereotypes, so I can imagine how a policeman or shopkeeper might respond.
Neuropsychological models of voice hearing suggest there is an important component involved in the generation of heard speech (changes to activity in the language system) and an important component involved in difficulty distinguishing internal from external phenomena (altered cognitive monitoring). The hypothesis suggested here is that, in addition to these well-established factors, there is an alteration to the social cognitive or social neurocognitive systems that support internal models of social actors and their associated voice imagery, to explain why voices are typically experienced as having an identity and acting socially.
It is worth examining what this implies. First, it implies that an ability to internalise models of other people and generate associated imagery of their voices is a normal developmental process. Longitudinal studies looking at developmental psychology should see that this ability develops in line with other social cognitive processes, and that people with difficulties in internalising predictive models of other individuals should have marked social difficulties. Secondly, it implies that changes to the normal functioning of this system play a causal role in many voice-hearing experiences. Although typically experienced as social, voice-hearing experiences can range from simple repetitive syllables to conversations between the hearer and several hallucinated vocal identities. Drawing on the work of Hassabis et al., the more a voice is experienced as having a social identity, the more it should involve processes that support the creation of and prediction from internalised personality models at the cognitive and neurobiological level.
Furthermore, considering that one of the key experiences of voice hearing is the lack of agentive control over the voices, and that there is a link between social stress, trauma, and auditory hallucinations, the internal social models of individuals associated with intense traumatic or emotional experiences should be less predictable, and the resultant imagery more intrusive, than for individuals not associated with such experiences. In addition, recent evidence has emerged that psychiatric voice hearers are much more likely to identify voices as specific living people than non-clinical voice hearers, suggesting that social cognitive factors may differ depending on the level of disability associated with the experience.
To further clarify these issues there is a clear need for more fundamental evidence to be gathered about the nature of voice-hearing experiences. Studies on how voices are distinguished as “individuals”, which social characteristics they are perceived to have, and how this is distinct from being associated with specific identifiable persons in the outside world (as opposed to voices with distinguishable personalities who remain “anonymous” or “incognito”) are still lacking. Furthermore, little attention has been paid to how voices evolve over time, and knowing when voices become “social” may be key to understanding the role of social cognition in the aetiology of auditory verbal hallucinations.
Over the last decade, psychological-level research has focussed on the link between social cognition and auditory verbal hallucinations and has amassed a significant amount of evidence as a result. However, researchers working in cognitive neuroscience, who are specifically looking to make links with neurobiology, have only occasionally engaged with studies that have investigated the social neurocognition of hearing voices. Despite some provocative results, they have not yet used paradigms that would disentangle the extent to which the “social brain” is part of the hallucinatory experience. This is clearly an area where more targeted research needs to be completed. Similarly, more effort needs to be put into developing theories that include the socially relevant evidence, as this has been largely ignored in both cognitive and neurocognitive accounts.
As one of our most enigmatic experiences, “hearing voices” is at once both individual and social. There is a clear need to understand it in terms of the individual mind and brain, and a clear opportunity for it to shed light on the social world that lives within us.
Many thanks to Charles Fernyhough for useful and stimulating discussions during the writing of this article.