The Irish Undergraduate Journal


The Undergraduate Journal of Ireland & Northern Ireland

Volume I

A collection of winning essays from our inaugural year




The Undergraduate Awards of Ireland & Northern Ireland


Published in 2010 by the Undergraduate Awards of Ireland & Northern Ireland. Undergraduate Awards of Ireland & Northern Ireland c/o Google Ireland Ltd, Gordon House, Barrow Street, Dublin 4 www.uaireland.com info@uaireland.com Copyright © 2010 The Undergraduate Awards of Ireland & Northern Ireland. All rights reserved. This book may not be reproduced, in whole or in part, including illustrations, in any form without the written permission of the publishers. Set in Kepler 9.75/11 by Gearóid O’Rourke of 50RSt.com, Dublin, Ireland. Printed by Conway Media, Rathnew, Co Wicklow.


The Undergraduate Awards of Ireland & Northern Ireland

Founders
Patrick Cosgrave
Oisín Hanrahan

Editorial Board
Jim Barry – Chair
Aine Maria Mizzoni – Vice-Chair
Dr. R. Barnett
Prof. Don Barry
Bobbie Bergin
Dr. James Browne
Martin Curley
Maeve Donovan
Dr. Peter Kennedy
Dr. Brigid Laffan
Prof. P.J. Prendergast
Dr. Ferdinand von Prondzynski
Dr. Frances Ruane


Leading the charge to unlock Ireland’s true potential

The following address was delivered by President Mary McAleese at the inaugural awards ceremony of the Undergraduate Awards of Ireland & Northern Ireland. This event took place in the Library of the Royal Irish Academy on Tuesday October 20th 2009.

Dia dhíbh a chairde. Cuireann sé áthas orm bheith anseo libh inniu agus tá mé thar a bheith buíoch daoibh as an chuireadh a thug sibh dom. My thanks to Jim Barry for the invitation which allows me to be part of this gathering which celebrates the achievements of Ireland’s finest undergraduates. The Irish Undergraduates Awards are new and they are an important step forward in acknowledging the role played by our undergraduates in helping to advance Ireland’s ambition to be not just a smart economy but a just and decent sophisticated society.

Mostly undergraduate essays, research and projects are done for the reward of marks that get them through the eye of the academic needle. Sometimes that important utilitarian end obscures the value to the wider world of scholarship of work that is insightful, creative and worthy of a much wider platform than the examiner’s pile. These awards encourage our top undergraduates to believe in the validity of their work and in their entitlement to a public place of respect within scholarly discourse.

So it is particularly fitting that these awards should be inaugurated here in the Royal Irish Academy and that the winning entries should be published in the Undergraduate Journal of Ireland opening up the work of these awardees to a much larger audience. The Journal will quickly become required reading in academia and the public and private sectors as it not only showcases their ideas, observations, analyses and provocations but showcases too the foundational and inspirational work undertaken by the lecturers, departments and universities across Ireland which helped develop these young, curious minds and give them their confident voice.


If anyone thought our students were overwhelmed by the ambient economic uncertainty they can take courage and correction from the fact that over 1500 students submitted work for this inaugural award. Of these, thirty-three winners have been chosen from the nine Universities across the island of Ireland and from every conceivable discipline. The range of subject matter is breathtaking, from the topical “The Feasibility of Onshore Wind Farms in Ireland” to the intriguing “Can Sea Sponges Cure Cancer?” and the eminently practical “Integrated Ticket for Public Transport”. I also took particular pleasure in an essay entitled “In the male dominated writing culture of the Middle Ages, can a female voice ever be represented as authoritative and reasonable.” It all makes for one very exciting, eclectic journal, a real melting pot of the best thinking of our best students and a new forum for the synergies that come from crossing disciplinary boundaries, different approaches and fresh thinking which will be one of the keys for a return to sustainable prosperity. A newly educated generation turned the tide of history to give us peace for the first time in centuries. It turned the tide of emigration and brought us from poverty to a new level of prosperity. The upward trajectory of economic growth has stalled in Ireland and elsewhere thanks to human frailty and fallibility. This generation lives with the consequences and is challenged to learn the lessons so that by their ingenuity there will be a new surge of prosperity and by their integrity it will be rooted in more enduring values and virtue. These young award winners whom we are celebrating today will lead the charge to unlock Ireland’s potential and deliver the Ireland of wide and equal opportunity that we have aspired to from the first courageous steps towards independence.
I extend special thanks to all those who helped make the Irish Undergraduates Awards a reality in particular Oisín Hanrahan and Paddy Cosgrave who proposed the awards and all the Board Members who have driven this project of encouragement and recognition at a seminal moment in the lives of our students. A special thank you to the Universities for their important contribution, to Pauric Dempsey of the Royal Irish Academy for providing this magnificent venue for today’s ceremony and Kingsley Aikens of the Ireland Funds, Hugh O’Regan and Brendan Tuohy all of whom have been instrumental in taking the idea of the awards and turning it into reality. But the biggest thanks is due to all those who entered the Awards and especially those who have the distinction of being the first ever winners and so first ambassadors of what we hope will become a regular event. Every success to each one of you and I hope this recognition will encourage you to new levels of belief in yourself and in your potential service to our country. Comhghairdeas libh arís agus go raibh fada buan sibh. Go raibh maith agaibh. President Mary McAleese

Uachtarán na hÉireann


Contents

Leading the charge to unlock Ireland’s true potential
Introduction from President Mary McAleese

Acknowledgements

Award Nominees

Nutritional factors affecting the fertility of Dairy Cows
Claire Fitzsimons

Transcription & analysis
Benjamin Larkin

The rectilinear houses of the Irish Early Neolithic: The introduction of new identities, ideologies & economies
Russell Ó Ríagáin

Investigating the structural characteristics of transient protein-protein interactions
Niamh Parkinson

Different cultures, same culture? International HRS in Whole Foods Market Inc.
Anne Byrne, Grainne Conroy & Megan Huxhold

Catalytic methods for the destruction of chemical warfare agents under ambient conditions
Linda O’Connor

Can Sea Sponges cure Cancer?
Roisin O’Flaherty

The Son he never had: Zeus’ parthenogenetic creation of a surrogate son?
Melanie Hayes

Beetlz – BON software model consistency checker for Eclipse
Eva Darulová

The aetiology & management of gingival hyperplasia in organ transplant patients
Emer Walshe

Bigas Luna’s Retratos Ibericos & the gendered performance of Self
Ciara Barrett

Early human settlement in the British Isles & Northern Hemisphere glaciations
Caroline Martin

An econometric investigation into whether the term spread helps to explain the dynamics of GDP growth in the euro area
Michael Curran

Should we allow a market for kidneys? An Economist’s consideration
M. Lorraine Chadwick

In Ireland, recent legislation & policy in health, education & social services have changed the nature & practice of early childhood education care & services
Dairine Taaffe

An integrated condition assessment & empirical approach to predict risk levels due to subsurface construction
Julie Clarke & Laura Hannigan

Tragedy in triumph: The lost paradises of Hal the Hypocrite
Tim Mc Inerney

The evolution of the adaptive immune system
Darren Fitzpatrick

The Earth’s disciples: Geographers & the reinterpretation of Space in the 21st Century
Drew Reid

Kingsley’s natural selection: The significance of Darwin in his works
Abigail Rowe

Innéacs de bhailiúchán amhrán a rinne Cosslett Ó Cuinn i nGabhla, i dToraigh agus in Árainn Mhór
Kayla Reed

A hundred indecisions: Paralysis in Mansfield’s “The Daughters of the Late Colonel”
Emily Bourke

Irish nervous shock law is flexible but underdeveloped in comparison with the stricter doctrine adopted by the courts in other common law jurisdictions
Peter Dunne

Adolescent preferences & priorities for the design of current augmentative & alternative communication devices
Erika Jane Dowling

Connection Games
James Leahy

How do we run on bumpy ground?
Frederick A. English

Inflammation of the upper airway in obstructive sleep apnoea
Brian Mac Grory

Men’s health promotion strategies & erectile dysfunction
Maria Jarosinska

Define nursing & discuss what you consider to be the key components of nursing care
Geraldine Galand

Analysis of the p53 protein activation in Xenopus embryos
Lynne O’Shea

Consequentialists can never be good friends
Thomas Morris

Domain dynamics & switching in ferroelectric mesowires
Raymond McQuaid

Evolution of the measurement of body segment inertial parameters since the 1970s
Laura-Anna Furlong

Party mobilisation & turnout decline in Sweden
Mark Canavan

The direction of influence between language & thought
Cliodhna O’Connor

Docetism & 1 John
Eimhin J. Walsh

The “modern concept of childhood” is at odds with the everyday lives of children in the South & may lead to inappropriate programmes of action
Yvonne O’Reilly


Acknowledgements

The inaugural awards and journal are a credit to the quality of the submissions and the commitment of the 33 academic panels who selected the winning papers. Excellent education is the cornerstone of any competitive and smart economy. The Undergraduate Awards of Ireland and Northern Ireland underpin this by recognising and celebrating the highest standards of undergraduate academic scholarship. My sincere thanks to all those who assisted in making the Undergraduate Awards of Ireland and Northern Ireland a reality and my congratulations to our Gold Medal Winners.

Jim Barry & Aine Maria Mizzoni
Chair & Vice-Chair of the Editorial Board

As we look back on the first year of the Undergraduate Awards of Ireland and Northern Ireland, what is most striking is the overwhelming support this initiative has received from so many diverse individuals and organisations. From academic institutions, to lecturers, to government bodies, to private individuals, to corporations, to students, the support has been incredible. To those who supported the awards in their first year and to those who supported the establishment of the awards prior to their first year, we would like to extend our sincere thanks and express our utmost gratitude for their assistance in turning the idea of recognising undergraduate excellence into a reality. To all of the students who participated and to our winners, we would like to say thank you and congratulations. We hope the economic environment into which you will graduate will serve only as motivation to succeed, and that you will see opportunity and potential where those who have gone before you may have seen only uncertainty. We hope that those in government with the responsibility for creating the right environment for your spark of brilliance to succeed will deliver. Finally, we would like to recognise and thank those who brought this journal to life through judging, editing, and through layout and design.

Paddy Cosgrave & Oisin Hanrahan
Founders of the Awards


Award Nominees

Agriculture
Michelle Guthrie Miriam Hirzel Niamh Bourke Breige Flynn Martin Breen Kevin Oxx

Anthropology
Eileen Murphy Dwaine Martin Kate O’Brien Margaret Dunne Sharon Costello Jindrich Mraz Tamryn Reinecke Tara McAssey Joanna McClatchie

Archaeology
Lorraine Shannon Jane O’Dwyer Ronan Considine Margaret Williams Mark O’Callaghan

Biochemistry
Eamon Geoghegan Christina Dix Lisa Vincenz Sarah Conmy Georgina Murphy Jim O’Connell Ronan Lyne Darren Ruane Tatiana Papkovskaia Jonathan Roddy

Business
Kristin Huber Orla Caffrey Mary Redmond Stephen Denham Alexander Mann Lisa de Jong Christoph Walsh Charlotte Wickham Mark O’Flynn Jack Berrill Jason O’Connor Joseph Cummins Danielle Ryan John Lavelle Victoria Whelton Colin Kuehnhanss Caitriona O’Connor Rory Costello Jennifer Cowman David Butler Jo-Ann O’Sullivan Declan Clancy Paula Corcoran Jennifer Foley David Breslin Paul Gallagher Nathalie Ennis

Chemistry
Michelle McKinney Laura Moran Teresa Loftus Aoife Amy Malone Nordon Kathleen Melzer

Classics
Robert Stratton Michael Debets

Computer Science
Fergal Walsh Niamh Nic Clámha Rebekah Burke Adrian Seung-Bum Gabriel-Miro Kwizera Lee Muntean Colin O’Brien John Reddin Jimmy Cleary Jeff Warren Wu Hao Brent Kelly Fathi Ramly Nicola Burns Jonathan Synott Noel Kennedy Michael Waters

Dental Science
Zohaib Ali Paul Hooi Marian Cottrell Harry Stevenson Erin Cecelia Bolton Paul Kielty

Drama, Film & Music
Emer FitzGerald Christopher Corcoran Aisling Byrne Emily Griffin Ellen-Jane Kruger Catherine Hughes Nina-Maria Häggblom Grace Kelley Laoibhse Louise Griffin Sara Joyce Cillian O’Connor Sarah Cronin Ben Murnane Sinéad Finegan Christopher Collins George Jackson Roisin McMullin Susannah Norris Michelle Cleary Ross Fortune Pauric Havlin Erica Mills Alexandra Christy Aoife Mac Alister Paul Fennessy

Economics
Xiang Fang Xin Xu James McLaughlin Aidan O’Hare Jonathan Wyse Christopher Sale David Madden Michael Bracken Cillian Murphy

Education
Oonagh Marian Keane Liam O’Reilly Sarah Falvey

Engineering
Colm Bhandal Ivan Rochford Deaglan Gibbons Aoibhín Gaynor Tomas Kelly Catherine Keigher Brian Kelly Nick Hyland Chris Hurley Rory Clune William Horgan Jeremiah O’Riordan Rory Gallagher Steven Ferguson Mark O’Connell Kate Smith Patrick Murphy Eamon Lannoyle David Ferns Paul Durcan John O’Donoghue Yuanyuan You

English
Helen Heaton Matthew Callaghan Brian Doyle Orna Mc Donald Karina Jakubowicz Robert Kiely Conor Minogue Deirdre Ni Annarachain Nathaniel Forde Sinéad Murphy Stephanie Courtney David Bernard Alexandra Duchene Colin Sweetman Duncan Wallace Alyson Bailey Mary-Ann O’Dwyer Mary O’Halloran Steven Kelly Muireach Shankey-Smith Niamh Campbell Jean Hogan Eileen O’Mara Walsh Matthew Callaghan Vicki McKenna Ann Marie Wade

Genetics & Microbiology
Helen Devine Colm O’Rourke Gareth O’Dwyer Aisling Miller Enda Shevlin Kate Ferguson Kaia Berstad Sarah Louise Gill Conor McKenna

Geography
Theresa Connell Joseph Usher Richard Webb Rory Flood Craig Rankin Laila Higgins

Earth Sciences
Shona O’Rourke John Bill Robert Weatherill Wood Rotherham Jennifer Scully Bill Wood Eoin Mulvihill

History
Hugh Taylor Giulia Ni Dhulchaointigh Aidan Conway Caitriona Ní Dhubhda Grace Bolton Oisín Smith David Gareth Toms Abigail Duignan David Durnin Brendan Corcoran Penny Baxter Frances Nolan Meaghan Woulfe Maire Breathnach Ní Ghormain

Irish
Colm Ó Neachtain Meabh Ní Coileain Catherine Curtin Giollosa Uí Lorcain Clair Johnston

Languages
Grzegorz Grzybek Matthew Callaghan Kathe Rothwell Gabija Guogyte Eithne Lonergan Sandra Quinn Matthew Krasa Meabh Keane Mathilde Chaigneau John Ryan

Law
Brian J. Doyle Sana Farooq Khan Rosemary Henningan Alison Shanley Nikki O’Sullivan Sarah O’Meara Mary Flanagan Jane Mc Cooey Helen Kerr Joanne O’Toole-Byrne Martin Corrigan Deborah Magill

Linguistics
Orla Tighe Gavin Murphy

Medicine & Health Sciences
Peng Hor James M. O’Donnell Andrew W. Murphy Timothy O’Brien Thomas J. B. Kropmans Shane Corcoran Alison Cregan Edward O’Connor Sinead Healy Gareth Kiernan Ishwarya Balasubramanian Karen Connell Pádraig J. Mulholland Aoife Carey Kirsty Porter Ruth O’Connell Laura Gleesol

Nursing & Midwifery
Sinead Hayes Esther Funmilayo Afolalu Michelle Carroll

Pharmacy
Emer Woods Louisa Conlon Rebecca Ring Fiona Carr

Philosophy
Michael James Regan Ronan Daly Stephen McCarty Siobhan Moriarty Matthew Mckeever Olivia Russell M. David Walsh Evan Hargadon David Hastings Patrick Hastings Alexander Court Katie Mcneice

Physics
Jason Jensen Daniel Ryan Sonia Buckley Laura Horan Anna O’Faoláin de Bhróithe Jennifer Joyce

Physiotherapy
Louise Reilly Sean James Ledger Andrea Mc Carthy Aideen Shinners Catriona O’Dwyer

Politics
Chad Keveny Rachel Gilliland Carmel Joyce Fakhra Zafar Patrick Kilmartin Lorcan Patrick Byrne Sean Ó Conghaile Holly Wilson Byrne Aisling Lynch Paula Kennedy

Psychology
Mark Glennon Odhran Irwin Michelle Downes Damien Daly Colin McDonnell Louise Smyth Sheila Armstrong Karen McAllister Ciara Amory Caoimhe Nic a’ Bháird Laura Mangan Niamh Skelly Charles Crandon Fionnuala Malone Manus Moynihan Maria Aisling Higgins

Religion, Theology & Ecumenics
Brendan Rea Stephen Murray Jill McArdle Ruth Lee Ronald A. Geobey Niamh Murphy Anna Williams Egle Zinkute Claire Dunne John Philip Magennis Stefan Bartik

Sociology
Clodagh Ni Chearbhaill Nicole Byrne Ciara Finlay Judy Brown Jane Wigglesworth Jean Byrne Violet Wilkin Sarah Lagan McGreevey Nicola Donnelly Stacey Thom Natasha Moore



AGRICULTURE PANEL

Judging Panel
Prof. Dolores O’Riordan (University College Dublin) – Chair
Prof. Pat Lonergan (University College Dublin)
Prof. Séamus Fanning (University College Dublin)
Dr. Eileen Gibney (University College Dublin)
Dr. Deirdre O’Connor (University College Dublin)

Judges’ Comments
The paper entitled “Nutritional factors affecting the fertility of dairy cows”, authored by Claire Fitzsimons, was unanimously selected as the winning paper by the assessment panel. The paper explores how the nutritional management of dairy cows can influence the reproductive efficiencies of the dairy herd. The subject material is well organised and there is a logical, clear presentation of all the relevant sections. The content of the paper is accurate and the conclusions drawn are based on well-presented evidence. The panel were particularly impressed with the level of critical appraisal of the literature and the high level of synthesis demonstrated by the author. Overall the submission was considered to be a well-researched, topical, original manuscript. The judging panel were very pleased to recommend Claire Fitzsimons for this award.


AGRICULTURE

Nutritional factors affecting the fertility of Dairy Cows

Claire Fitzsimons

Introduction

The production of milk from the dairy cow is dependent on getting the cow in calf, maintaining the pregnancy and obtaining trouble-free parturition of the foetus. The success of this process relies on the fertility of the individual cow. There are many different measures of fertility, e.g. non-return rates after artificial insemination (AI), number of days not pregnant and calving interval (Butler et al, 1989); however, conception rate is a measure that can be used internationally. Conception rate is defined as the percentage of cows which hold to service, with first service often used as the benchmark. A figure of 65% conception to first service is regarded as very good (Blowey, 1999); however, conception rates have fallen well below this figure, as seen in Fig. 1. This phenomenon of low conception rates to first service is being experienced world-wide in high yielding dairy cattle and has occurred in the last 10 to 15 years (Blowey, 1999). Ireland is by no means unique in this respect and has also experienced a decline in reproductive performance in the dairy herd since the mid-seventies. This is characterised not only by low conception rates but also by high rates of embryonic mortality (Moore et al, 2006).

Fig. 1. Diskin et al (2006) estimate the reproductive performances of British-Friesians in 1980 compared to those of Holstein-Friesians in 2006.

This decline has been linked with the increased proportion of North American Holstein Friesian genetics in dairy cows through intense artificial selection for high milk yield (Buckley et al, 2000); however, this is not the only factor (Mulligan et al, 2007). It is widely accepted that there is an inverse relationship between conception rate and milk yield. Such intense selection has altered the metabolism of the Holstein Friesian dairy cow. The large demand of high milk production immediately after parturition cannot be met by feed intake during early lactation and thus the cow falls into ‘negative energy balance’ (NEB) (Butler et al, 1989). NEB is where the cow uses her own body fat reserves to meet the demands of the mammary gland (Ball et al, 2004), commonly known as when the cow ‘milks off her back’ (Murphy, 1998). In order to minimise this NEB and excessive mobilisation of body fat, the nutrition of the cow must be optimised (Ball et al, 2004). Body condition scoring is a management aid that can be used to assess the current physiological status and fat reserves of the cow. In Ireland body condition scoring is conducted on a five point scale with the animal scored on a monthly basis, or however often the farmer desires (Gordon, 1996).

Fig. 2. Graph showing how milk yield demand surpasses appetite (Blowey, 1999).

Blowey (1999) states that poor conception rates may be a result of high protein diets, especially diets with high levels of rumen degradable protein (RDP). This may be due to the deleterious effect that high protein (especially high RDP) has on oocyte quality (Chagas et al, 2007); however, this is contradicted in a study conducted by Kenny et al (2001), where it was established that high ammonia and urea had no negative effects on embryo survival. High crude protein must be excreted as urea, which is an energy demanding process and may exacerbate the NEB being experienced by the cow (Roche, 2006). It is therefore possibly the effect that high protein has on NEB that influences fertility, and not high protein per se.

From a study conducted by Siciliano-Jones et al (2008) there is conflicting evidence in relation to the effects that supplementation of diets with trace elements has on the fertility of the dairy cow. Some studies suggest there is a role for these minerals in ameliorating poor reproductive performance, while other results contradict these findings. However, Blowey (1999) asserts that trace element and mineral deficiencies have been associated with substandard fertility, with an emphasis on the calcium-to-phosphorus ratio.

There is a lot of evidence to support the beneficial role of feeding supplemental fatty acids on the fertility of dairy cows. Such benefits include assistance of uterine involution, reduction of uterine infections (Roche, 1996), increased numbers of ovarian follicles and an increased level of progesterone release from the corpus luteum (Verkerk, 2000). Feeding supplemental fat in the diet can often lead to increased milk yield with no beneficial effects on the NEB of the cow. Nevertheless, some studies have shown positive effects on reproductive traits (Gardner et al, 2001). On consideration of all these factors, the part nutrition and its associated hormones and metabolites play in the reproductive efficiency in cattle is emphasised in this project.
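As an illustration of the conception rate measure defined in the Introduction, the short Python sketch below computes the percentage of served cows that hold to first service and compares it with the 65% figure that Blowey (1999) regards as very good. The function names and the herd numbers are hypothetical and are used only to show the arithmetic; they are not data from any study cited here.

```python
# Minimal sketch: conception rate to first service as a percentage.
# The herd figures below are invented for illustration only.

VERY_GOOD_FIRST_SERVICE_RATE = 65.0  # percent, the benchmark quoted from Blowey (1999)


def conception_rate(conceived: int, served: int) -> float:
    """Return the percentage of served cows that held to service."""
    if served <= 0:
        raise ValueError("served must be a positive number of cows")
    if not 0 <= conceived <= served:
        raise ValueError("conceived must lie between 0 and served")
    return 100.0 * conceived / served


def meets_benchmark(rate_percent: float) -> bool:
    """True when the rate reaches the 65% 'very good' figure."""
    return rate_percent >= VERY_GOOD_FIRST_SERVICE_RATE


# Hypothetical herd: 100 cows served once, 48 held to that first service.
rate = conception_rate(conceived=48, served=100)
print(f"Conception rate to first service: {rate:.1f}%")
print("Meets benchmark" if meets_benchmark(rate) else "Below benchmark")
```

Expressing the measure as a simple ratio is what allows it to be compared between herds and countries, which is the reason the text gives for preferring it to other measures of fertility.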

Negative Energy Balance

The most metabolically demanding time for the dairy cow is the transitional period between late pregnancy and early lactation (Gardner et al, 2001). Butler et al (1989) state that energy balance is the term used to describe the relationship between dietary energy intake and energy utilisation. Energy balance is calculated using the equation:

NEL (consumed) – NEL (required) = daily energy balance

(where the NEL requirement includes maintenance and production).

Energy balance is used to establish the metabolic status of the cow because it is more accurate than measuring milk yield or other production traits (Butler et al, 1989). In early lactation, the dairy cow will often meet the demands of high milk yield regardless of dietary energy intake (Verkerk, 2000). This energy deficit is known as a negative energy balance (NEB) as demand exceeds supply (Butler et al, 1989). During this period there is mobilisation of body tissues (Verkerk, 2000; Wathes et al, 2002) and thus the cow loses body condition (Wathes et al, 2002). After calving the subsequent NEB may last for several weeks (Butler et al, 1989). It must be remembered, however, that this is a normal process within the cow’s physiological system and some utilisation of body reserves will take place (Lucy, 2001). The aim of the producer is to minimise the extent of this NEB as much as possible. NEB reaches its nadir within the first (Rizos et al, 2008) or second week of parturition (Butler et al, 1989). Butler et al (1989) also say that recovery or improvement in NEB may be the initiator of resumption of ovarian activity. The difficulty with NEB is more acute in primiparous heifers as they are still striving to achieve their mature body weight whilst in lactation. A complicating factor is that heifers are also changing their teeth at this time, which makes eating uncomfortable. Despite all this, the nutrition must be sufficient for the cow to replace her lost body condition and gain some condition prior to being served by artificial insemination (AI) or natural service (NS) (Blowey, 1999).
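The energy balance equation in this section can be sketched in a few lines of Python. The sketch below subtracts the net energy required (maintenance plus production) from the net energy consumed, and then locates the nadir in a series of weekly values. The units (MJ per day), the function names and every number are assumptions made for illustration; none of them come from Butler et al (1989) or the other sources cited.

```python
# Minimal sketch of the daily energy balance calculation.
# All values are hypothetical and assumed to be in MJ of net energy per day.


def daily_energy_balance(ne_consumed: float,
                         ne_maintenance: float,
                         ne_production: float) -> float:
    """Net energy consumed minus net energy required (maintenance + production)."""
    ne_required = ne_maintenance + ne_production
    return ne_consumed - ne_required


def nadir_week(weekly_balances: list[float]) -> tuple[int, float]:
    """Return (week number, balance) for the lowest balance; weeks count from 1."""
    if not weekly_balances:
        raise ValueError("weekly_balances must not be empty")
    lowest = min(range(len(weekly_balances)), key=weekly_balances.__getitem__)
    return lowest + 1, weekly_balances[lowest]


# Hypothetical cow: (energy consumed, energy for milk production) in weeks 1 to 4
# after calving, with a constant maintenance requirement.
MAINTENANCE = 40.0
weekly_intake_and_production = [(100.0, 90.0), (105.0, 100.0), (120.0, 100.0), (140.0, 95.0)]

balances = [daily_energy_balance(consumed, MAINTENANCE, production)
            for consumed, production in weekly_intake_and_production]

week, lowest_balance = nadir_week(balances)
print("Weekly balances:", balances)
print(f"Nadir in week {week}: {lowest_balance} MJ/day")
```

A negative result corresponds to NEB, where demand exceeds supply; in this invented series the balance is lowest in the second week and becomes positive again by the fourth, mirroring the early nadir and gradual recovery described in the text.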

Gonadotrophins

Under normal conditions, the dairy cow resumes ovarian activity shortly after parturition. From as early as 7 days post partum, follicle development may commence. Plasma concentrations of oestradiol decrease following expulsion of the calf and the placenta. The inhibition of FSH secretion is halted, which leads to increasing plasma concentrations of FSH, and follicle development recommences (Thatcher et al, 2006). Following parturition, the interval to resumption of ovarian activity is correlated with the duration and the severity or nadir of the NEB (Butler et al, 2006). Negative energy balance can be seen to affect hypothalamic GnRH secretion, pituitary gonadotrophic secretions or the growth hormone – insulin-like growth factor (IGF) – insulin axis (Diskin et al, 2003). From Fig. 3 it is easy to understand how this balance can be easily disrupted. All the factors are intertwined and are cyclic. Disruption of one has negative effects on the rest of the cycle.

Fig. 3. The endocrine changes during the bovine oestrous cycle correlated with ovarian follicular and corpus luteum development from Moore et al (2006). E2 = oestradiol, IGFBP-4 and -5 = insulin-like growth factor binding proteins 4 and 5, OvF = ovulatory follicle, LH = luteinising hormone, FSH = follicle stimulating hormone, PGF2α = prostaglandin F2α.

The effect of NEB on GnRH of the hypothalamus is an example of this complex relationship. NEB does not directly affect secretion of GnRH from the hypothalamus; rather, it acts through a lack of positive feedback from oestradiol. GnRH secretion is dependent on the secretion of oestradiol from the pre-ovulatory dominant follicle (DF). The oestradiol from the DF stimulates GnRH secretion from the hypothalamus, which acts on the anterior pituitary to secrete LH. This LH is required by the follicle to produce androgens and hence oestradiol. During periods of nutritional restriction in heifers, it has been shown that low concentrations of oestradiol and therefore low positive feedback in the follicular phase may have contributed to reduced GnRH pulsatility (Diskin et al, 2003).

The role of LH in the reproductive cycle of dairy cows is of paramount importance. It is responsible for the interval between calving and first ovulation, oocyte growth and oocyte maturation. Therefore, disruption of LH pulsatility and amplitude is considered a major side effect of NEB (Leroy et al, 2008). Diskin et al (2003) suggest that LH pulsatility in beef heifers is only altered after a certain amount of body fat loss has occurred. However, because the dairy cow is under severe metabolic stress from lactation, the amount of body weight loss needed to trigger this reduction in LH may be significantly reduced. In nutritionally restricted ovariectomised ewes it was found that undernutrition led to lower concentrations of the mRNA responsible for LH synthesis, which led to the pituitary producing lower levels of LH. Diskin et al (2003) also make reference to a study which suggests that nutritional reduction in LH secretions may also in part be due to factors affecting GnRH secretion and pituitary receptivity to GnRH.

There is evidence to suggest that NEB does not affect concentrations of FSH postpartum. Follicle growth commences in cows in response to elevated concentrations of FSH, which occur naturally around 10 days after calving. Research suggests that neither energy balance nor dietary intake has an effect on the initial rise in concentrations of FSH after calving. The effects of NEB are not manifested in the limitation of FSH secretion or follicle development but are exhibited through the processes involving ovulation, namely follicle viability and LH secretion (Diskin et al, 2006; Diskin et al, 2003).

Metabolic Hormones and Metabolites Diskin et al (2003) state that the role of metabolic hormones such as growth hormone (GH), insulin, insulin-like growth factor I (IFG-I) and leptin in the control of ovarian follicle development is very important. These metabolic hormones are also vital intermediaries in the effects of energy balance or dietary intake. Falkenberg et al (2007) also state that changes in the GH – IGF-I axis are associated with severe NEB, e.g. GH resistance of hepatic tissues and down-regulation of liver GH receptors. Conventionally, high producing dairy cows experience a dip in the plasma concentration of the hormone IGF-I immediately after parturition. However, this is corrected by subsequent up-regulation of growth hormone receptors in the liver which stimulate the production and release of IGF-I (Taylor et al, 2004). The hepatic tissue of cows during NEB becomes resistant to GH, liver GH receptors are down-regulated, IGF-I synthesis decreases and plasma levels of IGF-I remain low despite an increase in GH (Taylor et al, 2004; Santos et al, 2007; Rizos et al, 2008). Diskin et al (2003) conclude that GH has more of a facilitatory rather than a direct role in reproduction. This is because of its regulatory effects on hepatic synthesis and secretion of IGF-I (Diskin et al, 2003; Santos et al, 2007). During early postpartum when cows are in NEB concentrations of insulin are seen to decrease in high yielding dairy cows. Research has shown that insulin appears to be a metabolic signal that restarts the GH – IGF-I system. Insulin enhances the follicular response to gonadotrophins which in turn regulates the growth of the follicle. This metabolic hormone may also have direct stimulatory effects on the maturing oocyte, however further research is needed in this area (Leroy et al, 2008). 
Insulin and IGF-I promote LH-stimulated androgen production from thecal cells of the follicle, which in turn promotes oestradiol secretion from the follicle and so influences follicle development and competence (Diskin et al, 2003; Santos et al, 2007). It has also been shown that insulin increases circulating concentrations of IGF-I by increasing hepatic expression of mRNA for IGF-I (Diskin et al, 2006).

Insulin-like growth factor I (IGF-I) is a growth factor involved in many processes within the body that stimulate cell division or differentiation. Its actions can be seen in the bovine reproductive tract, and it is thought to play a role in the establishment and maintenance of pregnancy (Taylor et al, 2004). The site of action of IGF-I is generally assumed to be within the follicle, where it affects the ability of thecal and/or granulosa cells to produce steroids and stimulates their proliferation (Diskin et al, 2003; Stewart et al, 1996; Taylor et al, 2004). There is some evidence to suggest that IGF-I concentrations also affect pituitary and hypothalamic function. Studies in vivo by Stewart et al (1996) have shown that IGF-I increases the number of LH binding sites in thecal cells, which may account for the increased LH-stimulated androstenedione and progesterone production by these cells (Diskin et al, 2003). This leads to the subsequent enhancement of oestradiol production by granulosa cells, which is a prerequisite for ovulation (Stewart et al, 1996; Diskin et al, 2003). It is also documented that IGF-I acts as a survival factor by preventing ovarian follicular cell apoptosis (Taylor et al, 2004; Stewart et al, 1996).

Fig. 4. Changes in mean concentrations of IGF-I in plasma and milk of 50 multiparous cows from two weeks before calving to 20 weeks after calving (Taylor et al, 2004).

Taylor et al (2004) state that the GH – IGF-I system has an important role in the survival of the embryo. In addition, IGF-I may have a direct role in regulating the growth of the embryo. In a study by Taylor et al (2004), low concentrations of IGF-I before and after calving in multiparous cows were associated with a failure to conceive; see also Fig. 5 (Diskin et al, 2006). It was also stated by Diskin et al (2003) that low concentrations of IGF-I in beef cows resulted in an extended post-partum interval (PPI). The results of the study conducted by Taylor et al (2004) showed that post-partum IGF-I concentrations were lower, and periods without ovarian activity were longer, in cows with higher peak milk yields (Table 1). This negative correlation between milk yield and IGF-I concentrations in dairy cows (Spicer et al, 1990) is related to higher concentrations of GH in these high yielding cows. More GH in the system leads to excessive fat mobilisation and an increased level of nutrient availability for milk production.
This excessive fat mobilisation is associated with liver GH resistance and a subsequent reduction in the amount of IGF-I produced by the liver (Taylor et al, 2004).

Fig. 5. Relationship between plasma concentrations of IGF-I during first 28 days of lactation & probability of conception rate to first service in dairy cows (Diskin et al, 2006).

Although a lot of research is focused on the role of IGF-I in the reproductive tract of dairy cows, it is important to realise that it is the insulin-like growth factor binding proteins (IGFBPs) that modulate the bioactivity of the hormone itself and contribute to fertility (Leroy et al, 2008; Spicer et al, 2008; Chagas et al, 2006). The IGFBPs transport IGF-I (Jones et al, 1995) and increase its half-life (Diskin et al, 2003; Diskin et al, 2006). These binding proteins regulate the availability of IGF-I to target cells in the follicle. The NEB experienced by the cow post partum lowers the concentration of IGFBPs, which limits the availability of IGF-I to the follicle cells and in turn limits the cells’ ability to act in association with pituitary gonadotrophins to stimulate cell proliferation and production of steroids. This proliferation of thecal and granulosa cells and their steroidogenesis is vital for ovulation (Diskin et al, 2006).

Diskin et al (2003) state that glucose has an effect on the pulsatility of LH through its effect on GnRH. Glucose does not directly affect the pituitary gland but modulates GnRH release from the hypothalamus via detection sites in the central nervous system (brain). From this research, glucose may be considered a metabolic signal involved in the regulation of GnRH secretion. Low concentrations of blood glucose are thought to inhibit GnRH pulses from the hypothalamus and thus lead to low pulses of LH. As a result of this low LH pulsatility, the PPI of dairy cows in NEB is prolonged (Rizos et al, 2008). Leroy et al (2008) also hypothesise that low blood concentrations of glucose directly affect oocyte quality. Glucose is a vital metabolite for the developmental capacity of the oocyte. The conversion of glucose to pyruvate and lactate provides substrates for ATP production in the cumulus cells.
Within the oocyte itself, glucose is metabolised for DNA and RNA synthesis via the pentose phosphate pathway. This DNA and RNA synthesis is involved in the meiotic progression or maturation of the oocyte. It has been demonstrated that follicular fluid concentrations of glucose can be influenced by the nutritional status of the cow. The low concentration of glucose, or hypoglycaemia, experienced in NEB alters the microenvironment of the pre-ovulatory follicle, which is likely to compromise the developmental capacity of the oocyte.

Peak milk yield (kg/day)   Cows   IGF-I concentrations 1 week before calving   Minimum plasma IGF-I   IGF-I at first service
26 to 39                   41     39                                           32                     79
40 to 45                   43     38                                           29                     61
46 to 52                   43     30                                           25                     62
53 to 66                   42     29                                           23                     55

Table 1. Quartiles of peak milk yield, periods to return to cyclicity and plasma IGF-I concentrations (ng/ml) after calving (adapted from Taylor et al, 2004).

Circulating levels of metabolites such as non-esterified fatty acids (NEFAs) and beta-hydroxybutyrate (BHB) are indicative of the extent of the NEB experienced postpartum by the dairy cow. In early lactation the cow mobilises body fat to satisfy the energy requirements for maintenance and lactation. The breakdown of these lipids increases the concentration of NEFAs in the blood (Hoedemaker et al, 2004). This large rise in lipid mobilisation increases the uptake of NEFAs by the liver. When lipid mobilisation becomes excessive, the ability of the liver to metabolise the NEFAs is surpassed and triglycerides accumulate within the hepatic tissue, causing fatty liver. This results in suboptimal liver function, which has a negative effect on fertility (Hoedemaker et al, 2004; Leroy et al, 2008). Leroy et al (2008) say that previous research has shown that the high NEFA levels experienced in NEB were reflected in the follicular fluid of dominant follicles of dairy cows. They also state that NEFAs may have toxic effects on the maturation rate of the oocyte. The relatively low fertilisation, cleavage and blastocyst formation rates may result from induction of apoptosis and cumulus cell necrosis. These trials were conducted in vitro; however, where trials were conducted in vivo, the results were not in agreement (Rizos et al, 2008). Accumulation of NEFAs in the liver leads to the formation of ketone bodies such as BHB (Beever, 2006). These ketone bodies have toxic effects on the cells of the immune system, which leaves the cow susceptible to many infections.
With the immune system of the cow at suboptimal levels, the fertility of the cow is indirectly compromised by these BHBs. The increase in circulating concentrations of BHBs is reflected in the follicular fluid. In vitro studies showed that high levels of BHBs were detrimental to oocyte quality; however, this was due to a lack of glucose rather than the elevated levels of BHBs (Leroy et al, 2008). This is supported by in vivo studies in which no relationship was found between measured BHBs and commencement of luteal activity or subsequent conception rate to first service (Rizos et al, 2008).

Leptin, which is secreted by white adipocytes, is a peptide involved in the regulation of body weight and food intake (Boland et al, 2000; Block et al, 2001). High plasma concentrations of leptin have been associated with suppression of appetite; it is therefore considered a modulator of feeding behaviour. Leptin receptors have been found in reproductive organs and the pituitary of humans and rats (Boland et al, 2000; Diskin et al, 2003). Research has shown that expression of mRNA for leptin receptors differs between well fed and feed-restricted ewes (Boland et al, 2000), and the same has been observed in cattle (Diskin et al, 2003). In a study conducted by Block et al (2001), plasma concentrations of leptin were positively correlated with plasma concentrations of insulin and glucose and negatively correlated with plasma concentrations of GH and NEFAs (Diskin et al, 2003); see Table 2. Diskin et al (2003) note the potential role of leptin in relating energy balance to fertility, possibly through inhibition of neuropeptide Y (Boland et al, 2000). Neuropeptide Y regulates gonadotrophin release by inhibiting LH release from the pituitary of ewes (Diskin et al, 2003). All these data strongly suggest a link between leptin and reproduction in cattle; however, the mechanism has not been clearly defined. Nevertheless, Vogue et al (2004) have conducted a study in vitro showing that leptin has no effect on the levels of granulosa cell IGFBP mRNA. Leroy et al (2008) make reference to studies conducted in vitro that suggest leptin promotes the survival of cumulus cells enveloping the maturing oocyte and enhances the developmental competence of the matured oocyte.
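The direction of the correlations reported for leptin can be illustrated with the mean values given in Table 2. The following is only a rough sketch: it applies a Pearson correlation to the five time-point means from the table, not to the per-animal data analysed by Block et al (2001), so it shows the sign of each relationship and nothing more. The function name `pearson_r` and the dictionary `TABLE_2` are illustrative names and do not come from the cited studies.

```python
from math import sqrt

# Mean plasma values at weeks -4, -1, +1, +3 and +8 relative to parturition,
# copied from Table 2 (adapted from Block et al, 2001).
TABLE_2 = {
    "leptin": [5.8, 5.5, 3.0, 3.0, 2.9],   # ng/ml
    "insulin": [0.8, 0.7, 0.3, 0.5, 0.8],  # ng/ml
    "glucose": [55, 55, 50, 47, 55],       # mg/dl
    "gh": [6.7, 6.0, 8.3, 8.5, 8.8],       # ng/ml
    "nefa": [107, 121, 546, 293, 144],     # as listed in Table 2
}


def pearson_r(xs, ys):
    """Return the Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2 values)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


if __name__ == "__main__":
    # Print the sign and size of each correlation with leptin.
    for name in ("insulin", "glucose", "gh", "nefa"):
        r = pearson_r(TABLE_2["leptin"], TABLE_2[name])
        print(f"leptin vs {name}: r = {r:+.2f}")
```

With only five averaged points per variable, the coefficients themselves carry little statistical weight; the sketch is meant only to show that the signs agree with the relationships described in the text.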

Body Condition Score

According to Ball et al (2004), minimising NEB, and thus excessive deposition and mobilisation of body fat in the dairy cow, is a prerequisite for optimum fertility. One method of monitoring the cow’s body reserves is body condition scoring. The technique involves manual palpation of the subcutaneous fat cover on various parts of the body and allocating the cow a score on the basis of that cover. Cows are usually scored on the thickness of fat cover over the tail head and lumbar area (see Fig. 6). Edmonson et al (1989) devised the body condition score method mentioned above. It is based on a scale from 1 to 5, using 0.25 unit increments, with a score of 1 indicating emaciation and a score of 5 indicating obesity. The guidelines proposed by Edmonson et al (1989) allow the assessor to give an accurate BCS to the cow without assessing all areas of the cow. Due to the link between cattle body condition scores, milk yield and reproductive performance (Edmonson et al, 1989) (Table 3), BCS is used as an indicator of overall nutrition (Borsberry, 2001) and as a component of a herd health plan (Mulligan et al, 2006). As Table 4 shows, cows should calve down at a BCS of 2.75 – 3.0 (Mulligan et al, 2006; Ball et al, 2004; Roche, 2006) and should not lose more than 0.5 units of BCS between parturition and first service (Roche, 2006).

                    Weeks relative to parturition
Variable            -4      -1      +1      +3      +8
Metabolites
NEFA (μM)           107     121     546     293     144
Glucose (mg/dl)     55      55      50      47      55
Hormones
Leptin (ng/ml)      5.8     5.5     3.0     3.0     2.9
Insulin (ng/ml)     0.8     0.7     0.3     0.5     0.8
GH (ng/ml)          6.7     6.0     8.3     8.5     8.8
IGF-I (ng/ml)       124     77      40      36      40

Table 2. Changes in plasma metabolites and hormones during the transition period (adapted from Block et al, 2001).

Mulligan et al (2006) state that cows with a BCS of ≥4 in the last three weeks of gestation had a significantly lower feed intake in the period immediately pre-calving than cows with a lower BCS at the same time. These over-conditioned cows are predisposed to fatty liver syndrome, difficult calving, retained placenta, reproductive tract damage, susceptibility to infection of the tract, displaced abomasum and an increased likelihood of developing milk fever (Mulligan et al, 2006; Ball et al, 2004). Chagas et al (2006) say that cows appear to have a target level for body reserves in early lactation, which would explain why cows that are fatter at calving tend to lose more body fat after calving than thinner cows. Overfeeding in the dry period had deleterious effects on developing oocytes in vitro and embryo quality in vivo. The tissues of these over-conditioned cows are less responsive to insulin, which reduces the uptake of glucose by the cells (Santos et al, 2007).

Conversely, cows in very low BCS (<2.5) at calving are predisposed to a longer PPI, which may be due to low LH pulsatility and reduced concentrations of oestradiol (no LH surge and ovulation). Cows with a low BCS after calving have dominant follicles (DF) with a decreased diameter, reduced insulin and IGF-I concentrations and low LH pulse frequency (Roche, 2006). Cows in very low BCS after calving have very little, if any, subcutaneous fat. BCS loss at these low levels is indicative of protein loss, not the loss of internal fat reserves (Chagas et al, 2007). According to Chagas et al (2007), cows in ‘low BCS at any time during early lactation are associated with delayed ovarian activity, infrequent LH pulses, poor follicular response to gonadotrophins and reduced functional competence of the follicle’.


Fig. 6. The technique of body condition scoring of cattle over the lumbar vertebrae (Ball et al, 2004).
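The BCS targets cited above (a score of 2.75 – 3.0 at calving and a loss of no more than 0.5 units between parturition and first service, on the 1 to 5 scale of Edmonson et al, 1989) can be expressed as a simple check. This is a minimal sketch under those stated thresholds only; the function name `bcs_flags` and the wording of the flags are hypothetical and are not taken from any of the cited authors.

```python
def bcs_flags(bcs_at_calving, bcs_at_first_service):
    """Return a list of warnings for one cow, based on the targets cited in the text.

    Scores are on the 1 to 5 scale. The thresholds (2.75 to 3.0 at calving,
    maximum loss of 0.5 units to first service) are those quoted in the review.
    """
    for score in (bcs_at_calving, bcs_at_first_service):
        if not 1.0 <= score <= 5.0:
            raise ValueError("BCS must lie between 1 and 5")

    flags = []
    if bcs_at_calving < 2.75:
        flags.append("below target BCS at calving")
    elif bcs_at_calving > 3.0:
        flags.append("above target BCS at calving")

    # Loss of condition between parturition and first service.
    if bcs_at_calving - bcs_at_first_service > 0.5:
        flags.append("BCS loss greater than 0.5 units")
    return flags


if __name__ == "__main__":
    print(bcs_flags(3.0, 2.5))    # on target, loss of exactly 0.5 units
    print(bcs_flags(3.5, 2.75))   # over-conditioned at calving, loss of 0.75 units
```

A check of this kind only restates the published targets; it does not replace palpation by a trained assessor, which is how the scores themselves are obtained.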

Protein Balance in the Diet

Dietary protein of dairy cows has two components, rumen degradable protein (RDP) and rumen undegradable protein (RUP). The RDP component is ingested protein that is degraded to ammonia (and various other non-protein nitrogen substances) by the rumen microbes, whereas RUP bypasses the rumen and is digested in the small intestine (Mulligan et al, 2007). It has been widely speculated that protein metabolism can impair reproductive efficiency (Tamminga, 2006) and much research has been conducted on this topic; however, the results are not all in agreement. Present day cow diets typically have high levels of crude protein, which leads to an excess of RDP (Ball et al, 2004). These high levels of RDP result in high levels of ammonia in the rumen, with subsequent elevated systemic concentrations of ammonia and urea. Abnormal levels of these metabolites in the blood have been associated with reductions in dairy fertility (Kenny et al, 2002a; Verkerk, 2000; Chagas et al, 2007).

Santos et al (2007) say that studies in vitro have associated disruption of embryonic development with excessive concentrations of ammonia and urea. They state that high levels of urea nitrogen reduced conception rates of heifers and impaired the quality of embryos in lactating dairy cows. Santos et al (2007) suggest that the reason for these results was that the elevated concentrations of ammonia and urea alter the follicular fluid (Leroy et al, 2008) and oviductal environment. High concentrations of ammonia and urea were also seen to decrease uterine pH (Guo et al, 2004), which was associated with a reduction in fertility and embryo development (Santos et al, 2007).


Body Condition Score   Pregnancy Rate (%)
<1.5                   51
1.6 – 2.0              59
2.1                    57
2.6 – 3.0              56
3.1 – 3.5              64
>3.6                   58

Table 3. The effect of body condition at service on fertility of dairy cows (Borsberry, 2001)

Stage of lactation cycle         Target BCS
BCS at drying off                2.75
BCS at calving                   3.0
BCS at breeding                  >2.5
BCS at 150 days in milk (DIM)    2.75
BCS at 200 DIM                   2.75

Table 4. Target BCS for dairy cattle (Holstein/Friesian) at different stages of the lactation cycle (Mulligan et al, 2006).

A trial conducted by Kenny et al (2002a) using nulliparous beef heifers found that high systemic concentrations of urea and ammonia did not have a deleterious effect on embryo survival rate (see Fig. 7). Elevated levels of these metabolites also did not affect systemic concentrations of glucose, insulin or progesterone in these heifers. Kenny et al (2001) also conducted a trial involving nulliparous heifers at pasture with high and low crude protein (CP) concentrations. They found that high CP levels led to an increase in systemic ammonia and urea, but again found no deleterious effect of these systemic concentrations on embryo survival rate or development. Despite these findings, Kenny et al (2002a) state that ‘the outcome may be different in high yielding dairy cows where high dietary protein and lactation-induced severe negative energy balance could lead to deleterious interactive effects on embryo viability.’ Feeding excess protein to dairy cows may exacerbate the negative energy balance, primarily because converting this excess protein into urea, which can be excreted by the cow, is an energy-demanding process. Any worsening of the NEB will further affect reproduction and therefore fertility (Roche et al, 2006).

The Role of Fatty Acids in Fertility

The importance of lipids as components of the cell membrane should not be underestimated. Lipids also serve as a dietary energy source (Santos et al, 2007; Childs et al, 2008). Gardner et al (2001) state that in previous studies supplemental fat was added to diets to decrease the extent of the NEB by increasing the energy density of the diet; however, this often just increased the milk yield of the cow, or reduced her DMI, without alleviating the NEB. Despite these findings, there is evidence that fat supplementation improves fertility, although the effects varied with different fatty acid (FA) sources (Santos et al, 2007). Roche (2006) says that feeding rumen by-pass fats can alter the blood FA profile of cows and increase linoleic acid. It has been suggested that certain FAs may stimulate ovarian function, act as precursors of prostaglandins and increase cholesterol availability (Childs et al, 2008).

Fig. 7. Embryo survival rate to day 40 within varying quartile concentrations of plasma urea on day 7 after AI (Kenny et al, 2002a).

The polyunsaturated fatty acids of the n-3 family, eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA), were used in a trial by Mattos et al (2004) looking at the effect of fish oil on uterine secretion of PGF2α in periparturient Holstein cows. Their study found that dietary supplementation with EPA and DHA reduced uterine synthesis of PGF2α. This may result from EPA and DHA displacing arachidonic acid, the precursor for PGF2α, and/or from competition for the enzymes required for PGF2α synthesis. Through the inhibition of PGF2α around the time of maternal recognition of pregnancy (MRP), luteolysis (regression of the corpus luteum) may be halted, increasing the survival rate of small or underdeveloped embryos and thus increasing the reproductive efficiency of dairy cows (Mattos et al, 2004; Childs et al, 2008). However, a trial conducted by Childs et al (2008) found that the aforementioned mechanism was not clear and needs further research. They stated that the role of FAs in improving fertility was probably through increased systemic cholesterol and/or increased corpus luteum size leading to a rise in progesterone. It is worth noting, nevertheless, that this trial used crossbred beef heifers, which did not have the huge physiological burden of lactation on their system.

An increase in plasma concentrations of arachidonic acid (Childs et al, 2008) results in an increase in prostaglandin F (PGF) synthesis. During the early post partum period PGF is important for uterine involution, upregulation of the immune system and neutrophil function. Increasing PGF synthesis would assist normal involution of the uterus, and enhancing immune function would also help prevent uterine infections during the transition period. These effects would significantly reduce the PPI and improve the reproductive efficiency of dairy cows (Roche, 2006).

Trace Elements

It is accepted that minerals or trace elements, such as zinc (Zn), manganese (Mn), copper (Cu) and cobalt (Co), are a vital component of the ruminant diet. These trace elements are important in immune function, lactation and fertility; however, the role they play in fertility is highly debated (Siciliano-Jones et al, 2008). Mulligan et al (2007) state that trace element deficiencies have been associated with retained foetal membranes, abortion and weak calf syndrome. Hostetler et al (2003) also refer to studies where inadequate transfer of trace elements from the dam to the calf has resulted in impaired foetal growth and abnormalities of the central nervous system, skeleton and metabolism. A study conducted by Black et al (2004) found that cows treated with glass boluses of Se and Cu had higher conception rates and service to conception probabilities than cows on other trace element strategies. Siciliano-Jones et al (2008) say that Mn is required for cholesterol synthesis, and cholesterol is in turn the precursor for oestrogen, progesterone and testosterone; Zn deficiencies have been linked to abnormal oestrus, abortion and altered myometrial contractility with prolonged labour. However, the trial conducted by Siciliano et al (2004) concluded that supplementation of the diet with amino acid complexes of Zn, Mn and Cu and Co glucoheptonate did not improve the fertility of the cows in the trial.

Conclusion

Roche (2006) states that poor reproductive efficiency on farms reduces herd profitability, and that this is a universal finding. It does so through a number of factors, namely a prolonged calving interval leading to less milk produced per cow and fewer calves born, increased culling for poor fertility, increased replacement costs, and increased labour, veterinary and semen costs. From the reviewed literature, it is apparent that the most taxing time for the dairy cow is the transition period.
The transition period starts three weeks prior to calving and lasts up to three or more weeks after calving (Grummer, 1995). Therefore, it can be assumed that preparation of the cow for this demanding time begins long before the end of gestation. Table 5, from Roche (2006), summarises the targets that need to be met on farm for a successful and profitable reproductive cycle. This table shows that many of the risk factors affecting the targets for good reproductive efficiency can be controlled through nutrition and management at farm level.

During the last three weeks of pregnancy, it is possible for cows to fall into NEB prior to calving. This can occur in cows with excessive body condition. As it is recommended that a cow’s intake should be maximised during the transition period, ensuring that the cow is in optimum body condition helps to achieve maximum voluntary feed intake during this time (Mulligan et al, 2006). Where cows are on a grass silage diet, only the silage with the highest intake potential should be used (Mulligan et al, 2006). To achieve adequate intake, grass silage should have a high DM and digestible organic matter content and low ammonia nitrogen as a percentage of the total nitrogen content (McDonald et al, 2002). Farmers may unwittingly decrease the DM intake of freshly calved cows by abruptly introducing them to pasture (Mulligan et al, 2007) without adequately preparing the rumen for this diet. If cows are out at pasture, sward height must be kept above 7cm, as heights below this have been shown to compromise DM intake of cows (Mulligan et al, 2006).

As an alternative, cows can be fed a total mixed ration (TMR). According to Mulligan et al (2006), TMRs have been associated with higher DM intakes and less severe NEB nadirs when compared to cows fed at grass. TMRs allow the use of palatable feed ingredients such as molasses, which have been demonstrated to improve NEB in transition cows. Diskin et al (2006) have also stated that DM intakes have been higher in cows on maize-based TMRs than in cows grazing pasture. Santos et al (2007) point out that TMRs with less than 28% neutral detergent fibre (NDF) increase the risk of digestive upsets and acidosis. NDF has been shown to be more effective at maintaining proper rumen function than other non-forage fibre sources; however, diets above 35% NDF would restrict feed intake.
Ensuring that 35 – 41% of the total DM of the ration is composed of non-fibrous carbohydrates will optimise overall energy intake, overall feed intake and production of microbial protein within the rumen (Santos et al, 2007).

Although BCS is a practical method of monitoring body fat reserves of the cow and controlling the NEB to a certain extent, another possible technique to reveal the extent of the NEB is to monitor blood metabolites such as BHBs and NEFAs (Mulligan et al, 2006). The same authors found that the ratio of milk fat : milk protein is much more useful. A ratio of <1.5:1 or 1.3:1 is an indicator of problem cows in early lactation. The development of an on-farm milk testing kit would enhance the farmer’s ability to pick out the cows that require attention and adjust their diet accordingly. However, the development of this technology would be difficult as there is still some contradictory evidence in the research (Mulligan et al, 2006).

Another suggested method of diminishing the NEB experienced by dairy cows is the addition of propylene glycol to the diet (Nielsen et al, 2004). Nielsen et al (2004) conclude that propylene glycol does have beneficial effects on carbohydrate and fat metabolism of cows in early lactation; however, the beneficial effects on fertility are unclear. In a study conducted by Rizos et al (2008), supplementation with propylene glycol had no effect on the reproductive efficiency of lactating dairy cows.
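The ration guidelines attributed to Santos et al (2007) in this section (NDF between 28% and 35%, and non-fibrous carbohydrates at 35 – 41% of total DM) lend themselves to a simple screening check. The sketch below assumes only those two ranges; the function name `ration_flags` and the flag wording are hypothetical, and the check is not a ration formulation method.

```python
def ration_flags(ndf_pct, nfc_pct):
    """Screen a TMR against the NDF and NFC ranges quoted in the text.

    ndf_pct: neutral detergent fibre, as % of ration DM
    nfc_pct: non-fibrous carbohydrates, as % of ration DM
    """
    for value in (ndf_pct, nfc_pct):
        if not 0 <= value <= 100:
            raise ValueError("percentages must lie between 0 and 100")

    flags = []
    if ndf_pct < 28:
        flags.append("NDF below 28%: increased risk of digestive upsets and acidosis")
    elif ndf_pct > 35:
        flags.append("NDF above 35%: feed intake may be restricted")

    if not 35 <= nfc_pct <= 41:
        flags.append("NFC outside 35-41% of total DM")
    return flags


if __name__ == "__main__":
    print(ration_flags(30, 38))   # within both quoted ranges
    print(ration_flags(26, 45))   # low fibre, high non-fibrous carbohydrate
```

The percentages would in practice come from a feed analysis of the mixed ration; the sketch simply makes the quoted thresholds explicit.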


Reproductive process         Target to be achieved   Risk factors affecting targets
Normal uterine involution    Day 50 pp               Dystocia; retained foetal membranes; uterine infection
Resumption of ovulation      90% by day 42           Loss of >0.5 BCS unit; low feed intake; uterine health
High oestrus detection       85% per cycle           Infrequent checks; sub-oestrus; high yields
High conception rate to AI   50% per breeding        Excess BCS loss; prior uterine problems; low P4 days 4-7 of pregnancy

Table 5. Postpartum (pp) reproductive targets to be met in order to obtain high reproductive efficiency and the associated key risk factors affecting these

‘Undoubtedly, the reproductive performance of our dairy cows is limited by their nutritional status’ (Verkerk, 2000). The nutritional management of dairy cows has a huge influence on the reproductive efficiency of the dairy herd. Although many reproductive problems manifest themselves in lactation and at mating time, the root of these problems often lies in the previous dry period or early lactation. Mulligan et al (2007) also say that a healthy transition period, achieved through adequate nutrition, has both direct and indirect effects on the reproductive performance of commercial dairy herds.


ANTHROPOLOGY PANEL

•

Judging Panel
Dr. Abdullahi El-Tom (NUI Maynooth) - Chair
Dr. Steve Coleman (NUI Maynooth)
Dr. Patty Gray (NUI Maynooth)

Judge’s comments
This is an excellent essay that displays considerable scholarship, dedication and imagination. It shows clearly that the candidate is capable of generating original material and exercising sophisticated panache of analysis. The application of analytical tools proposed by Goffman, Kiesling, Duranti and others is indeed very impressive. We do not hesitate to nominate this essay for the award.


Anthropology

Transcription & analysis

Benjamin Larkin

I. TRANSCRIPT

Person A and Person B, brothers, both students, discuss their mutual interest in American football (NFL) and its recent impact overseas in Europe.

Setting: Living room, sitting on couch.

(1)1A: Okay!2 Ehh, first of all..ehh…3 could you tell me, eh, when you first got into NFL, eh American football, and, eh, why?

B: Yes, eh… it was in September of 2006…emm..4 Why? I’m not really too sure; it was ----→5 just on the TV one time and, eh, sat down to watch it, and I just so happened to be supporting the team that I saw first on TV right now, so it was kind of strange the way it --→ worked out.

1 Numbering of interactional exchanges for convenience. Interruptions don’t count as the start of an exchange.
2 Syntactical accent: “Okay!” characterized by and partnered with a brief rise in intonation often to indicate floor presence.
3 Medium pause.
4 Short pause.
5 Rising intonation (gradual).


A: [Carolina Panthers6

B: [And why???...because-- cos, it’s great, and I could go into that for a long time, but, y’know, y’know am just gonna say that. (((((((((((((((((((((((((7

(2)A: [wha-what, like, whad’you think makes.. the American football, as a sport, so superior to… say the likes of Rugby, or--8 or even soccer not in terms of its…emm..*ponders*9… not in terms of its fluidity, or its rules or anything, but but just in terms of its…excitement factor, sort_10 of?

B: Well…the big word in the NFL is parity, and I think you see that every weekend… in ---→ other words, every team has a chance. The system is set up in such a way that, y’know, if you’re the worst team in the league then the next year you have the top[]11 pick in the draft, i.e., you have the best chance of getting the best…eh… coming eh-- the upcoming college prospects, so, eh, that’s a big factor—But also just the action in the game and its, eh, → Although—plays are kinda broken up and some people say “Oh well it’s too slow” and all that, it’s eh, I just think it’s really fair and really, kind of, y’know, the best team usually does win in the end, and everything like that, so…yeah. It’s—it’s a game ----------------→ with a lot more depth as well, it’s—it’s got more systems and there’s a lot more, eh, complications to the game and in my opinion that makes it better, so.._ (((

(4)A: [Yeah I mean, like, compared to soccer…it’s certainly emm….the way they have referees on the lines and stuff, like, like five.

B: [Yeeah, yeah, more officials obviously helps. *laughs*. I agree.

(5)A: [Four or five referees………Now we recently…emm…in the last

6 “Aaa” sustained speed increase.
7 Volume decrease (gradual).
8 False start.
9 Gestural brackets *aaa*, indicating a particular action or gesture not audibly describable.
10 Syntactical sharpness: an abrupt cessation of syntax by the manual closing of the larynx (“Oh_” when in a state of surprise; pre-empting your next statement, for example).
11 The meeting of two syntactically sharpened words, often to illustrate their degree of connectedness or emphasis.


couple days in fact, and ))) →_ ehh… last year, the UK, br-brought over the, ye-, eh, NFL was brought over to the UK… I don’t know…. what do you ------------→ Think_… d-d’you think America is just trying to—sort of—still-- still promote its sort of superiority and say that “We’re America and we—we’re giving you this sport” or do you ^^12 ^^ ^ → think it’s genuine (?) because--- I saw, when I looked up at the NFL logo in the stadium, I saw the NFL logo, the American flag on the left and the British flag on the right, and it just seems very………*purses* emm…it doesn’t seem very genuine because it just seems that the—they’re just sort of—it’s sort of a hand-me-down or something. They’re just—they’re not intending to merge in on any real level.

B: Well *more assertively*, it depends what you mean by merge, but the-- obviously the -→ popularity of the game in the UK is such that they felt that coming over to play a game would be a great, eh, thing to do for them. Obviously it’s all about money in the end.

A: [Yeah.

B: The fans clearly enjoy the game and last weekend’s game was… emm.. reeeally enjoyable so the—ehh—some people are saying they’re gona come back next year for two games, so… y’know… it’s---it’s just---the game is just getting bigger in the UK and Ireland, so expect a lot more games in the future, that’s what I’d say. ((((((((((((((((((((( <<<<<<<<←----

(6)A: Yeah, uhh, I mean, but what I say as—as--, what I see, as you said, is---is that it’s very commercialized sort of thing whereas in soccer—soccer’s been a woorld sport for ------→ hundreds of years---Emm well maybe not hundreds of years maybe a hundred years, maybe a hundred years. But… what I see… is America, being such a commodity-driven, sort of, Federal state that it just---its sports and all of its

12 Sustained high pitch.


excitement, and even the waaay it---it emm… promotes its sports, and the way it does its sports in terms of four quarters. -à à )))))) That’s all to do with the advertising and the money they bring in. So, I-I-I’m just thinking that the “hand-me-down” if I can use that phrase again by—to the UK is really just another means of getting money and they’re saying that they’re gonna bring it back for two games next year---that’s just[]more[]money for them. B: [Yeah but keep in mind in soccer there’s no limit on the money you can spend whereas in the NFL there’s a salary cap so no matter how much money you get… from that, I mean it’s only gonna help the guys in suits in New York in the NFL offices there, but, y’know, that’sss *smirks*….nothing to do with the game---th-the bottom line is: as long as the salary cap’s in place, teams won’t be able to… y’know… kind of, overspend or anything like they do in soccer where a few teams dominate every year. <<<<<<<ß------v13 (7)A: Yeah, the-yeah, tha-that’s—it’s a lot fai-- B: [Going back to that parity word again.] (((((((((((((((((((( (8)A: Mmm. It’s a lot fairer in terms of, eh, the salary cap. That was one of the things that I originally, emm, thought……was the most fair aspect of American football---And of (B:

[Yep )

course the refereeing which, I mentioned already………… which is extremely… good_. ((((( (( (((((((((
B: ([Yes ) [Totally agree.
(9)A: Emm… so_ Do you ever intend to eh…. pursue a maybe eh commentary car-er ^^ career commentary or something along those lines for s-s-some sports?
B: Emm… I don’t know how I’d go about doing that, but I don’t know really if

13 Instantaneous drop in pitch, often after a gradual drop, where the last syllable indicates that the speaker is finished.



I’d be… ^^^^^^^^^^^ good at[]that kind of thing…I might be good at, maybe articles in newspapers and stuff ß----------------v like that or whatever A: [Yeah. ^^^^^ B: But… ehh… I don’ know—it’s—you’d have to move ‘n’ everything and it would be a very awkward thing to do, but, you never know! ^^^^^^^^^^ (10)A: Yeah well sure-B: A: B:

[laughs] like— [See what the future holds] *optimistically*

A: [Yeah weh-well sure in-in Maynooth emm… I’m starting to contribute to the Advocate our broadsheet newspaper soo… that’s a good i—that’s a good thing for practice— B-but I think you are good at -----------------à ))) )))))) kind of the…… the particular.. way of.. talking in-in your podcasts and stuff. That’s th-B: a podcast reference, Ben??? *laughs* -----------------------------à

[Oh_ Is that

(11)A: Yes, it is
B: [******14 I have a podcast… Aaanyway, continue! ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘ 15
(12)A: Ehh…*exhales*…okay *smirking*…and John is good as well, but I think John ^^^^

14 Inaudible utterance, its degree of inaudibility characterized by overlap.
15 Sustained low volume.



))))) is—is More emm… he—he puts it on more—y-y-you sound more natural because you just… y-you don’t sound any different from the way you actually talk when you’re[] doing your podcast.
B: [Yes… ‘‘‘‘‘

(13)A: You d-don’t really sound any different at all. You sound the exact[] same, whereas John—sort of-- … becomes a bit more boisterous and “louds it up” as it were.
B: [Mmm…he gets a bit hyper at times

(14)A: [Yeah—he_—yeah he gets a bit hyper an’_speaks really fast… but emm… yeah. Where do you see that podcast going, actually? ^^^^^^^^^^
B: Welll…. there’s always gon[]na be limitations because… y’know the fact that we don’t have full-time jobs or anything means that we can’t..sign up to these things online where you have your own kind of_domain name and everything, y-y-your kind of__you have this, a kind of a money-based thing where you have your own website—obviously that would mean it would get more—downloads and everything but… we just don’t have the resources at the moment… But for the moment it’s just kind of a fun thing to do -----------------------------------------------------→vvvvv And, you know, that’s what we’ll do for the—y’know--the upcoming..ehh… months and so on—w-we’ll see if we can[] ad-advance it all—I-I think we can, but[]that’d be great. ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘
A: Cos’ your downloads have been increasing B: A: tially B:

[Yeah

[exponentially *laughs **in mockery* exponen[well not really exponentially, but—yeah they’re very solid now,


there’s about eighty a week or something which is..pretty good.
A: [Yeah that’s pretty good

B: Yes…humble beginnings..to…y’know…hopefully a successful future. ((((( ((((((((((((((((((((((((((((((( (15)A: Ye—and of course meeting the-the “Rants’n’Raves” guys has given you a certain (B:

[Yeeeeaah_)

degree of… inspiration— B: [It suuuure diiid. I was very happy with that. Eh-obviously I ------------à_ Didn’t..emm..kind of [] mention it in the podcast or anything, but[]..yeah… they have a very good podcast themselves which has been going for about four years now, so..yeah… I (((((((((((((( could probably e-mail them of something…and ask them how they do it_. ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘ ‘’ ‘ ‘ ‘ ‘ ‘ ‘’’’’’’’’ ‘ ‘’ ‘’ ‘ ‘ ‘ ‘ ((((((((((((((((((((((((((((^^ (16)A: [Excellent. *laughs* And Nick Halling..ehh… seeing him at breakfast, y’know—not the biggest deal (B:

[Yeeeaah_)

But (B: [Yeah! That was pretty good too—an-an seeming as how he was on TV last night ß------------à_ And I couldn’t really believe the fact that “Oohh I just met that guy” yeah—it was kinda ^^^^^ weird… yeah… out of all the hotels he stayed in our one *smirks*… just how it turns out sometimes. (17)A:

[Well it was the closest hotel to the stadium


B: Mmm… yeah it was pretty close, yeah, but, I mean—there’s a lot of hotels around so ---→ (A: [Mm) For him to stay in that particular one, was… and it’s raining…very very badly now_. ))))))))))))))))))))))) vv ((((((((((
(18)A: Yeaaah -----→v
B: [It’s always like this… ‘‘‘^ ^’ ‘ ‘
A: Not good ball weather… )))))))))) ((((
B: Noo….I’m afraid not ←--v ‘‘‘‘‘‘‘
A: [Not good ball season, full-stop_*laughs*……
B: Yeah.. ‘’’’’’’’16
(19)A: It was like this last year as well ^^^^ ^^^ ^^^
B: [Yeeah… V
(20)A: We jus-we just had a downer cos’ we couldn’t play ball.
B: Yeah..well you know_—it’s that time of the year ^^^^ ----→ ‘‘‘‘‘‘‘‘‘‘‘‘‘‘‘
(21)A: Right, well, I think we’ll leave it there…and emm…y’know )))))))))))) ----→
B: *sarcastically* Oh, no! Oh, okay. vvvv
(22)A: [Thank you very much
B: [Well I have to eat something now, anyway, so…
A: [*Laughs awkwardly* A-alright. Okay! Ehh
B: [You’re welcome

16 Sustained low volume (barely audible)


(23)A: Yes, goodbye.
B: *satirically* [Byeee ^^^^^^ ------→ )))))

II. CONVERSATION ANALYSIS

Rather than forge an extensive, sequential, episodic list of the indexically rich “happenings” of this conversation, I will treat the form of talk in question here (the interview) as primary referent, and allude to several key instances when this form is disrupted. Analysis in this way ensures that the context of the interaction itself qua interview, as a social situation, a sum of cues, social capacities and stances and so on, is the point of departure, rather than the first instance at which, in a general sense, something “happens.”

Viewed panoptically, the conversation maintains coherence and intersubjectivity at most points (see Duranti 1997: 255). The rules for turn-taking which apply to the interview are maintained, the register is generally formal on my part (albeit with a voluminous presence of discourse markers), and the subject matter is sequential. Prosodically speaking, however, there is a gradual fluctuation, which is epitomized at the very end of the interaction. There are also periodic lapses in the interview response parameters (i.e. the question-answer format) by the interviewer himself, examples of which will be given below. As a result, the footing fluctuates constantly between interview and informal conversation. Arguably, the interview itself is being continually reformulated into something more proper to the participants’ own coordinative abilities towards one another (Gumperz 1983: 141).

The very first exchange “(1)” between the two participants is worth brief comment. First of all, there are no ritual brackets used between the participants (Goffman 1983: 130). I immediately attempt to take the floor by the use of “Okay!”, a colloquially recognized expression. The pragmatic effectiveness of this single-worded utterance thus establishes my stance in relation to the interviewee (Briggs 1984: 21).
An uneasiness of demeanour can be observed in the interviewee due to the above omission. Prosody generally hovers below habitually confident levels, and there is an overuse of discourse markers (Kiesling) such as “emm…” and “y’know.” Still, the social capacity of the interviewed subject is recognized instantaneously, as evidenced by the first few responses: formalized, to-the-point replies. There are certain points in the early stages, however, when the interviewer attempts to elicit a reply; these attempts can themselves be considered adjacency pairs and indexical signs based on norms for response (Duranti 1997: 250):


B: “and in my opinion that makes it better, so… ←--
A: [Yeah I mean, like, compared to soccer [emphasis added]…it’s certainly emm… the way they have referees on the line and stuff, like, like five.”

In this particular instance, judging from my own overuse of discourse markers and stuttering, I am actively picking up on the fading intonation, and switching register and hence topic, with the overall intention of maintaining coherence. This exemplifies Duranti’s “self-selection” in turn-taking, where the next speaker selects himself (Duranti 1997: 249). In this case, I use the subject matter of soccer as common ground through which to re-engage the other participant as interview subject.

In exchange (9), for instance, an elongated transition-relevant point is created by continual pauses, indicating my incomplete knowledge of the subject matter and my resorting to conjectural statements (Duranti 1997: 249). An unwelcome opportunity is provided for the interview subject to take control. Prosodic dips and syntactical sharpness pave the way for the discussion’s conclusion, and the first major shift in footing. Once the subject of soccer has served its reparative purpose, the conversation reverts to the question-answer style.

A: “Emm…so_! Do you ever intend to eh…pursue a maybe eh commentary carer-career commentary or something along those lines for s-s-some sports?”

The difference between the first ten or so interactions and this one is that my prepared subject matter has been exhausted, and hence I am less confident. My tone becomes heightened to signify formality of register, and I stumble often with discourse markers. Broadly speaking, the footing changes overtly from informal conversation, with the back-and-forth style it permits, to the preferred interview format (Goffman 1983: 128).
The second “part” of the conversation thus begins when a re-acceptance of this form is established, only for it to be subverted once more. Between interactions (9) and (11), the subject responds as interview subject once again, until perhaps the most indexically rich exchange in the conversation:

A: [Yeah weh-well sure in-in Maynooth emm… I’m starting to contribute to the Advocate our broadsheet newspaper soo… that’s a good i—that’s a good thing for practice—B-but I think you are good at -----------------→ ))) )))))) kind of the……the particular.. way of.. talking in-in your podcasts and stuff. That’s th--
B: [Oh_ Is that a podcast reference, Ben??? *laughs* -----------------------------→


(11)A: Yes, it is
B: [****** I have a podcast… Aaanyway, continue! ‘‘‘‘‘‘‘‘‘‘‘

By mentioning something analogous to his commentary aspirations, I am steering away from the preferred topic. At the same time, however, a hesitance on my part can be seen in the transcript, as indicated by discourse markers: the topic arena I am now obliged to shift towards is one in which personal success and confidence are embodied for the person being questioned. When I mention the podcast, my brother latches onto the opportunity to make his pride known and, with accompanying rises in prosody and syntactical accents, interrupts the prior flow of the conversation to adopt a social voice of inquisition more properly connected with sports commentary. He was, in a sense, trying to ratify outside participants to respond (Goffman 1983). Moreover, since his question is the first-pair part of an adjacency pair and an utterance of pointed indirectness, my impetus is to respond, but in a subdued fashion (11), after which the conversation immediately reverts to its previous footing (Duranti 1997: 250; 301). As the supposed primary influence on the conversation, I refuse to reveal any scintilla of an index which might prompt a further focusing of the topical spotlight away from London.

The final part of the conversation is curious because the original subject matter of London is reintroduced entirely as a marker of register. The context of the shared experience was of course recreational, and by utilizing this imagined contextual space, the interviewer is demoting himself to a position on this interactional floor beneath that of the interviewee, as the primary bearer of knowledge:

(16)A: [Excellent. *laughs* And Nick Halling..ehh… seeing him at breakfast, y’know—not the biggest deal (B: [Yeeeaah_) But
(B: [Yeah! That was pretty good too—an-an seeming as how he was on TV last night ←------------→_ And I couldn’t really believe the fact that “Oohh I just met that guy” yeah—it was kinda ^^^^^ weird… yeah… out of all the hotels he stayed in our one *smirks*… just how it turns out sometimes.

In this instance, I was essentially attempting to create a footing based on familial solidarity, emblematized by my envious attitude towards his encounter with a celebrity. Put more simply, I was prompting a recreation and subsequent sharing of the experience.


The most obvious shift in footing which appears is the sudden switch in subject from the nostalgic (experiences in London) to the present context (the rain outside). The allusion to the rain can be taken as an indexical sign, especially when combined with the dipping in volume and clarity of phonetics, for my introduction of pre-closings and thus for my conclusion of the conversation. This eventually does arrive, after a blatant “winding down” in speed, pitch, and volume between the two participants, with the collective intention of breaking off the conversation. This was not done smoothly, though. The phatic function of language permeates lightly, in an attempt to drag out the conversation, but ultimately settles with the use of pre-closings:

(21)A: Right, well, I think we’ll leave it there…and emm…y’know )))))))))))) ----→
B: *sarcastically* Oh, no! Oh, okay. vvvv
(22)A: [Thank you very much
B: [Well I have to eat something now, anyway, so…
A: [*Laughs awkwardly* A-alright. Okay! Ehh
B: [You’re welcome
(23)A: Yes, goodbye.
B: *satirically* [Byeee ^^^^^^ ------→ )))))

The satirical closing used by the interviewee is both a first and second order index (Kiesling 2004): a first in the sense that it completes the adjacency pair with a rising, satirical intonation and a volume contrastingly louder than that of my first-pair part; a second as it establishes in a single syllable the broader, brotherly social relationship between the participants, and how strongly it is built in dialectic relation to that which the interview attempted to impose.

Macro-analysis of the interaction thus reveals that the changes in footing, openly displayed and indexed by our attempts to maintain a solidarity through coherence, can only be made by keeping “the interview” as a perennial and loose referent. Through our attempts to import discourse markers and vernacular variants (Kiesling 2004: 297), and indeed the norms for one-to-one talk shared individually between us (centred in our familial tie), the structured nature of the interview in fact deconstructs itself. Furthermore, mounted on top of the habitual context of discussion of the subject matter at hand, a formalized system of response (whether prosodic, syntactical or morphological), as enforced periodically by myself, inevitably results in an interactional faux pas, as the very constituents of the frame for interaction are detrimental to that on which this frame relies to maintain its coherence: the topic of American football.



ARCHAEOLOGY PANEL

Judging Panel
Dr. Colin Rynne (University College Cork) – Chair
Dr. Carleton Jones (NUI Galway)
Dr. Tomas Ó Carragáin (University College Cork)

Judges’ comments

For the adjudication committee, there was only one clear winner of the best archaeological essay/dissertation award, Russell Ó Riagáin’s The rectilinear houses of the Irish early neolithic: the introduction of new identities, ideologies and economies. The excellence of this entry far exceeded that of the others, although some of the latter, in our view, were of considerable merit. The committee also notes that this essay was completed as part of course work for an undergraduate degree at University College Galway, and we request that this association be properly credited in the final award.

This dissertation is an outstanding overview of the key issues relating to the origins and development of this early neolithic house type, in terms not only of their form, function and fabric, but also of the meanings they embodied for their builders. The author argues, convincingly, that they were strongly linked to the introduction of agriculture into Ireland, and to the new ideologies, social forms and material culture associated with this development. Further linkages to the appearance of wheat and to their apparent physical relationship to both court and portal tombs are also adeptly explored. Indeed, throughout this essay there is such an analytical depth and sureness of touch with the material that the non-specialist can easily follow even the more complex arguments.



ARCHAEOLOGY

The rectilinear houses of the Irish Early Neolithic: The introduction of new identities, ideologies & economies

Russell Ó Ríagáin

I. ABSTRACT

The appearance of the rectilinear house form in the Irish Neolithic landscape was both unprecedented and short lived. Their appearance coincided with the definite appearance of arable agriculture on the island. Their disappearance also coincided with the disappearance of this form of agriculture. The island-wide similarity in their form and probable functions is remarkable. They served as both functional and symbolic focal points in the landscape for their inhabitants, and they had a major influence on burial practice. Evidence for both deliberate deposition and deliberate destruction has been found at a number of sites, indicating their ritual significance to contemporaries. They also had an important role in the socialisation of successive generations of agriculturalists.

II. INTRODUCTION

Much ink has been spilled assigning various symbolic roles to larger scale Neolithic monuments. Surely if contemporaries manipulated, and in turn interpreted, a vast repertoire of symbols in connection with various forms of burial, they must also have assigned symbolic value to the rest of their environment. The mental separation of domestic and ritual has only really been made in these post-Enlightenment times. Even today, as Bradley points out, rituals and symbols permeate everyday life (2005, 3).

This study will examine the symbolism and evidence for ritual inherent in Irish rectilinear houses. It is possible from a close consideration of the evidence to see common cognitive processes at work, while at the same time using the differences in cognition between sites to gain a point of entry into the Neolithic mind. The form, dating, function and origin of the houses will be examined in this light, before discussing their symbolism and how contemporaries might have perceived them. Following this, possible ritual activity, such as deposition, deliberate destruction and the link to burial rites, will be discussed. The evidence from the houses will be used to help illuminate the circumstances surrounding the advent of agriculture in Ireland. The relation of the houses to their occupants and to other monuments will also be examined. Their disappearance from the record will also be examined, and the theory put forward that it was due to a change in subsistence patterns brought about by ecological factors, which had a subsequent effect on the ideology of contemporaries. The structures at Lough Gur will not be included in the classification under discussion, as they are of later date and a differing form (Grogan 2004, 106-7; Smyth 2006, 233-4).

III. LITERATURE REVIEW

The great majority of writing on the subject has been in the form of the published reports of a number of the sites. These take on a descriptive format, and are usually limited by their being chapters in edited books such as Armit et al. (2003), Thomas & Darvill (1996), or in the journals. They are usually low on interpretation, with only 10-20% of the text, if any, being devoted to this. Grogan’s three articles (1996, 2002, 2004) all outline the various Neolithic house forms in depth. While it is possible to dismiss them as “routinely cultural historical” (Gibson 1998, 359), they are nonetheless important syntheses of the evidence. Cooney & Grogan (1994) offer more interpretation on the houses, but only briefly. Cooney (2000) offers far more interpretation. In the first three chapters he provides a very useful analysis of the Neolithic landscape in Ireland and the role of the house within it, making use of a variety of interpretive approaches. Thomas (1996) and Cross (2001, 2003) provide the dissenting voices. Both question the designation of these structures as houses at all, offering varying alternatives. McSparron’s (2003a) article offers a revision of the dating methods and new dates for a number of sites, and as such is one of the most important articles published regarding the Irish early Neolithic. Smyth (2006) focuses on the rectilinear form, and builds upon the interpretive approach of Cooney, while providing an up-to-date synthesis of both the house details and revised dating. The article also focuses on the intentional use of fire, and on other symbolic aspects of the houses. Bradley (2007) discusses both the Irish and British evidence, and focuses on the origins and obsolescence of the houses, their symbolic destruction, their role in the community and their connection to tomb forms.


IV. FORM

The majority of these rectilinear structures employed posts and planks in their construction, with some use of wattle walling also, and use of slot trench foundations for the planking (Grogan 2004, 106; Smyth 2006, 237-8). Most of the structures had four external walls, but some houses possibly had apsidal ends, such as at Ballygalley 1, Ballyharry 1.2b and Knowth 2 (Grogan 2004, 107) or Tankardstown 2 and Ballyglass (Gowen & Halpin 1992, 25). The houses have been found both in isolation and in small groups of two or three (Grogan 2004, 109; Smyth 2006, 234). They have often been found in association with other evidence of activity, such as lithic scatters, pits or more ephemeral structures (Grogan 1996, 51). There seems to have been a marked preference for house location on sheltered south to west facing slopes, frequently overlooking bodies of water (Grogan 1996, 57). Cloghers overlooks the valley of the River Lee (Dunne & Kiely 1999, 14), and Drummenny overlooks the river valley of the same name (Dunne 2003, 165). Internal divisions could have been either wattle screens (Hughes 2004, 29), or more substantial walls that were continuations of the foundation trench, such as at Corbally 1 and 2 (Purcell 1999, 15). There might be a good deal to learn from the nature of these internal divisions, as they may be taken as representing functional, social and symbolic divisions. No direct evidence for roofing techniques or materials has been recovered (Grogan 1996, 49). Internal joists are suggested by the arrangements of posts and slot trenches at Ballygalley, Ballyglass, Ballynagilly and Tankardstown (Grogan 1996, 49). The roofs themselves could have been in the order of three to five metres or more in height (Grogan 1996, 55). Grogan states that “despite the range of sizes and variations in structural details, the houses form a homogenous group” (2004, 106).
Smyth suggests that there was a common template or ideal form in use, based on inter-site similarities (2006, 242). If there was such a commonly held template, it could have also meant that there were other shared ideologies. However, a degree of heterogeneity is also apparent in the form of these structures, which might also have been mirrored in differing functions and symbolic messages. There is considerable variation in shape, with some houses having one or more curved walls, others almost square in form (Grogan 2004, 107; Smyth 2006, 234). Their form can be subdivided into a number of subcategories. Grogan (2004, 105-6) divides them into the following groups, and while this is a little arbitrary it is also useful for the analysis of the structures:

House Type         Group A      Group B1      Group B2      Group C      Group D
Morphology         Square       Rectangular   Rectangular   2 rooms      3 rooms
No. Sites          5            10            7             11           4
Size range m²      26.9-36.0    29.3-85.5     21-26.7       35.0-73.0    42-87.6
Average size m²    31.7         45.9          24.6          59.5         64.9

Table 1: Range of house sizes

The orientation of the houses is also important. The overwhelming majority of the houses have their long axis aligned within 45° either side of east-west (Topping 1996, 160-162; Smyth 2006, 237). This conforms to pan-European patterns apparent in similar building forms (Topping 1996, 162). This has a functional explanation, as having the long wall of the house facing roughly south would maximise solar radiation and heat absorption by the roof (Topping 1996, 161-2). However, there are deviations from this norm apparent in the Irish evidence, such as at Ballyglass (Topping 1996, 162; Smyth 2006, 237; Ó Nualláin 1972, 54). These may be explained as being due to a number of factors, such as topographical, social, locational or symbolic factors.

Doorway location is also responsive to factors other than environmental (Topping 1996, 162-3). Doorways may or may not be positioned to admit light into the structure; Ballyglass has evidence for a possible screen inside the door at the north-west corner (Topping 1996, 163; Ó Nualláin 1972, 54). Thresholds have important symbolic connotations. Their positioning may be due to some ideological factor, such as the position of particular features in the landscape, or possibly even the positions of stellar features. However, their location might also have been chosen for more functional reasons such as the position of other structures in the immediate area. They may also be orientated towards routeways. The houses at Enagh and Thornhill both have their entrances pointing towards the River Foyle, which at the time would have been an important routeway in a forested landscape (McSparron 2003b, 12). The river itself may have been assigned sacred properties by these people; there are certainly many examples of rivers being designated as deities in cultures right across time and space. This may have provided further reason for the orientation of their entrances towards the Foyle.

The evidence so far points to the exclusive use of oak for the structural elements of the houses. The choice of oak may be a functional one: oak timber splits well (Cooney 2000, 58), and grows to quite large proportions. It also weathers quite well, becoming harder with exposure to the elements. However, there may be some symbolic significance in the choice of oak, as in many cultures the oak tree holds a sacred role. Its longevity may have been symbolically utilised by the builders, who may have sought to convey a message of their legitimacy in the landscape through its use, and also possibly a message of their own intended longevity.
Interestingly, at Drummenny Lower oak pollen was absent from the suite of arboreal taxa identified at the site, which suggests that the oak used had not grown locally (Dunne 2003, 168). This could have meant that oak was sourced from beyond the immediate hinterland of the site in order to conform to a possible common template.

Further evidence for possible conformity to a common template comes from the examination of the foundation trenches of a number of these sites. The foundation trenches at sites such as Ballyharry, Cloghers, Haggardstown and Kishoge were cut into the underlying bedrock (Smyth 2006, 242). This represented considerable extra labour, and can be taken to indicate adherence to a particular pattern, or the importance of the site’s exact location. At Ballyharry, the foundation ditches of the various phases were cut into basalt (Moore 2003, 156), which would have required a considerable investment of labour due to the nature of the rock (Callan, J. 2008, pers. comm.). In phase 2, round and possibly water-rolled stones were used for packing the foundation trenches, rather than the debris from the initial cutting of the trench (Moore 2003, 156). This shows that while adhering to a possible template, the builders also were incorporating a more localised ideology in their choice of rounded packing stones, which illustrates the need to examine each of these sites individually as well as on more general terms, as Smyth suggests (2006, 244). The hewing of foundation trenches as one continuous unit also occurs at a number of sites, such as at Corbally, Gortaroe and Mullaghbuoy (Smyth 2006, 242). Interestingly, these trenches were then backfilled in the places where planks were not to be erected, such as at the entrance (Smyth 2006, 242). This might signify a symbolic importance attached to the enclosing aspect of the trench.

V. RECTILINEAR HOUSES AND THE INTRODUCTION OF AGRICULTURE

Grogan assigns a timeframe of c. 4050-3850 BC to the houses on the basis of “the combined radiocarbon evidence” (2004, 111). However, a large proportion of the dates used in Grogan’s calculation are from structural timbers. Dates from structural timbers can be somewhat misleading (McSparron 2003b, 11-2; Smyth 2006, 238-9), due to the fact that oak trees live for quite a long time, and the timbers may also have been weathered for a number of years before usage. Their possible reuse is also a factor. It is obvious from this evidence that the use of structural timbers as an absolute dating method for the houses leads to a skewed chronology. The use of samples from short-lived species, such as hazel or charred grain, provides a far more accurate picture (McSparron 2003b, 11). Using short-lived species samples, obtained from 9 houses on 5 sites, McSparron established a date range of 3800-3520 BC for 95% of the samples (2003b, 11). He was further able to assign a date range of 3730-3630 BC for 60.7% of the samples (ibid). Smyth has further expanded upon this, assigning 21 houses to the years c.3800-3520 cal. BC (2006, 238). However, it should be noted that, as Smyth points out, it is difficult to tell if the dated material relates to the period of construction, use or abandonment of these buildings (2006, 238).

These dates should lead to a rethinking on the introduction of the Neolithic to Ireland. If we are to assign a date of c.4000 BC or before to the introduction of agriculture (Cooney & Grogan 1994, 29-33), then it would mean that the houses developed long after the introduction of farming. However, there is no evidence of large scale agricultural activity until c.3800 BC (ibid, 32), which synchronises quite well with the updated chronology of rectilinear houses. Indeed, it is the charred Triticum at Tankardstown that provides this date (Cooney 2000, 40; Gowen 1987, 9). Up until that point, the evidence is fragmentary. The appearance of cereal-type pollen in early post-glacial settlements may be accounted for by genetic mutation of wild grasses (O’Connell 1987 as quoted by Cooney 2000, 39). The elm decline of c.3800 BC causes problems in identifying the level of intentional forest clearance. The decline in the levels of elm pollen in the record may have occurred due to non-human factors (Cooney & Grogan 1994, 29). Pollen cores, such as those from Lough Sheeauns, Co. Galway and Beaghmore, Co. Tyrone, record an increase in wheat-type pollen in these years also (Cooney 2000, 39-40; Waddell 2000, 27-8).

The appearance of the rectilinear house at the same time as the appearance of wheat pollen in more substantial numbers, and the fact that charred grain or cereal-processing waste has been found on at least 12 sites (Smyth 2006, 240), point to the two being strongly linked. This essay argues that they were in fact two aspects of the same package, which arrived from outside Ireland with some form of population movement of scale as yet unknown. Agriculturalists arriving onto the island would have already built up lactose and gluten tolerance,1 which may have taken native populations a number of generations. Once these natives adopted the agricultural ‘package’ there would have been little to distinguish them from those who had introduced agriculture. In the future, perhaps DNA analysis might be used to provide some evidence to clarify this. The dating of their disappearance will be discussed under the heading ‘obsolescence’.

1  In this I am indebted to Prof. Peter Woodman (October 2007) for bringing to my attention the fact that higher levels of lactose intolerance and Coeliac disease occur in those areas of Europe into which agriculture was introduced latest.

VI. FUNCTION

There are a number of differing opinions as to the functions of these structures. Thomas (1996) doubts their designation as houses at all, as does Cross (2003). Grogan (1996, 2002, 2004), Cooney (2000), Smyth (2006) and Bradley (2007) all designate these structures as houses. None of the published descriptions of the excavations of these structures doubts that they are houses. However, the exact functions of each house within this framework may vary (Smyth 2006, 244). There is also no reason to preclude multifunctionality, but their primary function remains domestic. Against Cross’ (2003) argument that they should not be designated as domestic structures, it can be said that houses can also be used on occasion as meeting halls, feasting halls, workspaces and for a variety of other functions and still remain, in essence, houses. Multifunctionality does not preclude domesticity.

Thomas (1996, 2) states that the structures are atypical, and that the Neolithic in Britain and Ireland was characterised by transience (1996, 2, 4). While he does state that there was a complexity and heterogeneity in Neolithic settlement pattern (1996, 2), the article is an attempt to use a transient model to explain Neolithic settlement in both Britain and Ireland. He attempts to portray the rectilinear houses on both islands as atypical, and goes on to state that generalising using Céide as a model was also difficult, due to its uniqueness and doubtful date (1996, 4). Thomas also assigns storage roles for the structures at Balbridie, Scotland and Ballygalley (1996, 9-10).

The weight of evidence alone discounts Thomas’ model for Ireland in the early Neolithic, especially in light of the numerous discoveries in recent years. The most recent figure for the number of rectilinear Neolithic structures in Ireland stands at approximately 70 (Smyth 2006, 234). Smyth also points out that this figure is likely to continue to grow in the coming years (ibid). Thomas’ assertion that they were atypical hardly holds true on this evidence, at least for the period that these rectilinear structures appear in the archaeological record. However, when found in clusters, some of the buildings, such as the smaller ones, may not actually have functioned as houses. They might have had roles as stores, smokehouses, or a range of other functions. However, clustering need not imply contemporaneity. The direct dating relationship between houses in many house clusters has not been established, and there are some sites such as Corbally where there is evidence for successive houses (Purcell 1999, 2002). Grogan outlines the evidence for other contemporary settlement forms of a more ephemeral nature (1996, 51). This evidence is scant, but these sites may represent specialised activity (Grogan 2002, 523), rather than evidence of some form of hierarchy, or of a level of settlement heterogeneity à la Thomas et al.

Some of the houses had sloping floors, such as that at Drummenny Lower (Dunne 2003, 170; Cross 2003, 200). This would not have made for comfortable sleeping arrangements if the buildings were domestic. However, these may have been exceptional buildings; the house at Drummenny seems to have had a very short life-span and may not have even been a domestic building in the strictest sense (Dunne 2003, 170). While there are dangers of projecting any form of modern conceptualisation back in time (Thomas 1996, 1-2; Jenkins, K.
1991 et al), the house remains a useful concept for the analysis of Neolithic social structures and settlement. It would seem from the number of artefacts associated with domesticity found in relation to the structures in Ireland that they can be deemed houses (Smyth 2006, 240-2). Cooney points out that, on the basis of the ethnographic record, the house can be shown to be a very cross cultural idea, but that the concept of home varies (2000, 52). In this he is correct: the term home is extremely subjective, relating to a person’s interaction, on an emotional and conceptual level, with an undefined structure. The term house refers more to a structural form within which some form of habitation is made.

No clear correlation can be made between house floor area and the number of occupants (Cooney & Grogan 1994, 47). Cooney and Grogan assign a rough estimate of 5-10 individuals, before stating that this implied that the family was the basic social unit (1994, 48). They do, however, concede that factors such as social status and function also had an effect on house size (ibid). Grogan goes one step further and assigns 4 m² to each person in these dwellings (1996, 57; 2004, 106). This can be criticised on many different levels. It is far too positivistic. It does not take into account that houses might have been used in different ways by different groups. For one, there may have been a division of the sexes between houses, or the removal of liminal figures such as adolescent males. Moreover, this might have occurred only at certain times of the year. Some may have been occupied by people of more elevated social status than others, meaning they may have had bigger dwellings, or possibly smaller ones by virtue of their non-performance of certain activities.

Lévi-Strauss in his 1969 book The Raw and the Cooked spoke of the transformational value of fire. Fire is a symbol with many different meanings, and it has an important role in structuring daily routine (Cooney 2000, 61). Richards asserts that the hearth was the centre of people’s lives, a reference point from which they took their position and orientation (1990, 116). He also argued that it symbolised the unity and well being of the family (ibid). Cooney also points out that, in Neolithic houses, fire, as embodied in the hearth, had both an important functional and symbolic role (2000, 61). It was a central feature that people would have had to move around, cook on, and obtain heat and light from (ibid). Cooney also illustrates that there were a number of dichotomies involved, due to its capacity as a focus to divide the house or room into central and peripheral areas, warmer and cooler, brighter and darker (2000, 61). Cross disputes the multi-functional role assigned by Cooney to the hearths in rectilinear houses (2003, 199). She points out that “these fires do not seem substantial enough to have played so many roles, the lack of elaboration and the overall size and the shallow penetration of oxidisation do not point to a central focus for the daily lives of a family group” (ibid).
This is indeed a valid observation, and when taken in conjunction with ethnographic evidence it might be taken to mean that the fires were lit for light, heat and secondary cooking in linear feasting halls (Cross 2003, 199). However, the houses might also have made use of a smaller internal fire and a larger external cooking fire, so as to limit the danger that a large fire would pose to their inflammable structure.

VII. ORIGIN

The form was not unknown in the contemporary British early Neolithic. There has been a long tradition of attempting to link the form to the continent also, both in Britain (practically everything written prior to the 1980s) and in Ireland (ApSimon 1969, 168; Ó Nualláin 1972, 56). There is a remarkable variance in the numbers of houses known from Ireland and Britain, and even more so between Ireland and lowland England (Bradley 2007, 38). Bradley points out that this can hardly have been due to differential survival (ibid), and states that the “contrast is a real one and has to be explained” (ibid, 39). However, the contrast may not have been as sharp as it appears today, as while both islands have had long traditions of cultivation (Bradley 2007, 38), England has had a far stronger tradition of arable farming in later years, which would have led to a more severe truncation of the evidence. Also, far more landscape alterations have taken place in lowland England than anywhere else on the two islands. When this is taken in conjunction with the traditionally higher population density in England it can be seen that there is less chance for survival of this house form.

On the continent, the relative stability of the Linearbandkeramik (LBK) period had come to an end almost a millennium before the beginning of the insular Neolithic (Last 1996, 27). By the end of the fifth millennium BC there had been a shift towards the use of small, non-rectilinear houses, rather than the use of the large longhouses of the LBK period (Last 1996, 27-30). Bradley points out that the rectilinear houses of the insular Neolithic(s) were much smaller in form, with less evidence for organisation into clusters (2007, 40). He does concede, however, that the latter pattern may be changing due to recent fieldwork evidence (ibid). Topping points out there was a similarity in form to be found in a series of structures around the northwestern periphery (1996, 158-9). However, he does this without providing any dating evidence, just drawing similarities on the grounds of morphology.

It is most probable that a particular agriculturalist package spread into Atlantic and northern Europe. The fifth millennium BC was a time of flux, and it is quite possible that there were population movements occurring in north-west Europe. It is difficult to ascertain whether or not there were large scale movements of population into Britain and Ireland. The wild ancestors of wheat and barley, and of cattle and sheep did not exist in postglacial Ireland (Waddell 2000, 25). This implies human agency in their introduction, via immigration. With these people came new social and ideological forms, new subsistence methods and attendant material culture. Their difference from the previous norm on the island certainly would have set them apart in the eyes of contemporaries. Pottery, cereal production, animal husbandry and rectilinear houses all occupied a central position in this new framework.
New tools associated with agricultural practice, such as the ard and saddle querns also appeared at this time (Waddell 2000, 29). However, a degree of continuity in some lithic forms may also be seen in the record at some house sites. At Enagh, McSparron points out that the flint blade found there displayed Mesolithic attributes (McSparron 2003a, 170). The assemblage at Drummenny Lower also might indicate a level of continuity in form and use (Dunne 2003, 170). It is highly probable that these farmers introduced monumental tomb construction to Ireland also. This draws an interesting contrast between the longevity of their oaken houses and the permanence of their tombs (Cooney 2000, 58). The early dates found at Carrowmore and Croghaun Mountain, Co. Sligo may indicate tomb construction prior to the advent of Irish agriculture (Waddell 2000, 26). However, in this author’s opinion this region might be a special case, as it is quite possible that there was a stable population in the area exploiting the abundant natural resources of the two estuaries and their hinterlands. Also, the Carrowmore dates might have been derived from activity prior to the monumental phase there (Waddell 2000, 26). The Croghaun dates might also be distorted by old wood effect.

VIII. SYMBOLISM

The houses are certainly metonymic. To contemporaries they would have been seen as representing the entire agricultural system. They can be seen as a statement of intent by the agriculturalists, as an assertion of their identity, both by the possible colonists and by the indigenous people who adopted the system also. Their angular form thrust skyward can be taken as representative of the humans’ new found mastery over their environment. With the introduction of this form came the introduction of the concept of angularity into the minds of contemporaries. This must surely have had an effect on the ideology of these people. With angularity came the introduction of corners into people’s perception of their built environment. This could have led to an increased definition of the roles of particular individuals within the social group, as there now was a possibility for people to be mentally assigned to different corners and thus categories. It could have led to a greater differentiation between the sexes.

A particular concept from linguistics, and developmental psychology, might be applicable here. Words associated with female related concepts usually take a rounded and continuous form, be it at the level of a child first grasping at language, or in the languages spoken worldwide. In contrast to this, male related words usually take on a more angular and staccato form. With the introduction of angularity into the built environment, is it possible to take this as signifying an assertion of male dominance? It is also possible, in some ways, to gender another dichotomy, that of external and internal. The outside can be seen as male, with its angles projecting into the outside world. This outside world can be seen as the male domain in many ways. It was where danger lurked in the minds of contemporaries. It was where hunting and warfare took place. It was where those outside of the immediate occupant group were located.
On the other hand, the internal area of the house symbolised enclosure, protection, nurturing and safety, which can be more associated with maternity. Tool production and farming both may still have been mixed activities, although there was probably some division of activity based on gender. The morphology of court and portal tombs also suggests symbolic reference to the male and female form. Portal tombs may have been situated at the edge of a community (Jones, C. 2006, pers comm.). If court tombs were located at the symbolic centre of communities, then it may be that the male-signifying portal tombs were put at the edge of the domestic zone to provide function, and the female court tomb at the centre of this zone. When taken in conjunction with the other cognition carried over from domestic to funerary settings, this might be significant.

The rectilinear house is a conspicuous manipulation of the resources of the agriculturalists, and may have been an assertion of their power addressed both to the elements and to other humans. It was also a symbol of the triumph of domesticity over wildness. It is possible to apply here Hodder’s post-structuralist view of the emergence of Neolithic culture in light of the triumph of the cultivated (interior) world of the house, or domus, over the uncultivated, wild and savage exterior, or agrios (Dark, 1995, 186-7). This is an extremely useful concept which can be applied to the spread of agriculture in general and the changes it brought on so many levels.


The Irish rectilinear house’s morphological similarity with other European examples is also of interest. While it is of much smaller proportions than the LBK house and its derivatives, a cognitive process may be identified in its use in Ireland. It may be that, in the minds of the early farmers in Ireland, the rectilinear house was associated with a time of stability in the past, from where the knowledge of agriculture originated. This might explain the corrupted form, which is far smaller and housed what was most likely a single familial group, as opposed to the multiple families resident in the LBK houses.

Lévi-Strauss described ‘house societies’, organised along the lines of kinship, which in turn led to the emergence of hierarchies (1979/1983 as quoted by Bradley 2007, 59). Whittle states that “the house provided much more than shelter, it encouraged the formalisation of behaviour” (1996, 26). The rectilinear house had a positive feedback effect on its occupants. It at once was affected by and had an effect on the ideology of contemporaries. Its genesis was due to an assertion of identity, and that identity came in turn to be shaped by the concept of the house. The house functioned as a means of socialisation for successive generations. The houses functioned as nodal points in the landscape and their use in a formalised way provided a means by which people created new attachments to place, and through this, new senses of identity and time (Whittle 1996, 26). They were the main arena for the unconscious passing on of the habitus from one generation to the other (Hodder, 2003, 92). The symbolic importance of wheat and stone implement production would also have been important in shaping the identity of these people, as would their burial monuments.

IX. RITUAL ACTIVITY

Evidence for possible ritual activity is also apparent in the rectilinear house. The appearance of evidence for ritual activity does not mean that there were special ritual houses, more that ritual and domestic activity were inextricably linked (Bradley, 1998; 2003; 2007). Intentional deposition at the houses can be taken to indicate ritual, or at least symbolic activity. The later association of the burial monuments to the houses has important symbolic connotations. The deliberate destruction of the houses, either by demolition or by fire can also be viewed from a ritual perspective.

Structural deposition had an important role in the life cycle of the house. Sifting through the published excavation results it is apparent that intentional deposition took place at every site, and often both at the beginning and end of each phase of use. This enables us to identify common cognitive processes at work on a general level, with local variations providing us with a point of entry into the culture of those carrying out the acts. A large-scale interpretative study on a site by site basis would certainly provide an opportunity to reconstruct early Neolithic ideology, especially if done in conjunction with the evidence from court tombs and portal tombs.


A large number of pot sherds of carinated ware,2 lithics and other objects have been recovered from the foundation trenches of many of the houses. That such new technology would be discarded accidentally into these ditches is doubtful. Clay was transformed by fire into pottery, and when taken with the evidence for the other symbolic aspects of fire, the deposition of pottery into the ditches must have had a symbolic role.

Some 2500 sherds of western Neolithic ware were found at Ballyharry (Moore 2003, 160), and there was also considerable evidence for deposition of stone artefacts at the site (ibid, 156-63). The use of rounded stones in the packing of House 1, Phase 2’s foundation has already been mentioned. There also was the possible ritual deposition of a basalt axe and large leaf shaped arrowhead in the central post-holes of the E and W slot trench, both blade edge downwards, under burnt material, but unburned themselves (Moore 2003, 158). This may have been in reference to the previous attack on the site, and if so it gives an insight into how the occupants thought another attack could be prevented, or that they chose to commemorate it in some way. Deposition also took place in a series of shallow pits containing a large range of material including porcellanite, possible jadeite axe rough-outs, pottery, Group VI polished flakes, burnt bone, charred hazelnut fragments and charred cereal grain (ibid).

At nearby Ballygalley, the occurrence of such valuable material as Langdale tuff, from Cumbria, a possible Cornish greenstone axe, pitchstone from the Scottish island of Arran and porcellanite from Tievebulliagh or Rathlin can be taken to indicate profitable long distance trading contacts for the site (Simpson 1996, 129-132; Topping 1996, 167). That such wealth would be left behind accidentally is doubtful, implying intentional deposition. The association of food items in the deposits must also be considered to have been part of the same ritual, perhaps symbolically ‘cooked’ before deposition. Topping points out that it may have been competitive material exhibitionism at either a local or regional level (1996, 167). Ballygalley was covered by a cobbled surface some time after its intentional destruction, and this may have been the final phase of ritual associated with it (ibid).

2  It would be highly informative if a detailed lipid analysis was done on pottery from the sites, as it would provide an interesting insight into the diet of these people (Roycroft, N., 2007, pers comm.). Further stable isotope studies of human remains of the period would also prove useful in this regard. The rate of the build-up of lactose and gluten tolerance might be illustrated by studies such as these.

There may have been some ritual significance involved in the actual materials deposited. At Cloghers near Tralee, Co. Kerry a quartz core was found placed at the base of the substantial north-west post (Kiely 2003, 184). Quartz is a material that is vested with much symbolic value in ritual contexts (Parra, J. 2006, 2008, pers comm.; Bergh 1995, 156; Cooney 2000, 176-8). It is used by Amazonian tribes in Colombia to accompany cremations, and is also used in shamanic and ceremonial rites (Parra, J. 2008, pers. comm.). Deposition at the base of post-holes occurs in both northern Thailand and Malaysia (Smyth 2006, 246). Quartz debitage has also been found at a number of house sites, such as Drummenny (Dunne 2003, 166), Enagh (McSparron 2003a, 172), Corbally 3 (Purcell 1999, 15) and Tankardstown South (Gowen 1987, 8). At Enagh, three pieces were also found in the fill of a posthole (McSparron 2003a, 172), which provides a possible cognitive link to Cloghers. Serpentine beads were found at Corbally 3 (Purcell 1999, 15), and serpentine items were also found at Ballygalley (Simpson 1996, 132). Serpentine3 has been assigned spiritual properties by a number of different cultures at different points in time (http://www.crystal-cure.com/serpentine.html), and it is possible that it might also have had some symbolic significance to the house builders.

The transformational value of fire has already been discussed, and its role in the destruction of some of the houses might be significant. Its possible use for the clearance of land ahead of cultivation (Waddell 2000, 29) could also have been a symbolic act, with fire being employed as an agent in the triumph of the domestic over the wild. Drummenny, Tankardstown South 1 and 2, Kishoge, Ballyharry 2 and 3, Cloghers, Coolfore 2 and Monanny C were all destroyed by fire (Dunne 2003, 165; Gowen 1987, 7; Gowen & Target 1988, 156; O’Donovan 2001, 6; Moore 2003, 156, 158; Kiely 2003, 187; Ó Drisceoil 2003, 181; Walsh 2004, 36). Smyth refutes the common interpretation that these wooden thatched buildings burned down accidentally (2006, 246-7). She uses examples from experimental archaeology (carried out by Bankoff & Winter 1979; Shaffer 1993; Stevanovich 1997) to show that, in order to burn a house down to its foundations, as at Monanny C, it would be necessary to artificially prolong the fire and maintain quite high temperatures (2006, 247-250). The examples used by Smyth also showed that an accidental fire would only burn for a short duration, in the order of 20 minutes, after which time it would be safe to enter the structure and put an end to any smouldering still ongoing (2006, 247).
While the timbers used by the experimenters might not have been dried out to the same extent as in reality, the argument is still convincing.

3  The mineral’s most prolific source is Clew Bay (Ryan, P. 2006, pers. comm.), and this may indicate trade between these areas.

Houses may have been burned for non-ritual purposes also, such as inter-polity warfare. The burning activity at Thornhill is one such example of this, with seven arrowheads being found in context with the burning of an external palisade (Logue 2003, 149-51). Ritual burning of a defeated enemy’s house might have been a possibility for some of the other sites. At Ballyharry, Phase 3 seems to have been ended by an attack which resulted in the burning of the north wall, leaving a number of projectile points in its vicinity (Moore 2003, 157; Smyth 2006, 247). Interestingly, in the subsequent rebuilding phase one of these projectiles was deposited under a rebuilt wall (Moore 2003, 158; Smyth 2006, 247).

The evidence points to the life-cycle of the houses being deliberately ended by fire, at least in some cases. Perhaps the house was regarded as almost human, and strongly linked to its occupants (Bradley 2007, 59). The society might have assigned an important role to fire in the various rites of passage or indeed in daily life (Smyth 2006, 250). It might have been used to purify a site after the death of an occupant (ibid; Bradley 2007, 61). The abundance of cattle bones in the foundation trenches at so many sites might indicate a ritual feast at the end of the buildings’ life cycle. Cooking is another way in which fire acts as an agent of transformation. Cross points out that it would have taken a large number of people to consume a cow, probably even an entire lineage (2003, 199). This feast might also have served as a reward for provision of labour for the building of another house or a court tomb. The placing of burnt bones in the foundation might then have been a symbolic feeding of the house or its spirit.

It is also significant that fire had a role in the rituals surrounding court tomb use, such as cremation and tomb preparation (Bradley 2007, 60). The artifactual assemblage found in court tombs bears remarkable similarity to the domestic assemblage (Bradley 2007, 60). The domestic items found in court tombs might actually be the contents of the tomb’s occupant’s house (Bradley 2007, 61). Bradley also points out that “it seems much more than a coincidence that human corpses should have been treated in exactly the same way as these buildings” (2007, 61). There may have been a similar thought process at work, with fire providing the means for movement from the world of the house to the world of the ancestors. The orthostats used in megalithic tombs might be representative of the timbers used in house construction (Cooney 2000, 58). The shape of the chambers in the tombs also might be taken as representative of the house form. It is possible to apply here Thomas’ notion of tombs being a mnemonic technology (1991, 9-11), serving as reminders of both the house and its occupants long after their demise. On this evidence, it is no coincidence that court tombs and portal tombs are to be found in the vicinity of many rectilinear houses.
Both court tombs at Ballyglass are to be found built over the sites of houses (Ó Nualláin 1972, 49). Indeed, the layout of one of the tombs respected the ground plan of the rectangular house, leading the excavator to conclude that it was “difficult to avoid the conclusion that the house was deliberately demolished to make way for the construction of the tomb” (Ó Nualláin 1972, 55). Habitation refuse has also been found under other court tombs such as Ballymarlagh, Co. Antrim and Ballybriest, Co. Derry (ibid, 56). Ballybriest itself is located to the north of the house at Ballynagilly. Extensive evidence for early Neolithic habitation, including rectilinear houses, has been found underneath the passage tomb at Knowth (Roche 1989, 102-3). Two sherds of Western Neolithic ware were found at a habitation site under the small passage tomb at Townleyhall by Eogan in 1963 (51). Indeed the chamber of the tomb resembles a rectilinear house in its ground plan. The passage tomb at Ballycarty is located a short distance from the house at Cloghers (Kiely 2003, 182). The portal tomb at Gaulstown is located in a townland neighbouring the house sites at Granny and Newrath (Hughes 2004, 26). The court tomb at Drumrat is intervisible with the house site at Drummenny (Dunne 2003, 166). Dunne also points out that the 50 or so megalithic tombs in south-west Donegal are located on warm slopes and ridges (2003, 169), which matches the general siting of rectilinear houses.

What this evidence shows is that in some locations the siting of a rectilinear house had an influence on the choice of site for megalithic tombs. It may be that later groups sought to legitimise the present by using sites associated with ‘the ancestors’. There may, however, be an even closer connection, as has been pointed out above, in that some house occupants may have been interred in court tombs (Bradley 2007, 61 etc.). This might explain why there seems to be direct continuity between the rectilinear house at Ballyglass and the subsequent court tomb.

X. OBSOLESCENCE

The disappearance of the rectilinear houses may have been due more to ecological than to cultural factors. A marked decline in cereal pollen occurs in the palynological record in the middle years of the fourth millennium BC. This period is also marked by a sharp rise in the abundance of dandelion pollen, which indicates open grassland. An example of this is to be found in the core sample results from Lough Sheeauns (Waddell 2000, 28). This can be taken to indicate a shift towards a more pastoral economy. This could have been for a number of reasons, such as the appearance of pests (Bradley 2007, 43), climatic conditions or over-farming. There is evidence for accelerated bog formation in the latter half of the millennium, and by 3200 BC the Céide system was becoming unviable (Cooney & Grogan 1994, 41). A cooling and dampening climate would have meant a cooling of the soils and a reduction in sunshine, which would have greatly diminished crop yields.

With the shift to pastoralism came a shift towards greater mobility in the landscape. This would have made the rectilinear house redundant, as the population would have remained at one location for shorter periods. Circular form houses became the norm from these years, a domestic form which would remain in exclusive use until the medieval period. Perhaps the inhabitants also felt less need to assert their identity in so dramatic a fashion once the shift to pastoralism began (Bradley 2007, 44). The place of rectilinear houses in the landscape and ideology came to be taken by the megalithic tombs, which may have replaced them as nodal points in the landscape for various social groups.

XI. CONCLUSION

The rectilinear house is the quintessential monument of the Irish early Neolithic. It is strongly linked to the introduction of farming and the new ideology, social form and material culture that accompanied it.
Their distribution indicates that there may have been a degree of contact between different parts of the island, or a shared ideology from a shared heritage. The evidence for trade at sites such as Thornhill, Ballygalley and Ballyharry also illustrates inter-regional interaction. The evidence for small scale warfare at Ballyharry and Thornhill indicates that this interaction was not always positive.

The house occupied a central place in both the formation of ideology and socialisation. It can be seen from the evidence outlined above that the people who dwelled in these houses manipulated a wide repertoire of symbols in their everyday lives. The houses themselves were loaded with symbolism, symbols which displayed humanity’s dominance over its environment, which relayed unconscious messages about the structuring of society, gender symbolism, the battle between domus and agrios and many more. The houses functioned as the focal points of their communities, and it seems from the evidence that similar social forms were pan-insular. The occupants manipulated a number of symbols for ritual purposes regarding the houses. Evidence of this is to be found in the deliberate deposition before and after the houses’ life cycles, and in the acts of deliberate destruction by fire and/or demolition. As society shifted towards a pastoral economy, megalithic tombs came to replace the houses as the foci of their respective communities.





BIOCHEMISTRY PANEL

Judging Panel
Prof. Cliona O’Farrelly (Trinity College Dublin) – Chair
Prof. Rhodri Ceredig (NUI Galway)
Dr. David Lloyd (Trinity College Dublin)

Judges’ Comments
The submission entitled Investigating the structural characteristics of transient protein-protein interactions was a laboratory project report which all three judges independently selected as a winner, each equally impressed with how it explored and analysed a complex biochemical topic in a lucid, engaging manner. This was a clearly written, well presented report of a research project carried out by the candidate, consisting of a Summary, Introduction (outlining the specific aims of the project), Materials and Methods, Discussion, References and Appendix. The report focused on the analysis and presentation of results from a series of challenging technologies used to study the interactions of proteins, yet was written and presented in an understandable and accessible way. This presentation was enhanced by the well chosen, elegant molecular-modelling figures included in the manuscript. What pleased us particularly about the report was that the candidate had evidently thought profoundly and independently about the project, as shown by a section in the Introduction entitled “Why bother?” and one in the Discussion entitled “Future studies”. The candidate is to be congratulated on the execution of this report and certainly deserves to win the prize.



BIOCHEMISTRY

Investigating the structural characteristics of transient protein-protein interactions
Niamh Parkinson

Summary
This investigation focuses on the interactions between two electron transfer proteins, Cytochrome c (Cyt c) and Flavodoxin (Fld), as a model system for transient protein-protein interactions. The aim is to define the characteristics of such interactions that play roles both in driving complex formation and in enabling optimum orientation of the proteins for a successful interaction. 2D Heteronuclear Single Quantum Correlation (2D HSQC) NMR spectroscopy was used as the main tool for this study as it allows a fast analysis of these short-lived interactions in solution. This method requires labelling of one of the interacting proteins with the 15N isotope in order to detect changes to the NMR spectrum upon complex formation. For this reason each of the proteins involved was expressed in both the labelled and unlabelled form. Once the proteins had been isolated and sufficiently purified, 2D NMR experiments were carried out from the perspective of (A) 15N-labelled Cyt c interacting with Fld and (B) 15N-labelled Fld interacting with Cyt c. The resulting chemical shift perturbations, when mapped onto the surface of the proteins, highlight the surface areas and residues most involved in the complex interface. In the case of Cyt c, the region most affected by complex formation is located around the edge of the heme co-factor responsible for the transfer of electrons. A similar result was found in the case of Fld, in which the surface area affected was also located around the protruding FMN co-factor. Further NMR experiments exploiting the paramagnetic properties of oxidised Cyt c were also carried out. These results hold specific information regarding the orientation of the proteins, relative to the heme co-factor, during complex formation.

Introduction
Protein-Protein Interactions: Proteins do not act alone but usually carry out their functions within a static protein complex or via transient (short-lived) complex formation. For how long and to what extent the proteins interact depends on the nature and purpose of the interaction. This is generally determined by the structural characteristics of the protein surface, such as the three-dimensional shape, electrostatic charge and chemistry occurring at the protein interface. Two proteins known to form a transient complex are Flavodoxin (Fld) and Cytochrome c (Cyt c). Although these proteins are not physiological partners, they are used here as a model for electron transfer or redox protein interactions. Redox protein-protein interactions are known to occur transiently, and the proteins involved in such complexes have evolved to suit their respective functions. As described by D. S. Bendall (Protein Electron Transfer, 1996), “… the affinity between a redox protein and its reaction partner must be high enough to achieve rapid electron transfer, but not so high as to prevent rapid dissociation of the products and turnover of the chain of carriers as a whole”. Bendall also mentions that, alongside affinity, specificity is another factor that must be brought into the balance. Cyt c, for example, is involved in the mitochondrial electron transport chain, where it shuttles electrons between protein complexes. It would therefore not suffice if Cyt c displayed too high a specificity for one or more of its interacting partners, as this would impede the overall turnover of electron transfer. Typical rates of electron transfer for similar types of complexes are of the order of 10⁵ M⁻¹ s⁻¹ (Feng and Swenson, 1997). For transient complexes it seems that both the association and dissociation rate constants (kon and koff) are high, but what features of the interacting proteins support this mechanism of interaction?
This was the question addressed by Crowley and Ubbink (2003) in their investigation of the protein interactions of the photosynthetic redox chain involving Cyt c6 and Plastocyanin. The outcome of this study was that the underlying factors of complex formation are conserved. For transient interactions these include (a) the effect of the solvent shell on forming an encounter complex, (b) the effects of electrostatics on preorientation and the rate of association of the complex, (c) hydrophobic interactions, which contribute to specificity by aligning the cofactors and stabilising the encounter complex, and (d) the size of the interface. However, although these aspects are conserved, the degree to which each one contributes energetically varies among different complexes. For example, one protein complex may be stabilised or driven mainly by electrostatics, with hydrophobic effects playing only a small role. On the other hand, for another protein complex the opposite may be true.

Aims of this investigation
This project aims to investigate the structural characteristics involved in the transient interactions occurring between yeast iso-1-Cyt c from Saccharomyces cerevisiae and Fld from Escherichia coli using 2D (1H, 15N) NMR spectroscopy. The investigation is focused on highlighting the specific amino acids involved in the complex interface from the perspective of both Cyt c and Fld. This information provides a basis for understanding the chemistry and three-dimensional architecture that promote complex formation while at the same time enabling fast complex dissociation. The interaction sites will be identified using chemical shift perturbation mapping. As previous research has also investigated interactions of these proteins with other partners, both physiological and non-physiological (Hall et al., 2000; Hall et al., 2001; Worrall et al., 2003; Volkov et al., 2006), the results obtained here will be compared to some of those findings in the discussion.

Introducing Cyt c and Fld
The most widely known function of Cyt c is as an electron shuttle between complex III and complex IV in the sequence of events that leads to oxidative phosphorylation in the mitochondrial electron transport chain. It also plays an important role in apoptosis. Structurally speaking, Cyt c is a globular, heme-containing, all-alpha protein with a molecular weight of approximately 12.5 kDa. Its heme cofactor is located in a hydrophobic pocket, but close to the surface and slightly protruding from the protein. It is this heme which confers electron transfer ability on the protein. Around the heme is a hydrophobic patch composed of non-polar amino acid residues, which is in turn surrounded by a region rich in positively charged amino acids such as lysine and arginine (Figure 1A).
Its partner for this study, Fld, is a negatively charged protein composed of alpha helices and beta sheets organised in the Rossmann fold structural motif, common to many nucleotide-binding proteins. The cofactor in this case is a flavin mononucleotide (FMN) which, like the heme of Cyt c, protrudes from the protein surface and is flanked by non-polar residues (Figure 1B). Here the hydrophobic patch is surrounded by negatively charged amino acids such as glutamate and aspartate. With a molecular weight of approximately 14.5 kDa, it is slightly larger than Cyt c. Flavodoxin has several functions involving electron transfer. It was originally thought to replace ferredoxin in microorganisms during times when iron is lacking (Laudenbach, 1988), but many other processes in which it is involved have since been uncovered, for example the activation of anaerobic ribonucleotide reductase (Bianchi et al., 1993).


Fig. 1: Electrostatic surfaces of (A) Cyt c and (B) Fld. Positively and negatively charged regions are represented by blue and red surfaces respectively. White/grey areas represent non-charged patches. The heme cofactor of Cyt c and the FMN cofactor of Fld are shown as spheres.

History of Cyt c and Fld
The first 1H NMR data recorded in the study of this complex were those obtained by Hazzard and Tollin (1985). In the proton NMR spectra, several changes were observed in the heme-related resonances of oxidised Cyt c upon complex formation with Fld, such as chemical shift changes and line broadening. This evidence supported previous kinetic studies which suggested a structural model in which the complex is stabilised by four positively charged lysine residues on the surface of Cyt c and four complementary negatively charged carboxylates on the flavodoxin surface. Such a model served to explain the ionic strength dependence of interprotein electron transfer (Smith et al., 1981). This prompted Mathew et al. (1983) to use computational studies to investigate the role of electrostatic interactions in the favourable preorientation of the molecules before complex formation. These studies strongly supported the hypothesis that the interacting electrostatic fields align both proteins along a trajectory that enables close contact of the cofactors. Here, the ionic strength dependence of the reaction rate was attributed to the ions having a “shielding” effect on the electrostatic fields of both proteins, disrupting the attractive Coulombic forces over longer distances in solution. Similar observations were made by Weber & Tollin (1985), supporting the importance of electrostatic contributions in electron transfer complexes.

More recently
Regarding the geometric complementarity of interacting proteins, it would be natural to assume that one interacting protein would structurally accommodate the other, almost like “molecular lego” (Gilardi et al., 2001). However true this may be of static protein complexes, the situation is different for electron transfer interactions, in which fast dissociation is a key factor (to maintain sufficient turnover). The architecture at the interface of such interactions, along with a statistical analysis of the amino acids occurring in redox interfaces, has been described previously (Crowley & Carrondo, 2004). The conclusion of that work is that the most abundant amino acids in complex interfaces are lysine and acidic residues such as aspartate and glutamate. Crowley and Carrondo also conclude that the degree of atom packing at the interface is low and that the majority of complexes interact via small, flat interfaces. In the present investigation, the architecture of the interface will be studied on a different scale using the molecular docking program PATCHDOCK. This program constructs hypothetical protein-protein complexes using an algorithm that takes into account mainly the three-dimensional shape of the protein surface, with less weight being placed on the surface chemistry and other factors.

Main Strategy
Crystallographic methods can often prove difficult when it comes to electron transfer complexes, due to the high rate of dissociation. For this reason the main tool used here is 2D (1H, 15N) HSQC NMR spectroscopy, as it allows fast analysis of these short-lived interactions in solution (Zuiderweg, 2002; Crowley and Ubbink, 2003). The approach using this technique is to obtain reference spectra for each of the free isotopically labelled proteins. 15N labelling of the proteins is necessary for the detection of the backbone amide resonances, and was implemented during protein expression by growing the expression host on a medium containing 15NH4Cl as the sole nitrogen source. Each amide group in the backbone of the protein (except that of proline) is represented as a peak in the spectrum. The position of each peak corresponds to the chemical shifts of the 1H and 15N atoms of that amide. Chemical shifts hold information regarding the chemical environment of the atoms in the molecule. Therefore, a change in the chemical shift of a peak (seen as a change in the position of the peak in the spectrum) during a titration with a partner protein is interpreted as a change in the chemical environment of the amide group to which that peak corresponds. We in turn interpret this as indicative of complex formation.

Paramagnetic studies
Another part of this project involves paramagnetic studies, which exploit both the NMR technique and the different redox states in which Cyt c can exist. Paramagnetic experiments often require the insertion of paramagnetic spin labels into an interacting protein (Volkov et al., 2006). However, the heme of ferric Cyt c provides a single naturally occurring probe, as an unpaired electron orbits the iron at the centre of the heme. This unpaired electron acts like a small bar magnet within the applied magnetic field and, as a result, exerts an effect on neighbouring resonances. Therefore, we can interpret the effects seen on Fld resonances due to ferric Cyt c (FeIII), as opposed to ferrous Cyt c (FeII), as paramagnetic effects that can occur only if Fld comes into close contact with the heme. These effects hold specific information regarding the orientation of the proteins, relative to the heme co-factor, during complex formation.

Why bother?
The study of the interactions between redox proteins is a vast and busy field of biochemistry today, with much of the focus on uncovering and understanding the chemical mechanisms underlying redox protein interactions. In the bigger picture there is a lot to be gained by broadening our understanding of such interactions. Identifying the structural and chemical characteristics responsible for a successful interaction between two proteins may lead towards the discovery of compounds that can inhibit the interaction and potentially be developed as therapeutic agents. The paper “Towards a new therapeutic target; Helicobacter pylori flavodoxin” by Cremades et al. (2005) is a prime example of how uncovering a small difference in a generally conserved protein structure opened up a huge range of possibilities. The significance of this discovery is amplified by the pathogenicity of H. pylori alongside its resistance to many modern antibiotics. This would seem to be quite a random discovery, but would it have been made if nobody had bothered to look?

Materials
All reagents were obtained from Sigma. The chromatography column materials were purchased from Amersham Scientific. See appendix for media recipes.

Expression of Cyt c
Prior to expression, the host, Escherichia coli strain BL21, was transformed with the expression vector pBTR1. This vector encodes the yeast iso-1-cytochrome c protein along with an ampicillin resistance gene. The details of the transformation procedure are outlined in the appendix. Protein expression was carried out according to the method outlined by Morar et al. (1999). Using aseptic technique, 5 ml of LB medium with added carbenicillin (70 μg/ml) was inoculated with a single colony. The tube containing the inoculated medium was then incubated on a shaking platform (220 rpm) at 37 °C for 4.5 hrs. This pre-culture was then added to 1 L of LB medium containing carbenicillin, previously warmed to 37 °C.
The inoculated LB medium was then split between two sterile 2 L conical flasks. Cells were grown overnight at 30 °C on a shaking platform (220 rpm). For the expression of 15N-labelled Cyt c the procedure was as described above, except that pre-cultures were added to M9 minimal medium containing 15NH4Cl as the sole nitrogen source.


Expression of Fld
The transformation procedure followed was similar to that described for Cyt c. In this case the host, E. coli strain BL21, was transformed with the expression vector pDH07, which encodes the E. coli Fld protein along with an ampicillin resistance gene. The procedure differed in that the pre-culture was grown overnight at 30 °C before inoculating the media for the main cultures. The expression procedure also differed in that the expression vector pDH07 contains a lac promoter region that requires induction using the lactose analogue isopropyl-β-D-thiogalactoside (IPTG). Therefore, during incubation of the main cultures, cell growth was closely monitored starting approximately 1 hr after inoculation. When the OD of the cultures at 600 nm reached 0.6 (found to occur ~2 hrs into the growth period), IPTG was added to each flask to a final concentration of 1 mM. Cells were grown overnight at 30 °C on a shaking platform (220 rpm). For the expression of 15N-labelled Fld the procedure was as described above, except that pre-cultures were added to M9 minimal medium containing 15NH4Cl as the sole nitrogen source.

Harvesting Cells
In all cases the cells were collected by centrifugation at 5000 rpm for 15 minutes. The supernatant was discarded and the cell pellet was re-suspended in the minimum volume of suitable buffer required for re-suspension. In the case of Cyt c, cells were re-suspended in 25 mM KPi and 50 mM NaCl at pH 7. For Fld, cells were re-suspended in Tris buffer, pH 7. EDTA was added to a final concentration of 1 mM.

Cell Lysis
The cell paste was frozen at -20 °C and thawed to assist in lysing cells. DNase (1 mg/ml) was added to a final concentration of approximately 0.05 mg/ml. MgSO4 was also added to a concentration of 1 mM. This was mixed until the paste took on a more water-like viscosity. Cells were lysed further using a French Press operating at a pressure of 1000 Pa.
Unwanted cell membranes and debris were removed by centrifugation for 15 minutes at 15,000 rpm. The supernatant (cell extract) was collected for purification. In the case of Cyt c the cell extract appeared red/pink; for Fld the extract was a very dark yellow/brown. The cell extract was frozen at -20 °C until purification procedures were ready to be initiated.

Ammonium Sulphate Precipitation
This step was implemented in the purification of Cyt c only and was carried out in an ice bath. (NH4)2SO4 was added gradually (towards the point of saturation) until precipitation of unwanted proteins was seen to occur, evident as a white powder-like aggregate in the bottom of the beaker. During the procedure care was taken to stir at a slow enough speed so as not to froth the solution. The precipitated protein was removed by centrifugation at 15,000 rpm for 15 minutes.


Dialysis
Dialysis was necessary after ammonium sulphate precipitation in order to remove the high levels of salt ions in the protein sample. The ionic strength at this point would have been too high for Cyt c to bind to the chromatography column. The cell extract was dialysed against a low salt potassium phosphate buffer at pH 7.

Cyt c purification
Prior to purification the protein solution was filtered. For initial purification an ion exchange (CM Sepharose) chromatography column was used with fast protein liquid chromatography (FPLC). The dialysed cell extract was loaded onto the column and unwanted proteins were eluted using a low salt buffer of 25 mM KPi and 25 mM NaCl at pH 7. Cyt c bound to the negatively charged column and was visible as a distinct red band at the top of the white column material. Cyt c was eluted with a gradient of increasing ionic strength up to a high salt buffer, 25 mM KPi and 100 mM NaCl at pH 7. Pink/red fractions containing Cyt c were pooled and concentrated using ultrafiltration methods. Further purification was carried out on a Gel Filtration Column (Superdex G-75). The buffer used for Gel Filtration was 25 mM KPi and 100 mM NaCl, pH 7. Again pink/red samples containing Cyt c were collected, pooled and concentrated. The purity of Cyt c was assessed using the purity index, with A280/A410 ≤ 5 taken as a measure of high purity. Ascorbate was used to fully reduce the protein before calculating the purity index. Where oxidised Cyt c was required for paramagnetic studies, ferricyanide was used to oxidise Cyt c and was then washed away by ultrafiltration methods.

Fld purification
The purification method used to isolate Fld is similar to that outlined by Mayhew and Massey (1969). Filtered protein solution was loaded onto a DEAE-Cellulose column equilibrated with a low salt phosphate buffer, 25 mM KPi, 100 mM NaCl at pH 7. Fld was seen to bind as a dark blue/grey band. After unwanted proteins were washed off, Fld was eluted using a gradient of increasing ionic strength up to a high salt buffer, 25 mM KPi, 1 M NaCl, pH 7. UV/vis spectroscopy confirmed that the yellow/brown fractions contained Fld, which absorbs maximally at 466 nm (Vetter & Knappe, 1971). These fractions were pooled and concentrated by ultrafiltration methods. This procedure was repeated in order to achieve better separation. Final purification was achieved using a Gel Filtration Column (Superdex G-75). Pooled fractions from the previous column were concentrated to 1 ml and the buffer was exchanged to 25 mM KPi, 100 mM NaCl, using ultrafiltration methods. Eluted fractions (now yellow) containing Fld were pooled and concentrated again using ultrafiltration. Both the protein concentration and the subsequent yield were calculated using the molar extinction coefficient for bound FMN, ε466 = 8.25 mM⁻¹ cm⁻¹ (Vetter & Knappe, 1971). The purity of Fld was determined based on the purity index used for D. vulgaris Fld (Mayhew et al., 1991), A273/A458 ≤ 4.4. However, due to the high absorbance of histidine, which would have an effect on this ratio, a ratio of A273/A466 ≤ 6.7 for the His6-tagged Fld was found to indicate adequate purity and gave excellent NMR spectra. The yield of Fld was within the range of 80–100 mg of protein per litre of cell culture. The yield was somewhat lower for Cyt c, at around 50–60 mg per litre of cell culture.
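The concentration and purity checks described above rest on the Beer-Lambert law, c = A / (ε·l). The following is a minimal Python sketch of that arithmetic, not the procedure actually used in the project: the 1 cm path length, the dilution factor, the absorbance readings and the function names are all illustrative assumptions, while ε466 and the A273/A466 limit are the values quoted in the text.

```python
"""Beer-Lambert concentration estimate and absorbance-ratio purity check (sketch)."""

EPSILON_FLD_466 = 8.25  # mM^-1 cm^-1, bound FMN (value quoted in the text)
PURITY_LIMIT_FLD = 6.7  # A273/A466 limit quoted in the text for His6-tagged Fld


def concentration_mM(absorbance, epsilon, path_cm=1.0, dilution=1.0):
    """Return c = A / (epsilon * l) in mM, scaled up by the dilution factor.

    path_cm = 1.0 is an assumption (a standard cuvette), not stated in the text.
    """
    if epsilon <= 0 or path_cm <= 0:
        raise ValueError("epsilon and path length must be positive")
    return absorbance * dilution / (epsilon * path_cm)


def purity_ratio(a_protein, a_cofactor):
    """Return the ratio of protein-band to cofactor-band absorbance."""
    if a_cofactor <= 0:
        raise ValueError("cofactor absorbance must be positive")
    return a_protein / a_cofactor


if __name__ == "__main__":
    # Hypothetical readings for a sample diluted 10-fold before measurement.
    a273, a466 = 1.98, 0.33
    conc = concentration_mM(a466, EPSILON_FLD_466, dilution=10)
    ratio = purity_ratio(a273, a466)
    print(f"Fld concentration: {conc:.2f} mM")
    print(f"A273/A466 = {ratio:.2f}, within limit: {ratio <= PURITY_LIMIT_FLD}")
```

The same function applies to Cyt c by passing the extinction coefficient and wavelength reported for that protein.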

Preparation of NMR samples
Preparation: Ultrafiltration methods were used to concentrate protein samples to the desired concentration for NMR experiments. The buffer was also exchanged by this method to 20 mM KPi, 50 mM NaCl at pH 6. The concentrations of Cyt c and Fld were determined optically from the absorbance peaks at 550 nm, ε550 = 27.5 mM⁻¹ cm⁻¹ (Frohwirt & Margoliash, 1959), and 466 nm, ε466 = 8.25 mM⁻¹ cm⁻¹ (Vetter & Knappe, 1971), respectively. The working concentrations for the labelled proteins were ~0.2 mM in the case of Cyt c and ~0.15 mM in the case of Fld. The stock concentrations of proteins used to titrate during the NMR experiments were ~1.24 mM for Cyt c and ~2.7 mM for Fld. In all cases NMR samples contained 10% D2O to provide a lock signal.

NMR Titration Experiments
For all titrations: 2D (1H, 15N) HSQC NMR experiments were carried out on a VARIAN 600 MHz NMR Spectrometer at 30 °C. The pH of the samples was adjusted to pH 6.0 prior to titration and this value was maintained throughout by checking the pH after each addition of partner protein. The spectra obtained were processed using VNMRj and CARA software.
[15N]-Cyt c & Fld: Aliquots of Fld (2.7 mM) were added to the Cyt c sample (0.15 mM) in microlitre amounts and 2D (1H, 15N) HSQC spectra were recorded after each addition.
[15N]-Fld & Cyt c (reduced): Aliquots of Cyt c (1.24 mM) were added to the Fld NMR sample (0.2 mM) in microlitre amounts and 2D (1H, 15N) HSQC spectra were recorded after each addition.

Paramagnetic Studies
[15N]-Fld & Cyt c (oxidised): As described above, except that the Cyt c sample (1.24 mM) was in the ferric (oxidised) state before it was added to Fld.

Data Analysis
Calculating the average change in chemical shifts (Δδavg): Changes observed in the chemical shifts of resonances were monitored in MONOSCOPE by overlaying the spectra of the bound protein (at each step of the titration) onto the spectrum representing the free protein (the reference spectrum).
The peak lists for each spectrum were imported into EXCEL and compared. Differences were calculated for both the 15N and the 1H dimensions. The average change in chemical shift was calculated for the final step in the titration only. Averages were calculated using equation (1), where ΔδN represents the change in chemical shift of the nitrogen of the amide group and ΔδH represents the change in chemical shift of the proton of the amide group. Δδavg was then plotted as a function of residue number of the protein.

Chemical Shift Perturbation Mapping: The molecular visualisation programme PyMol (De Lano Scientific) was used for this process. The changes in the average chemical shifts were categorised into large, medium, small and insignificant. This was done for analysis and illustrative purposes only. By taking this information and mapping it onto the surface of the 3D crystal structure of the relevant protein, a 3D representation was constructed that highlights the region(s) on the protein’s surface that are involved in the interface upon transient complex formation.
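The averaging and categorisation steps can be sketched in Python. Equation (1) is not reproduced here, so the form below is an assumption: a commonly used weighted average, Δδavg = sqrt(((ΔδN/5)² + ΔδH²)/2), in which the 15N change is scaled down by a factor of 5 to account for the larger 15N shift range. The category limits are those reported in the results for Cyt c and Fld; the function and constant names are illustrative only.

```python
"""Weighted-average amide chemical shift change and category binning (sketch)."""
import math

N_SCALE = 5.0  # ASSUMED scaling of the 15N dimension relative to 1H

# Category limits in ppm, as reported in the results (largest first).
CYT_C_BINS = [(0.08, "large"), (0.04, "medium"), (0.02, "small")]
FLD_BINS = [(0.04, "large"), (0.02, "medium"), (0.01, "small")]


def delta_avg(delta_n, delta_h, n_scale=N_SCALE):
    """Return the assumed average shift change in ppm for one amide.

    delta_n, delta_h: shift changes (bound minus free) in the 15N and 1H
    dimensions, in ppm.
    """
    return math.sqrt(((delta_n / n_scale) ** 2 + delta_h ** 2) / 2.0)


def categorise(value, bins):
    """Return the label of the first bin whose lower limit the value reaches."""
    for lower_limit, label in bins:
        if value >= lower_limit:
            return label
    return "insignificant"


if __name__ == "__main__":
    # Hypothetical per-residue changes (15N ppm, 1H ppm), for illustration only.
    changes = {"K72": (0.40, 0.09), "T49": (0.05, 0.01), "G77": (0.20, 0.04)}
    for residue, (d_n, d_h) in changes.items():
        avg = delta_avg(d_n, d_h)
        print(residue, round(avg, 3), categorise(avg, CYT_C_BINS))
```

Keeping the limits in a list per protein makes it easy to apply the different Cyt c and Fld scales to the same function.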

NMR results Linebroadening: An increase in linewidth of ~43% was observed in the peaks of the Cyt c spectrum after titration with Fld. An example of this is illustrated below.

Fig. 2. (A) Slice of the 1H dimension showing the increase in linewidth of the Cyt c T49 resonance. The red peak represents T49 of free Cyt c (0.15 mM) and the black peak represents T49 of Cyt c in the presence of Fld (Molar ratio Fld : Cyt c = 1.8).


Fig. 3. 2D (15N, 1H) HSQC reference spectra for (A) Cyt c and (B) Fld. The blue boxes highlight the sections which are enlarged in the lower images (C) and (D). (C): Overlay of a section of the Cyt c spectra. Black peaks represent free Cyt c. Red, blue and green peaks represent Cyt c in the presence of 0.02 mM, 0.06 mM and 0.25 mM Fld, respectively. (D): Overlay of a section of the Fld spectra. Black peaks represent free Fld. Red, blue and green peaks represent Fld in the presence of 0.05 mM, 0.1 mM and 0.3 mM Cyt c, respectively. Peaks are labelled using the one-letter abbreviations for the corresponding residues (* indicates unassigned resonances).



Observed changes in chemical shifts
Figure 3A and B show the reference spectra obtained for Cyt c and Fld. Changes in the chemical shifts for several resonances of Cyt c and Fld were observed in the NMR titration experiments in both the 1H and 15N dimensions. These can be seen by overlaying the spectra obtained during the titration (Figure 3C and D). The average change in the chemical shifts (Δδavg) observed for all resonances is illustrated in Figure 4. These changes were categorised based on magnitude, as illustrated by the vertical colour strips and dashed lines.

Chemical shift perturbation maps
The categories for Δδavg were mapped onto the surface of the crystal structures of Cyt c (Louie and Brayer, 1990) and Fld (Hoover and Ludwig, 1997). This highlights the regions of each protein most affected by complex formation. In the case of Cyt c the resonances most affected are predominantly located on the face of the protein that surrounds the heme cofactor. The amino acids of Cyt c that correspond to significant Δδavg are K-2, K5, A7, T8, L9, T12, R13, C14, Q16, V20, E21, K72, K73, Y74, G77 and K79. Some of the effects extend around the sides of the heme-containing face, but no residues at the rear of the protein were significantly affected. The observed Δδavg for Cyt c resonances in the presence of Fld were within the range of 0–0.1 ppm and were categorised as follows: red (large) ≥ 0.08 ppm, orange (medium) ≥ 0.04 ppm, yellow (small) ≥ 0.02 ppm, grey (insignificant) < 0.02 ppm (Figure 5A). In the case of Fld similar results were observed regarding the location of the significantly affected amides. The Δδavg were predominantly observed for amides located around the FMN cofactor and, to some extent, on either side of this face. Again, none of the residues at the rear of the protein were found to be affected by complex formation. The amino acids of Fld that demonstrate significant Δδavg are T11, N13, S39, K40, W56, Y58, G59, D67, D76, E73, E95, T103, E150 and D195.
The observed Δδavg for Fld resonances in the presence of Cyt c were between 0 and 0.08 ppm and varied more within this range. For this reason the categories differed slightly from those used for the Cyt c data, in order to retain the detail of this variation. The categories are: red (large) ≥ 0.04 ppm, orange (medium) ≥ 0.02 ppm, yellow (small) ≥ 0.01 ppm, grey (insignificant) < 0.01 ppm (Figure 5B).

Paramagnetic results
Paramagnetic effects were obtained by taking the difference between the chemical shifts observed for Fld in the presence of (i) reduced Cyt c and (ii) oxidised Cyt c. Resonances which showed a difference of greater than ±0.03 ppm in the 1H dimension and/or ±0.1 ppm in the 15N dimension were taken as significant (see Figure 6A and B). These values differ because the scale of the chemical shift in the 15N and 1H dimensions differs by a factor of ~5. Effects are seen on both polar and non-polar residues (Y57, W56, Q62, G12, N16, I75, T11) along with two negatively charged residues (D67 and D10). This entire region appears to be located on just one side of the cofactor (Figure 7A). However, on further inspection it is evident that I143 is also affected. This hydrophobic residue is located just below the surface of the protein, within the binding pocket of the FMN cofactor (Figure 7B). These results signify the close proximity of the heme and the FMN cofactors during complex formation. They also hold specific information regarding the orientation of the proteins in the complex, as will be discussed further.

Fig. 4. (A) Changes in the average chemical shifts (Δδavg) for 15N labelled Cyt c in the presence of Fld, plotted as a function of residue number. (B) Δδavg for 15N labelled Fld in the presence of reduced Cyt c, plotted as a function of residue number. (C) Δδavg for 15N labelled Fld in the presence of oxidised Cyt c, plotted as a function of residue number. (All) Coloured bands and dashed lines indicate Δδavg categories: red, large; orange, medium; yellow, small; grey, insignificant. Fld categories differ from those used for Cyt c as the Δδavg for Fld were more varied and on a smaller scale. See text.

Fig. 5. Chemical shift perturbation maps for (A) Cyt c in the presence of Fld and (B) Fld in the presence of Cyt c. Colour categories are described in the main text. The heme and FMN cofactors are shown as blue spheres. Each map view has been rotated 90° clockwise around the Y-axis, relative to the map above it.
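The significance filter described for the paramagnetic results can be sketched as follows. This is a minimal illustration under stated assumptions rather than the analysis actually performed: the difference is taken as oxidised minus reduced (the text does not state the direction), the per-residue shift values are hypothetical, and the function name is illustrative. Only the cut-offs of 0.03 ppm (1H) and 0.1 ppm (15N) come from the text.

```python
"""Flag Fld amides whose shifts differ between oxidised and reduced Cyt c (sketch)."""

H_CUTOFF_PPM = 0.03  # 1H cut-off quoted in the text
N_CUTOFF_PPM = 0.10  # 15N cut-off quoted in the text


def paramagnetic_hits(reduced, oxidised):
    """Return {residue: (d1H, d15N)} for residues exceeding either cut-off.

    reduced, oxidised: dicts mapping a residue label to its (1H, 15N) chemical
    shifts in ppm, measured with reduced and oxidised Cyt c respectively.
    Differences are ASSUMED to be oxidised minus reduced.
    """
    hits = {}
    for residue in sorted(reduced.keys() & oxidised.keys()):
        d_h = oxidised[residue][0] - reduced[residue][0]
        d_n = oxidised[residue][1] - reduced[residue][1]
        if abs(d_h) > H_CUTOFF_PPM or abs(d_n) > N_CUTOFF_PPM:
            hits[residue] = (d_h, d_n)
    return hits


if __name__ == "__main__":
    # Hypothetical peak positions (1H ppm, 15N ppm), for illustration only.
    reduced = {"W56": (8.20, 120.00), "K40": (7.90, 118.50), "I143": (8.75, 125.30)}
    oxidised = {"W56": (8.26, 120.05), "K40": (7.91, 118.52), "I143": (8.76, 125.45)}
    for residue, (d_h, d_n) in paramagnetic_hits(reduced, oxidised).items():
        print(f"{residue}: d1H = {d_h:+.2f} ppm, d15N = {d_n:+.2f} ppm")
```

Testing the two dimensions separately, rather than on a combined average, mirrors the "and/or" criterion given in the text.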

Results of Docking Studies
PATCHDOCK is a program that uses a computer algorithm to construct hypothetical complexes of any two molecules. The complex is constructed by dividing the surface of each protein into segments based on shape, i.e. concave, convex or flat. The segments are then filtered and only segments which contain certain residues, considered ‘hot spot’ residues by the algorithm, are retained. These are then matched and scored based on geometric complementarity. The basis of matching is therefore mainly shape, with less emphasis placed on chemistry. The program then outputs the best matches based on these criteria and discards other configurations. The PDB files for Cyt c and Fld, 1YCC (Louie and Brayer, 1990) and 1AG9 (Hoover and Ludwig, 1997) respectively, were submitted to PATCHDOCK and the top twenty results were analysed in PyMol. The top five results are shown in Figure 8 as an example of the output. None of the results showed the FMN and heme cofactors either within the suggested complex interface or in close proximity to one another. The significance of this observation will be outlined in the discussion.

The Cyt c - Fld complex
The chemical shift perturbation maps in Figure 5 illustrate the residues of each protein that are significantly affected upon complex formation. The perturbations are interpreted as a change in the chemical environment of the corresponding residues upon addition of the partner protein. There are several aspects of complex formation that may be responsible for these amides experiencing a change in their chemical environment. One such effect is a change in electrostatic potential. It is highly likely that this is responsible for some of the observed perturbations, as the proteins are oppositely charged. Many charged residues demonstrated changes in amide chemical shifts.
In the interface uncovered for Cyt c for example (Figure 5A) positively charged lysines (K-2, K5, K72, K73, and K79) along with one arginine (R13) make up for almost 50 % of the involved residues suggesting that charged residues are imperitive for complex formation. It seems possible that these positive charges are complemented by the negatively charged glutamates (E73, E95, and E150) and aspartates (D67, and D195) on the periphery of the binding site on Fld. This correlates well with previous findings on electrostatic interactions between Cyt c and Fld interactions (Mathew et al., 1983; Weber and Tollin, 1984; Hazzard and Tollin, 1985), and with other redox protein interactions such as Cyt c and Plastocyanin (Ubbink and Bendall, 1997) and also Cyt c and Cyt c Peroxidase (Volkov et al., 2006). 85


Fig. 6. Chemical shifts for Fld in (A) the 1H dimension and (B) the 15N dimension due to paramagnetic effects of oxidised Cyt c. Categorisation for chemical shift perturbation maps; (A) Blue (significant for 1H) ≥ ∓0.03 ppm and (B) Blue (significant for 15N) ≥ ∓ 0.1 ppm.


Fig. 7. (A) Chemical shift perturbation maps for Fld highlighting the regions of the protein surface that experience the paramagnetic effects of ferric (oxidised) Cyt c. This Figure illustrates how remarkably close to the FMN cofactor this region lies. (B) I143, located just beneath the protein surface, is also affected. The enlarged section on the left shows how this residue is also very close to the FMN.


Fig. 8. (A-E); Top five PATCHDOCK results for the complex of Cyt c and Fld. Cyt c is represented as grey cartoon with the heme in blue. Cyt c is in the same orientation for all five illustrations to highlight the different positioning of Fld around it. The heme and FMN cofactors are labelled. None of the complexes have the cofactors positioned together suggesting that the complex has low geometric complementarity at the actual interface.


Fig. 9. Comparisons between chemical shift perturbations and electrostatic potentials for Cyt c (A, C) and Fld (B, D).



Desolvation is a necessary component of complex formation. As proteins come closer together, water molecules that were previously bound to the surface groups are released, resulting in a net entropy gain for the complex. A large hydrophobic region at an interface therefore confers more stability to a complex than a small hydrophobic region. This gain in entropy is a driving force for complex formation. In the Cyt c:Fld complex, desolvation is most likely responsible for changes seen in the amides of hydrophobic residues such as L9, W56, Y58 and G77 as well as for those residues that are able to partake in H-bonding such as T12 and Q16 of Cyt c and T11 and N13 of Fld (Figure 9). With the release of water, inter-protein interactions are formed and many side chains undergo conformational changes, thus altering their chemical environment or that of neighbouring side chains. This is another reason why changes in chemical shifts are observed.

A particularly interesting finding is the presence of an arginine (R13) close to the heme of Cyt c and two aromatic residues (Y58 and W56) close to the FMN cofactor of Fld. Arginine, a positively charged residue with a long and flexible side chain, is one of the most abundant residues present in protein complex interfaces (Crowley and Golovin, 2005). It has the ability to form electrostatic interactions, in a co-planar manner, with the aromatic side chains of tyrosine, tryptophan and phenylalanine (particularly tyrosine). This type of interaction is called a cation-π interaction. Crowley and Golovin found that such interactions occur widely among all classes of protein complexes. It is possible that in the case of the Cyt c - Fld complex the R13 of Cyt c is engaging in cation-π interactions with Y58 and/or W56, whose chemical shifts have been moderately perturbed by complex formation (Figure 5B).
Mutagenesis experiments have found that a cation-π interaction has an energy of ~0.6 kcal/mol but that the overall contribution of cation-π interactions is small, possibly due to the high cost of desolvation upon burial of the long arginine side chain (Paddock et al., 2005). This type of counteraction in terms of energy cost and benefit may be one of the mechanisms that promotes the right balance between specificity and affinity of the Cyt c – Fld complex, thus enabling the interaction to remain transient. The degree of specificity, conferred by the cation-π interactions, may be balanced with the level of affinity, which is driven by hydrophobic interactions and desolvation. Even more interesting, however, is that on further inspection of this region of the Fld surface, another tyrosine (Y93) remains unaffected, which raises the question as to what other chemistry might be occurring that results in only Y58 and W56 being affected (Figure 10). The Y57 resonance was too broad for detection in the spectra, so it is unknown whether this residue experiences a chemical shift change. Lysine is also known to engage in cation-π interactions with tyrosine and tryptophan but with lower propensities than arginine. It also tends to be found at the periphery of interfaces rather than in the centre (Crowley and Golovin, 2005). Therefore, it is less likely that K72, K73 and K79 of Cyt c are involved in cation-π interactions with Y58 and W56 of Fld, although this cannot be disregarded.

On comparison with the interface of the yeast Cyt c - Cytochrome c Peroxidase (CcP) complex (Guo et al., 2004) using the PDB file 1S6V, the part of the Cyt c molecule that lies closest to the CcP surface is an alpha helix containing an R13. As shown in Figure 11, the R13 of the Cyt c – CcP complex lies in close proximity to Y38 of CcP, forming a cation-π interaction. This suggests that in the present study, the same residue (R13) is forming this type of interaction upon complex formation with Fld.

Fig. 10. Left hand side: The structure of Fld in cartoon representation. The box highlights the region that is rich in aromatic residues with side chains shown in stick mode. The FMN cofactor is represented as blue sticks. Residue T11 was removed for clarity. Colour codes remain as per previous Δδavg categorisation for Fld. Right hand side: Enlarged section showing Y58 and W56 (orange) that were moderately affected upon complex formation unlike Y93 that was not. The Y57 resonance was too broad to detect.

Fld & Cyt c with other partners

Some of the results from the current investigation are in agreement with previous studies. For example, the interactions of flavodoxin with its physiological partners, flavodoxin reductase and cobalamin-methionine synthase (Hall et al., 2001), bear a striking similarity to the chemical shift perturbations of Fld on titration with Cyt c. On comparison, it is clear that in both cases it is the face of Fld containing the FMN that is used as a binding interface with partner proteins. Also similar is the magnitude of the average chemical shift changes, which were within the same range of approximately 0.01 - 0.1 ppm. Similar findings are evident regarding the interactions of Cyt c with other non-physiological partners (Ubbink & Bendall, 1997; Worrall et al., 2003). In the non-physiological complex with adrenodoxin (Worrall et al., 2003), many of the chemical shift perturbations correspond to those seen in the present study. The findings consistently indicate that it is the heme-containing face that Cyt c uses to interact in redox partnerships. When compared to the physiological complex of Cyt c and Cytochrome c Peroxidase (CcP) (Worrall et al., 2001), the binding sites appear more extensive for the interactions with CcP than for those seen for Cyt c with adrenodoxin or Fld.

Fig. 11. Cation-π interaction between R13 (magenta) of Cyt c and Y39 (orange) of CcP. Cyt c and CcP chains are shown as cyan and gray cartoons respectively with the heme cofactor of Cyt c in blue sticks. Residues 79 to 83 of the Cyt c chain have been removed for clarity.

Paramagnetic Results

In this investigation paramagnetic effects signify close contact of an Fld nucleus with the unpaired electron orbiting the iron (FeIII) of the Cyt c heme cofactor. Most of the affected resonances (D10, T11, G12, N16, W56, Y57, Q62, and D67) are located at the surface of the protein remarkably close to the FMN cofactor. The furthest from the FMN is D67. It is noticeable that all of these residues are positioned on only one side of the FMN. Also significantly affected is the amide of I143, which is situated just below the surface of Fld yet in close contact with the FMN. This suggests that this residue must also come into close proximity with the heme of Cyt c. There is further significance to these results. The changes in chemical shifts hold quantifiable information regarding the orientation of Fld to the heme cofactor of Cyt c. This can be calculated using the equation (2)

where Δδpara is the change in chemical shift due to paramagnetic effects (Crowley et al., 2001). This value is related to the angle (ϑ) at which the nucleus lies to the electron orbital and also the distance (r) from the nucleus to the iron. Therefore, the angle/ orientation at which Fld lies, either above or below the heme, can be calculated using the approximate distance at which the paramagnetic effects are experienced. The calculations were not carried out in this study due to time constraints.
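For reference, and on the assumption that the relation takes the commonly used axial form of the pseudocontact shift, the dependence of Δδpara on the distance r and the angle ϑ can be sketched as below. The symbol Δχax (the axial component of the magnetic susceptibility anisotropy of the heme iron) is not defined in the text and is introduced here as an assumption; a rhombic term may also be needed in practice.

```latex
\Delta\delta_{\mathrm{para}} = \frac{\Delta\chi_{\mathrm{ax}}}{12\pi r^{3}} \left( 3\cos^{2}\vartheta - 1 \right)
```

In this form the shift falls off with the cube of the distance to the iron and changes sign with the angle, which is why the sign and size of the observed shifts can, in principle, indicate whether a nucleus lies above, below or alongside the heme.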

Geometric complementarity

The results from the studies carried out using PATCHDOCK indirectly suggest that geometric complementarity at the interface of the Cyt c – Fld complex is low. PATCHDOCK is a molecular docking programme that searches for docking configurations in which the shape complementarity is maximised. It is expected that if geometric complementarity were an important aspect of binding, the results would include a complex in which the cofactors were in close proximity to each other to facilitate electron transfer. However, this was not the case. Of the top twenty results investigated, none portrayed the proteins in a reactive configuration. This may contribute to the transient nature of the interaction by prohibiting a large number of surface contacts at any one time. In other words, structural accommodation by one protein of the 3D shape of the other (such as a convex shape fitting into a concave one, Figure 12B) would impede dissociation of the complex and subsequently limit the turnover of ET (Figure 12).

Fig. 12. (A) Schematic diagram illustrating the low geometric complementarity of the Cyt c and Fld results shown in Figure 8. (B) This is most likely not the orientation adopted by the proteins in the complex as such a fit would impede dissociation and subsequently limit the turnover of ET.

Conclusion

The Cyt c - Fld complex: Several conclusions can be made from the findings of this investigation. Firstly, and in agreement with previous studies of redox protein interactions, electrostatics play a fundamental role in the Cyt c - Fld complex, preorientating the proteins and guiding them towards the reactive configuration. Charged residues may also participate in salt bridges within the encounter complex. The discovery of the involvement of an arginine along with two aromatic residues in the complex interface suggests that cation-π interactions may be responsible for specificity of binding and for guiding the cofactors into close proximity for electron transfer to occur. It is proposed that the energy cost due to desolvation of charged residues may counteract the benefit of the cation-π interaction as a mechanism to maintain a balance between affinity and specificity for the reaction to remain transient. Another mechanism supporting this balance may be the low geometric complementarity at the binding interface of the interacting proteins. This is indirectly suggested by the results of the molecular docking programme PATCHDOCK. Paramagnetic studies have shown that there is a dominant orientation of the proteins in the reactive complex in which the FMN cofactor is positioned alongside, and perhaps parallel to, the heme of Cyt c. These results require further analysis to deduce the orientation of Fld in relation to the heme of Cyt c. Overall, it has been shown that some of these findings correlate well with similar studies of these proteins, both with each other and with other partners. However, in this study further insight has been gained into the possibility of a cation-π interaction, the dominant orientation of the proteins during complex formation, and the absence of geometric complementarity of interacting surfaces.

Further studies: There are many ways in which this complex could be investigated further. One such method is co-crystallisation of the complex.
This can prove difficult because the interaction is, by nature, transient (Radaev et al., 2006). Therefore it may be advantageous to design a molecule which mimics the surface of the partner protein (Fletcher & Hamilton, 2006; Thanos et al., 2006) and which could be crystallized with the protein more easily than a complex. Some NMR studies have already been carried out using porphyrin molecules with negatively charged substituents (Aya & Hamilton, 2003; Crowley et al., 2007, unpublished work) to bind to cytochromes. It is possible that such studies, in this case involving Fld, could be carried out using porphyrin molecules with positively charged substituents that resemble the amino acid side chains of lysine and arginine. Apart from their potential as lead compounds for therapeutic agents, these molecules could be used to investigate further the possibility of a cation-π interaction close to the FMN cofactor. Another possibility is to study these interactions using 2D (1H, 15N) HSQC NMR similar to the methods outlined herein, along with competition studies involving Cyt c. Small angle X-ray scattering methods have been used to further explore the orientation of the proteins within the Cyt c – Fld complex. This provides an envelope into which a best fit model of the reactive configuration of the complex can be applied. The data obtained from this technique are currently being analysed.



BUSINESS PANEL

Judging Panel

Mr. William Kelly (Dublin City University) – Chair

Judges’ Comments

The judging panel were of the view that this paper met all three key criteria: it displayed originality, contained no significant weaknesses and demonstrated intellectual excellence. The paper examined the issues for Whole Foods Market Inc., a US company with a very strong identity based on values derived essentially from its HR strategy, in expanding its operations to the UK. The judges felt that this paper set out the context very clearly and was very clearly written. It made excellent use of an academic literature that was thoughtfully analysed, reaching firm conclusions and recommendations. The recommendations appeared to the judges to be very sensible and collectively constituted an excellent plan.

Please note

Due to space constraints in the print version of this journal, only the introduction and recommendations of this essay are published here. The full text of the essay will be available from the Awards website at www.uaireland.com



BUSINESS

Different cultures, same culture? International HRS in Whole Foods Market Inc.

Anne Byrne, Grainne Conroy & Megan Huxhold

Introduction

Whole Foods Market Inc. is a chain of organic food supermarkets founded in 1980 by the amalgamation of Safer Way Natural Foods and Clarksville Natural Grocery. Since then it has developed from a one-shop operation in Texas, USA, to one of the leading food retailers in the USA, with over 265 stores in North America and the UK. In 1992, the company went public, floating on the Nasdaq Stock Exchange. Continuous acquisition of similar smaller stores has aided this growth – since 1980 the firm has acquired eighteen regional competitors. Most recently, Whole Foods Market Inc. merged with Wild Oats Markets Inc.¹ The company has been listed as one of the “Fortune 100 Best Companies to Work For” every year since the inception of the list in 1998.²

Section 1 of this project will give an overview of the company and its current HRM strategy and practices. Section 2 will outline the HR strengths and weaknesses of the company and will identify a key people-related business issue to be addressed, namely the issue of culture transfer and international HRS and HRM in Whole Foods Market’s UK operations. Section 3 will consist of an analysis of the literature and case studies relevant to this key people-related business issue. The final section of the project, Section 4, will detail the proposed HR strategy, strategic objectives and in-depth HR practices for Whole Foods Market to implement in response to this international challenge.

¹ Source: Whole Foods Market, (2008), Whole Foods Market Company, available online at: http://www.wholefoodsmarket.com/company/index.html, accessed 04/04/2008.
² Fortune, (2008), 100 Best Companies to Work For, available online at http://money.cnn.com/magazines/fortune/bestcompanies/2008/index.html, accessed 04/04/2008.

Section IV

A. HR Strategy

Problems with Culture Transfer

The analysis in the previous section indicates that there are many challenges involved in the international HRS of companies with a heavy emphasis on company culture. The approach stems from an understanding that, firstly, company culture cannot be directly transferred from Whole Foods USA to Whole Foods UK. This is due to national cultural and societal differences and practical problems. A number of the HR practices used by Whole Foods Market to reinforce and/or establish company culture in the USA cannot practically work in the UK (for example, the policy that 30% of staff in a new store be existing Whole Foods staff transferred from other stores). These implementation difficulties with Whole Foods Market’s traditional methodologies will be highlighted throughout the following sections as the HR practices are explained. In addition, because of the nature of Whole Foods Market and the integral role HRS plays in the company’s brand image and business strategy, it would be foolhardy for Whole Foods to even attempt to directly transfer culture and HR practices. Responsive employees who fully believe in the culture of the company are integral to Whole Foods Market’s brand image. In order for this to continue, Whole Foods employees need to have the capacity to facilitate organic cultural growth. Imposition of culture, or a culture which is not reflective of the needs of UK employees and consumers, will weaken the company’s brand.

The UK Strategy

Whole Foods Market’s HR strategy for the UK needs to fulfil two criteria: (a) it needs to be fully integrated with the company’s business strategy of quality and growth and (b) it needs to adapt to fit the needs of the UK environment. With that in mind, it is proposed that Whole Foods adopt a strategy of identifying and fostering core Whole Foods company culture in the UK whilst recognising, accepting and even encouraging national cultural flexibility.


Fig. 4(a). Venn diagram indicating the outcome of Whole Foods Market’s proposed HRS (labels: US WFM Culture, Core Culture, UK WFM Culture).

One of the challenges of this strategy is to overcome the potential for a perception of the imposition of “American” culture on UK employees. Practices must aim to be in tune with specific UK needs and allow scope for national flexibility and the organic growth of national company culture. In addition, the aspects of culture being transposed from the USA should be presented as part of a “Whole Foods Culture” rather than an American culture. This strategy and its success are important for Whole Foods Market, not just for its UK operations but because of Whole Foods Market’s stated intention of further European expansion. A strategic plan needs to be in place to enable this future expansion.

B. HR Objectives

In light of this strategy, the HR objectives for Whole Foods Market’s UK HR policy are as follows:

• To identify the core aspects of Whole Foods Market company culture
• To ensure that these core aspects are transferred to the UK
• To ensure that those aspects which are transferred are perceived as a Whole Foods culture and not an imposition of national US culture
• To enable and facilitate the broadening of Whole Foods company culture in the UK to allow a UK Whole Foods culture to develop
• To enable employees in the UK to respond to national market needs
• To continue Whole Foods Market’s success in utilising HR practices and quality in employee service as part of an overarching business strategy
• To create flexibility in company culture and HR practices to aid the success of future growth in overseas markets

In essence, these objectives are about balance. Whole Foods Market must balance national responsiveness with the need to retain a unified brand image.


The following sections outline the specific HR practices to be implemented in order to achieve these objectives. Each of these practices will establish the core culture and a mechanism for transfer along with adaptation for the UK environment.

C. Staffing and Training & Development

WFM has relied in the past on placing staff from existing stores into new stores as a means of creating consistency in management and methods as well as establishing company culture. WFM aims for 30% of staff in a new store to be internally placed from other stores. In the UK context this threshold is difficult to maintain, and WFM must consider this staffing issue and a potential change in HRS.

Factors WFM need to consider

Cost? Expatriate staffing across international boundaries is very costly for the firm, as significantly greater rewards must be offered to expatriate staff to both incentivise and compensate for international transfer. In addition, the most common form of expatriation for international companies is at a high managerial level. On a cost and incentive basis it is difficult to justify or implement a high percentage of expatriation at lower levels. This poses difficulties for WFM, who rely on the bottom-up promotion of company culture.

How important is control? Control of processes seems less important for WFM than the attainment of targets. Control can be exercised through central target setting without the need to control processes through expatriate management supervision. However, WFM’s brand image is central to their success, and employees and HRS are central to the brand, thus it may be important for this brand image to be controlled in a more direct manner.

Industry type? WFM’s industry can be characterised as a “multi-domestic” industry. Traditionally a low level of expatriate presence is seen in such industries, because the cohesion of culture and strategy is seen as less important when domestic sectors can operate according to the needs of the specific market. A certain degree of expatriate employees at a regional high managerial level may be able to create the circumstances for the instigation of company culture whilst allowing for the organic growth of a new adapted culture suitable for the employees and market in the UK.


Host country characteristics? The UK has many business culture similarities to the US (as well as a number of differences). In addition, educational levels in the UK are of a similar base to those of the US. These factors indicate that it is possible to obtain high-calibre local management in the UK.

Recommendation – Initial Recruitment

The factors above indicate that high levels of expatriate employees in the UK appear neither necessary nor justified. The question, however, is how this impacts upon company culture and what can be done to resolve this. Our recommendation takes a two-fold approach:

(a) Limited expatriate presence

Excessive expatriate presence is not justified for the above reasons. High-calibre candidates are available in the UK, WFM’s industry type is such that autonomy at store level is acceptable, and control can be successfully implemented through target setting. Some expatriate presence is needed for communication and consistency reasons, but this should be reasonably limited. The issue outstanding relates to culture implementation. It is felt that the excessive dominance of parent-company nationals at a managerial level may create resentment about the imposition of culture rather than its organic growth. In addition, company culture in WFM is facilitated on a bottom-level basis, which for practical reasons cannot be done in the UK by the introduction of expatriate bottom-level staff. For these reasons it is felt that alternative methods of company culture growth should be found.

(b) Impatriation

Supplementary to a degree of expatriation should be the long-term employment of UK employees at US HQ level, once the UK region has been established. This will strengthen links between the regions and allow for greater coherence and integration of strategy. It will also create an additional knowledge base and bridge for further European expansion. Communication difficulties and isolation of the second country region can be problems with international expansion, but a two-way process of staffing may prevent these from arising in the long run.

Recommendation – Cross-Cultural Training and Development

The way in which company culture can and should be fostered in the UK stores is through a detailed training and development plan. Training and development in the US has proved successful on a knowledge level through the WFM “University” and in fostering leadership skills through their team leader programme. These methods should be continued in the UK and supplemented with cross-national training as well as general team building training. Ultimately, company culture has to be fostered in this manner rather than through an imposition from higher management or a costly bottom-level expatriation scheme. In addition to the established knowledge-based and team-based training methods already employed by Whole Foods Market, the training and development plan will take a three-step process as outlined below:

(a) Higher Management Training

Extended training and working in the US for host country higher management may also be successful. By allowing UK employees to work and train in the US and become immersed in the US culture, they may return with the ability to transfer WFM strategy, goals and practices to the UK. In addition, this manner of high-level instigation of company culture and strategy may be more acceptable to lower-level employees when it is seen to be coming from fellow UK citizens rather than being imposed by what can be seen as external interference. Regular travel to the USA by UK executives and strong combined management training will provide the cohesion necessary to maintain a strong HRS and brand consistency. This continual cross-cultural training and communication allows the best practices from each region to develop, and each region to learn from the other.

(b) Team Leader Training (US to UK)

Another initial step in training should be the short-term (two-week) transfer of identified excellent team leaders from the US to stores in the UK. This should be undertaken at an early stage, before practices and behaviours are too firmly established in the UK stores. This can foster a whole-company attitude at the ground level. In addition, this method may be more successful than merely establishing higher-level cross-cultural development, as the informal, peer-based training may be more accessible for team leaders and workers in the UK.

(c) Team Leader Training (UK to US)

Once practices have been established in the UK, the next training and development step should be short-term training of potential team leaders/management in the US.

The above training and development processes should be continued on a regular and long-term basis over the course of the company’s life. This multi-tiered communication and training will allow each level of management to learn from the best practice of each nation and may foster a sense of cohesion.

Cost

Cross-cultural training, impatriation and expatriation will clearly have an effect on the company’s cost base. This cost outlay is justified, however, for two reasons. Firstly, this is a long-term investment in the brand identity of the company. A strong company culture is integral to Whole Foods Market’s success and by fostering this through these processes, Whole Foods Market should be able to grow from strength to strength in the UK. Ultimately, HRS and HR practices will define the success of the UK operation and the initial cost outlays are justified because of their impact on overall company success. Secondly, the processes recommended above are less costly than alternative methods which are employed by other companies engaging in international expansion. Expatriation is the most costly method of establishing culture and HRS in the new market and with the methods advocated above, expatriation is kept to a minimum, allowing culture to develop in a more organic manner whilst still retaining the necessary level of control and continuity.

Summary

The aim of the initial recruitment set-up and the long-term training plan is to foster the development of company culture in a way which retains the core aspects necessary for brand and company development, whilst allowing employees and management the autonomy to develop culture and practices which adapt to the differences between the UK and US. The practices will not be one-off, however, and each method will be used continually once implemented. The basis of this timeline is to allow the time and scope for independent development of culture, whilst reinforcing this with the understanding and training of parent company methods.

D. Performance Appraisal

In the US, Whole Foods uses performance appraisal as a mechanism of control to ensure that profitability targets are met. In terms of establishing strong foundations for the company in the UK market, we feel that performance appraisal has a crucial developmental role to play. We recommend that the first three years of the UK expansion be viewed as a “developmental phase”. After that the company will move into a more stable period or “flexible stability phase”.

Stage 1: The Developmental Phase (Years 1-3)

“Act on the feedback then measure the impact”³

Whole Foods has invested heavily in physical capital in the UK, e.g. the flagship store in Kensington. In order to establish the necessary strong foundations to facilitate further expansion in the UK they must also invest in human capital. As stressed extensively throughout this project, Whole Foods’ key differentiator is superior quality service. It is important, therefore, to ensure that UK employees are delivering this. This involves providing employees with the necessary training and development to develop key skills. The main business objective of this phase is rapid growth of market share; therefore, an emphasis should be placed on sales in setting team targets.

We suggest that a hybrid of 360° performance appraisal be used at regular intervals (every six months, for example). Supervisors, customers and colleagues from other teams would provide feedback on a team unit as a whole. This enables the team to gain an outsider perspective and also to appreciate the interdependencies between the various teams in-store. Within the team itself, members would give feedback on each other. This aims to reduce the possibility of intra-team conflict by providing appropriate channels for people to air any frustrations. Whole Foods would then “act on the feedback”. This would be done by setting team sales targets for the next period while, at the same time, intervening to provide the training programmes deemed necessary based on the feedback received. The impact of this will then be measured at the end of the period, when planned results are compared with actual results.

Whole Foods’ policy of making all stores’ results available internally would be especially useful in this phase as it will enable stores to learn from each other. This avoids the unnecessary duplication of mistakes while leveraging any successes across the entire organisation. We feel that the practices of the “Company Snapshot” and “Store Tour” could be introduced in the UK after the first year to 18 months.

³ Source: Craig (1999)

Stage 2: The Flexible Stability Phase (Year 4 on approx.)

The purpose of the developmental phase is to recognise that major investment is needed to establish Whole Foods successfully in the UK. In the medium to long term, however, profitability metrics cannot be ignored. Therefore, we suggest that after the first three years performance appraisal practices start to converge, more or less, with current practices in the US. This would facilitate better integration between US and UK subsidiaries and would be supported by rewards practices such as gain-sharing. That said, highlighting training needs will be an ongoing practice in the UK, as it is in the US.

Recommendation
Phase 1: Use performance appraisal to highlight training needs to support a policy of market share growth.
Phase 2: Performance metrics should adopt more of a profitability focus, converging more or less with US practices.



E. Pay and Rewards

[Fig. 4(a). Diagram of the rewards package: Highly Competitive Base Pay + Benefit Menu + Training Bonus, Team Bonus, Promotion and Gain sharing.]

Should Whole Foods pay the same way?

The lowest wage that Whole Foods pays in the US is 86% greater than the average minimum wage. In Phase 2 of this project, we asserted that the same wage would be merely 23% greater than the UK national minimum wage. This means that, unless wage levels were increased substantially for Whole Foods’ UK operations, this recruitment differentiator would be less pronounced and therefore less effective than in the US.

But is that really the case? The UK national minimum wage (£5.52)5 is approximately 35% greater than that of the highest minimum-wage state in the US, Washington ($8.07,4 or about £4.06). In Kansas, the minimum wage ($2.65) equates to approximately £1.34 at current conversion rates. This illustrates a fundamental difference between the two sides of the Atlantic on the concept of minimum wages. In the UK, and indeed in Europe more generally, minimum wages are set in an effort to ensure that all employees can earn enough to cover the basic costs of living. In the US, state intervention in the economy is generally discouraged; it can therefore be argued that minimum wage levels are purposely set unrealistically low in order to allow businesses the freedom to determine what wage levels should be. This means that comparisons based on the level of “minimum wage premium” between wages in the US and the UK simply do not compare like with like.

A relatively high level of base pay is an initial attractor for potential employees. We recommend, however, that these rates be calculated with a view to offering higher rates than those offered by competing retailers in the UK, rather than by adding US-type premiums to an already high UK minimum wage. Offering relatively high base wages diminishes the need for the frequent marginal pay increases that are common in many service jobs. Instead, we recommend that base wages be increased mainly in line with inflation. This is in an effort to foster a sense of team cohesion by minimising the pay differentials between employees in similar positions.

4  Source: US Dept. of Labour (2008)
5  Source: UK Dept. of Business, Enterprise and Regulatory Reform (2008)
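The minimum-wage comparison above can be checked with a few lines of arithmetic. The following is only an illustrative sketch: the wage figures are those quoted in the text, while the exchange rate is an assumption, taken as the rate implied by the text’s own conversion of $8.07 to £4.06. The function name `premium_percent` is hypothetical.

```python
# Figures quoted in the text (2008 minimum wages). The exchange rate is
# an assumption: it is implied by the text's own conversion of $8.07 to
# £4.06, and is not an official rate.
WASHINGTON_USD = 8.07   # highest US state minimum wage (Washington)
WASHINGTON_GBP = 4.06   # the same wage as converted in the text
KANSAS_USD = 2.65       # Kansas minimum wage
UK_MINIMUM_GBP = 5.52   # UK national minimum wage

USD_TO_GBP = WASHINGTON_GBP / WASHINGTON_USD


def premium_percent(wage: float, baseline: float) -> float:
    """Return how far `wage` exceeds `baseline`, as a percentage."""
    return (wage / baseline - 1.0) * 100.0


uk_over_washington = premium_percent(UK_MINIMUM_GBP, WASHINGTON_GBP)
kansas_gbp = KANSAS_USD * USD_TO_GBP

print(f"Implied exchange rate: £{USD_TO_GBP:.3f} per $1")
print(f"UK minimum wage over Washington's: {uk_over_washington:.0f}%")
print(f"Kansas minimum wage in sterling: £{kansas_gbp:.2f}")
```

On these inputs the UK premium over Washington comes out at about 36% and the Kansas wage at about £1.33, in line with the rounded figures of 35% and £1.34 given above.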

Benefits: Universally beneficial?

In the US, Whole Foods’ benefits package is a source of great employee motivation. This effect, however, is amplified by the minimal state provision of public amenities in the US. Europe, by contrast, is home to the concept of the “welfare state”. In Phase 2 we asked whether the benefits package could therefore have the same impact in the UK as it does in the US. Whole Foods’ use of “welfare capitalisation” in the US hinges mainly on the provision of medical care. In the UK, however, 90% of citizens rely on the NHS for all their health-care needs.6 This suggests that applying the same benefits in both countries would not prove as effective in the UK.

The solution to this difficulty, however, lies in Whole Foods’ current benefit policy. US employees currently vote periodically on the benefits they most desire from a menu of benefit options. To make this relevant to the needs of UK employees, the principle of voting for benefits could remain while the menu of benefits offered is modified. Modifications could include a larger store discount, a greater focus on pension schemes, or other innovations such as childcare subsidies. The great advantage of the voting system is that it will quickly clarify what UK employees value, thereby avoiding the misuse of resources on schemes without any motivational impact.

Collective Rewards

The collective rewards package is the component most directly linked with performance appraisal. The performance appraisal component of this HR strategy distinguished between the early phase of expansion into the UK (the developmental phase, years 1-3) and the more stable phase that will emerge after approximately the first three years. We recommend that this two-phase approach be carried through to the rewards package. The developmental phase should focus on gaining market share; the emphasis should therefore be on sales rather than profitability targets. Accordingly, team bonuses would be awarded periodically to teams that exceed their targets. The second component of this phase would give employees incentives to take WFM University courses, by awarding training bonuses on completion of courses. This would increase the knowledge base of UK employees as well as fostering the corporate culture.

Moving towards the medium to long term, operations in the UK will not be sustainable without being profitable. Profit-linked rewards, such as gain sharing, should therefore be used. This also serves to integrate the UK stores into the overall Whole Foods organisation. In the medium term, members who display potential leadership ability should receive leadership training so that they can benefit from the promotion opportunities that will become available as Whole Foods expands in the UK.

6  Source: Mercer (2008)

Recommendations
(1) Calculate relatively high base wages that are competitive in the UK setting.
(2) Retain the benefit vote but modify the benefit menu to suit the UK context.
(3) The developmental phase should reward market share increases. In the medium to long term, rewards should be based around a profit focus.

F. Employee Relations

Employee Relations & Unionisation: Does employee-friendly Whole Foods need to be union-friendly, too?

Whole Foods’ current HR strategy in relation to unions is one of union replacement: the company looks to match the benefits of a union through in-house policies. This is not union suppression, which acts to mute the voice of employees.7

Factors Whole Foods needs to consider

i. The difference in common union perception between the US and UK

Ferner et al. give a comparative history of US and UK union activity. In America, union activity was fairly weak. This is attributed to the individualist mindset of American workers and the ideology that the company, not collective action, is the worker’s source of security. On top of this, industrial relations carried negative connotations of workplace violence and disruption of order. The popularity of scientific management left little room for union regulation. In the UK, though, unions and collective bargaining had become established in industrial policy and British public policy. Unions were seen as a voice for employees and were made part of the guidelines in the self-regulated work teams.8 Because unions have more positive connotations in Britain, Whole Foods should pay closer attention to them there.

ii. The difference in US, UK, and EU laws on employee representation

America
The National Labor Relations Act (1935)9 sets the guidelines for American industrial relations and union activity. The act reflects the American mindset of the Great Depression. It seems to provide unions with as many restrictions as it does rights. Most subsequent legislation focuses on limiting union power, including the Labor Management Relations Act of 1947, the Labor-Management Reporting and Disclosure Act of 1959,10 and the Freedom of Information Act of 2000.

7  Source: Badigannavar, V. and J. Kelly. (2005)
8  Source: Ferner, A, et al. (2005)
9  Source: National Labor Relations Act. (1935)

Britain
Statutory Instrument 1999 No. 3323, the Transnational Information and Consultation of Employees Regulations 1999, gave employers three options for improving employee relations: a European Works Council, an information and consultation procedure, or an agreement between employers and employees on consultation. Importantly, employees can opt out of the European Works Council under these regulations. This seems to reflect the sentiment that representation can take any form, which weakens union voice.

iii. Having their public image so entwined with their HR strategy

Along with differing common perceptions of union representation in America and the UK come differing public reactions when representation is forbidden within a company. The anti-union stance Whole Foods takes in the USA could tarnish its public image as an employee-friendly employer. It could also give rise to government action making union agreements legally binding.11

iv. The impact of further growth in Europe

With further growth in Europe (Italy is said to be next) comes a need for greater flexibility on union recognition. Recently, there has been an increase in union action in the low-paying retail sector.12 In Ireland, employees called for unionisation in Aldi. McDonald’s has seen union action in both Italy and France.13

Recommendation - Slightly Augmented Union Replacement Strategy

i. Common Practice

In the UK, both John Lewis and Marks and Spencer pursue union substitution strategies.14 Whole Foods can look to these companies to learn about the cultural importance placed on certain benefits and policies.

ii. Sensitivity to Local Needs

Wal-mart and ASDA
When Wal-mart acquired ASDA in 1999, ASDA had already aligned its human resource strategies with those of Wal-mart. Because of this, most of the work was already done, although some changes in practices were introduced. Most were successful, but strain arose when it was thought that the parent company was wearing away existing working conditions. These differences were viewed as cultural differences.15

10  Source: U.S. Department of Labor. Employee Standards Administration
11  Source: Torrington, D., L. Hall and S. Taylor (2005)
12  Source: Muller-Camen, et al. (2001)
13  Source: Both examples from ibid
14  Source: Muller-Camen, et al. (2001)

Whole Foods
Fresh and Wild of the UK, acquired by Whole Foods, had been imitating the Whole Foods strategy from the beginning.16 Because of this, Wal-mart and ASDA offer a good case study for anticipating future problems. We therefore recommend that Whole Foods keep its union replacement strategy in the initial stages but use human resources to pay close attention to cultural differences. Whole Foods already has the tools in place to listen to its new employees. Voting on benefits will give Whole Foods a better understanding of what is important to British employees, and store autonomy will allow greater flexibility in everyday practices.

Recommendation - Forward-Looking Union Strategy

i. Discussed by Executives Objectively

We also recommend a forward-looking plan of action on union involvement. This plan would be reactive, with executives from both the United States and Britain looking objectively at employee needs and at governmental policies that may leverage power. Drawing on Ferner et al.’s three local forces (powerful actors in the local environment, the subsidiary’s importance to the parent company, and the ideological norms of the area), the company can choose unionism in reaction to these local conditions without changing policies in the United States.17

ii. Looking towards Continental Europe

With the increased threat of unionisation in the retail sector in continental Europe, this plan will be especially important as expansion continues. Its objective would be a rational response to tensions rather than one clouded by non-union emotions. A flexible union strategy in Europe could be the difference between success and failure, since Whole Foods relies so much on word-of-mouth advertising to gain business. The company needs to know how much importance each country places on unions in order to keep both employees and the wider public happy.

Cost
The cost of this strategy will be very low, considering that all senior management will receive some training in America. Whole Foods can take advantage of this time to involve them in the project. We believe the benefits outweigh the costs. Such a plan will have great benefits for expansion into Europe.

15  Source: Fernie, et al. (2006) and Pioch (2007)
16  Source: Gewirtz, L. (2006)
17  Source: Ferner, A, et al. (2005)

[Fig. 4(b). Summary of time-line of implementation]

G. Time Line of Implementation

Ongoing projects that will start at the beginning of the implementation of Whole Foods Market’s HR strategy include the installation of expatriate managers in the company’s UK operations. General training and development, as outlined, along with the pay and benefits package, will also be implemented immediately and will continue as part of the ongoing strategy. This will instil the overall Whole Foods culture whilst allowing a degree of flexibility in response to UK culture. Team bonuses and promotion opportunities will come into operation after the first year of strategy implementation. This will facilitate the development of company norms and will create more financially minded team attitudes. The final stage of implementation will involve a process of impatriation and gain-sharing opportunities. The aim of this final stage is to integrate the Whole Foods operations in the UK fully with US operations.

Overall, the first phase of strategy implementation will focus on development, to establish strong Whole Foods foundations in the UK. The second phase will be a period of flexible stability as UK operations become integrated, settled and perfected. With regard to union strategy, Whole Foods will adopt a union cooperation plan should the need arise. Until then, Whole Foods will continue its modified strategy of union replacement.


CHEMISTRY PANEL

JUDGING PANEL
Prof. Kieran Hodnett (University of Limerick) – Chair
Dr. Kevin M. Ryan (University of Limerick)
Prof. John Cassidy (Dublin Institute of Technology)
Dr. John Colleran (NUI Maynooth)
Dr. Leigh Jones (NUI Galway)

Judges’ comments

The winning project, submitted by Roisin O’Flaherty, addressed the synthesis of glycolipid analogues of α-Galactosyl Ceramide, which are synthetic mimics of bioactive glycolipids found in sea sponges. The bioactive glycolipids have been shown to bind to and activate specific human cells (NKT cells), which regulate the immune system and prevent tumour formation. Two structural analogues of α-GalCer were prepared in this work, with a significant variation in the alkyl chain length, involving complex sequences of organic chemical reactions to prepare and isolate each. The quality of the work is excellent, as is the depth of understanding evident from the report. The as-synthesised compounds are comprehensively characterised using Nuclear Magnetic Resonance Spectroscopy and Mass Spectrometry, with interpretations that are both thorough and concise. Both the structure of the report and the style of writing make for ease of reading, and it is clear that considerable time and effort has been devoted to this work, which results in an important contribution to this field of endeavour.

The second winning essay, submitted by Linda O’Connor, addresses the significant issue of catalytic routes towards the chemical degradation of chemical warfare agents into less harmful substances. The essay reviews the chemical agents in categories according to their effect on humans and, for each agent, explores the possible routes to detoxification. The essay is comprehensive in both the range of chemical agents studied and the methods of their destruction, and references all the important contributions in this field. Significant recent developments in the area, such as the effects of the counter-chemicals on the environment, are an important inclusion. While this essay is directed at readers with knowledge of chemistry, the discussion is well written and will be of general interest to all.


Chemistry

Catalytic methods for the destruction of chemical warfare agents under ambient conditions
Linda O’Connor

Introduction

Modern chemical warfare was first rolled out during World War I, where the French are credited with the first usage.1 It was, however, the Germans who really made chemical warfare a reality during the Great War. From there, chemical and technological advances brought us to chemical (and indeed biological) warfare as it exists in modern-day war. The Cold War between the United States of America and the then USSR was the closest the world came to all-out nuclear conflict. More recently, the Iran-Iraq war of the 1980s saw the use of mustard gas on a large scale. Attack using chemicals is not confined to war; many terrorist attacks have used chemical warfare agents, including the attack on the Japanese underground on 20 March 1995.2 The United Nations Convention on the Prohibition of the Development, Production, Stockpiling and Use of Chemical Weapons and on their Destruction is the most recent treaty signed and ratified in the fight against the use of chemical weapons.3

Chemical warfare agents can be classified according to their effect on humans. They include blister, nerve, choking, blood, vomiting, tear and incapacitating agents. The most significant of these groups in terms of past use and military capacity are the nerve and blister agents. Nerve agents react irreversibly with cholinesterase, which results in acetylcholine accumulation, continual stimulation of the body’s nervous system and eventual death. Examples of nerve agents include the G-Agents Sarin (GB), Tabun (GA) and Soman (GD), and the V-Agent VX. Blister agents affect the lungs and eyes and cause blistering of the skin. Examples include Sulphur Mustard (HD) and Nitrogen Mustard (HN).4

In the course of this essay, I will look at the catalytic destruction of chemical warfare agents. I will examine their chemistry and how they can be broken down into less harmful substances by catalytic chemical means.

[Fig. 1. Some possible detoxifying reactions of sulphur mustard.6 Reaction arrows in the scheme are labelled H2O, -HCl and 1/2 O2.]

Blister Agents

Sulphur Mustard (HD)

Sulphur Mustard, bis-2-chloroethyl sulphide, is a very potent blister agent and has been documented as an antimitotic, mutagenic, carcinogenic, teratogenic and cytotoxic agent. Skin, eyes and the respiratory tract are its primary target organs. In its pure state, sulphur mustard is a colourless oily liquid, but the industrial product is yellow to dark brown and has a characteristic sweetish smell.5

Sulphur mustard can be detoxified by dehydrohalogenation to form the chloroethyl vinyl sulphide, by nucleophilic attack to displace an activated aliphatic halogen, or by oxidation.6 Smith details in his review article how HD may be detoxified by partial oxidation to the sulphoxide (HD-O), but further oxidation to the sulphone (HD-O2) is not welcome since the sulphone of HD is also a vesicant, i.e. it causes blisters. Hydrolysis of HD is another potential method of detoxification. This, however, presents a number of problems. HD has a very low solubility in water, and droplets of the agent dispersed in water tend to form a polymerised crust, restricting the effectiveness of hydrolytic catalysis.7

As suggested by Smith, the ideal manner of detoxifying HD would be selective oxidation to the sulphoxide, HD-O. This would allow for a simple and rapid response to an HD outbreak, and could even be applied as a prophylactic surface treatment for critical items in order to make them immune to the threat of HD contamination.6

Work carried out by Noradoun and Cheng at the University of Idaho looked at the degradation of EDTA by oxygen activation using a zero-valent iron/air/water system. They developed an iron-based oxidation catalyst that used zero-valent iron in combination with EDTA and air to initiate a radical-based Fe3+/Fe2+ redox cycle.8 Whilst it was developed to improve the effectiveness of wastewater treatment plants in decontaminating waste products in their water systems, this aqueous oxidation chemistry could be applied to detoxify HD, and indeed nerve agents.6

A well-known process in chemical degradation is oxidation using peroxides. Although powerful oxidants such as hypochlorite and peroxyacids (e.g. m-chloroperoxybenzoic acid) effect rapid oxidation of HD, they are rather non-selective, simultaneously producing both sulphoxide and sulphone. The milder oxidant hydrogen peroxide selectively yields the sulphoxide, but the reaction is too slow for the purpose of immediate decontamination.9 Wagner and Yang also recently showed that peroxide activators such as molybdate and bicarbonate effect the rapid oxidation of sulphides and HD catalytically.

[Fig. 2. Bicarbonate and molybdate effect the oxidation of sulphides and HD catalytically. Species labelled in the scheme: H2O, H2O2, HCO4-, HCO3-, and HD with its sulphoxide.]

Another process for the detoxification of sulphur mustard on which considerable effort has been spent is photo-oxidation.6 However, a considerable limiting factor in such a process is the need for light, which does not easily allow for “ambient conditions.”

Dehydrohalogenation of HD using calcium oxide (CaO) was performed by Wagner et al. using AP-CaO and CaO. On partially hydrated AP-CaO, a rather fast steady-state elimination of HCl occurs after a short induction period. This behaviour is attributed to acid-catalysed surface reconstruction (to regenerate fresh surface) and the formation of CaCl2, which is known to be more reactive than CaO.10 According to Smith, this reaction is not considered truly catalytic in nature, because an excess of CaO was used and islands of CaCl2 (which is a known catalyst for such dehydrohalogenation reactions) were likely formed during the reaction.6
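The selectivity problem described above can be summarised as two consecutive oxidation steps. The scheme below is a schematic sketch only: [O] stands for a generic oxidant (an assumption made for illustration, not a specific reagent from the cited work), and the formulae follow from the name bis-2-chloroethyl sulphide.

```latex
\begin{align*}
\mathrm{(ClCH_2CH_2)_2S} + [\mathrm{O}] &\longrightarrow \mathrm{(ClCH_2CH_2)_2S{=}O}
  && \text{HD-O, sulphoxide (desired product)}\\
\mathrm{(ClCH_2CH_2)_2S{=}O} + [\mathrm{O}] &\longrightarrow \mathrm{(ClCH_2CH_2)_2SO_2}
  && \text{HD-O}_2\text{, sulphone (still a vesicant)}
\end{align*}
```

A useful decontaminant therefore has to make the first step fast while leaving the second step slow or blocked.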

Nerve Agents

G-Agents

The G-class of nerve agents are organophosphorous compounds that exert their neurotoxic effects by inhibiting acetylcholinesterase enzymes. They have relatively high vapour pressures, are moderately soluble in water and hydrolyse in water with half-lives on the order of half a day.7 Because of this, G-Agents are considered less of a technical challenge to decontaminate than less volatile, less soluble, less labile agents.6

Hydrolysis of both Sarin (GB) and Soman (GD) is possible, as both are soluble in water, and has been reported under acidic, basic and neutral conditions.11 Several enzymes have been shown to accelerate this process (enzymatic hydrolysis).6 According to Raushel, microbial enzymes have been identified that are able to catalyse efficiently the hydrolysis of organophosphate nerve agents, including Sarin and Soman. The enzyme phosphotriesterase (PTE) was developed and used to catalyse the hydrolysis of G-agents. Figure 3 shows a working model for the reaction mechanism of organophosphate triester hydrolysis.12 In this mechanism, the organophosphate binds to the binuclear metal centre within the active site via coordination of the phosphoryl oxygen to the β-metal ion (more solvent exposed in this model). This interaction weakens the binding of the bridging hydroxide to the β-metal (as evidenced by the longer oxygen–metal distance in the complex relative to the unbound state). The metal-oxygen interaction polarises the phosphoryl oxygen bond and makes the phosphorus centre more electrophilic. Nucleophilic attack by the bound hydroxide is assisted by proton abstraction from Asp301. As the hydroxide attacks the phosphorus centre, the bond to the leaving group weakens, although 18O isotope effects and Brønsted analyses support the notion that the transition state is late. His354 may facilitate the transfer of a proton from the active site to the bulk solvent.

[Fig. 3. Working model for the hydrolysis of organophosphate nerve agents by phosphotriesterase (PTE).12]

Another protein, organophosphate acid anhydrolase (OPAA), has also been developed to catalyse the hydrolysis of the organophosphate triesters, including Soman and Sarin, but not VX.12 The same parameters were used as for PTE, and the results showed that the enzyme displayed similar stereoselectivity, but the overall rate of hydrolysis was significantly reduced relative to PTE.

Brajesh, in his paper on microbial degradation of organophosphorous compounds, deals with a number of G-agents. Tabun (GA) is subject to hydrolysis, and the first step in this process, under neutral and acidic conditions, includes formation of O-ethyl N,N-dimethyl amidophosphoric acid and hydrogen cyanide. This first step is rapid. The subsequent hydrolytic step, which is comparatively slow, is hydrolysis of O-ethyl N,N-dimethyl amidophosphoric acid to dimethylphosphoramidate and then finally to phosphoric acid. Under acidic conditions, hydrolysis to ethylphosphorylcyanide and dimethylamine occurs. The final product of all pathways is phosphoric acid.13

Sarin (GB or isopropyl methylphosphonofluoridate) and Soman (GD or pinacolyl methylphosphonofluoridate) can also be detoxified using microbial routes.
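To give a feel for what an uncatalysed hydrolysis half-life “on the order of half a day” means in practice, the sketch below computes the fraction of agent remaining over time. It rests on two loudly stated assumptions that are not in the source: that hydrolysis follows simple first-order kinetics, and that the half-life is exactly 12 hours. The function name `fraction_remaining` is hypothetical.

```python
import math


def fraction_remaining(hours: float, half_life_hours: float = 12.0) -> float:
    """Fraction of agent left after `hours`, assuming first-order decay.

    The 12-hour default is an illustrative assumption standing in for
    "on the order of half a day"; it is not a measured value.
    """
    rate_constant = math.log(2) / half_life_hours  # k = ln 2 / t_half
    return math.exp(-rate_constant * hours)


for hours in (12, 24, 48, 96):
    print(f"{hours:>3} h: {fraction_remaining(hours):.4f} of the agent remains")
```

Under these assumptions a quarter of the agent is still present after a full day, which helps explain the interest in enzymes that accelerate the hydrolysis.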


Fig. 4. Possible pathways for the degradation of GA.13



Fig. 4. Microbial degradation pathway for GB and GD. PMPA: pinacolylmethyl phosphonic acid.13

GD is an intermediate between GA and GB. It is less water soluble and more lipid soluble than the other two G agents, which results in more rapid skin penetration and greater toxicity. The major metabolites identified for GB degradation are isopropylmethylphosphonic acid (IMPA) and methyl phosphonic acid (MPA).

Another organophosphorous G-agent used in chemical warfare is cyclosarin (GF or O-cyclohexyl methylphosphonofluoridate). As already stated, enzymatic hydrolysis is one of a number of ways of accelerating the rate of hydrolysis. In a study carried out by Harvey et al., the stereospecificity of the catalysis of cyclosarin was investigated. They discovered that organophosphorous acid anhydrolase (OPAA) and the wild-type phosphotriesterase (PTE) enzymes both preferentially catalyse the hydrolysis of the (+)GF isomer.14 Previous studies, discussed above, have also shown that these two enzymes catalyse detoxification reactions of organophosphorous G-agents.12

There are, however, limitations to the enzymatic hydrolysis of G-agents. As stated previously, all hydrolysis reactions of organophosphorous G-agents ultimately end in an acidic by-product. This causes the pH of the reaction mixture to drop below 6, which in turn terminates the ability of the enzyme to react. Using hydrolysis catalysts to decontaminate comparatively large quantities of an agent therefore requires a buffer to maintain the pH at an optimum level.6 Typically this buffering capacity could be provided by acid-base pairs. It can also be generated by a competing enzymatic reaction, such as the formation of ammonia from urea by urease, to neutralise the acidic organophosphorous hydrolase (OPH) hydrolysis products on demand.15 The formation of ammonia from urea using urease typically has a pH of 6.5, and the enzyme OPH has maximum activity at pH 8.5. Russel et al. found that the competing reactions stabilised the pH until one of the reagents was almost completely consumed and that, by changing the relative concentrations of the two enzymes, a predicted pH was achieved and maintained without the use of classical acid-base buffers.

Another limitation of enzymatic hydrolysis concerns the mass transport of the chemical warfare agent to the active enzyme. Active enzymes are typically confined to the intracellular matrix, regardless of whether the enzymes are naturally occurring or engineered into the cultured organism.6

Non-enzymatic hydrolysis using iodosylcarboxylates to encourage catalytic hydrolysis of G-agents and other organophosphates has also been studied in depth by Morales-Rojas and Moss at The State University of New Jersey. It is known that hypervalent iodine complexes can be used as nucleophiles for the cleavage of reactive organophosphorous substrates such as p-nitrophenyl diphenyl phosphate (PNPDPP) when solubilised in aqueous micellar solutions of cetyltrimethylammonium chloride (CTACl). Morales-Rojas and Moss applied this knowledge to improve the reactivity of o-iodosyl- and o-iodylcarboxylate derivatives for the degradation of organophosphorous substrates.16

V-Agent

Another nerve agent, VX (O-ethyl-S [2-(di-isopropylamino) ethyl] methylphosphonothioate), presents many more problems than the other chemical agents because it is much less labile than the G-agents. It is a persistent, odourless, amber-coloured liquid. Uncatalysed hydrolysis of VX does not occur at useful rates in pH-neutral solutions, i.e. ambient conditions. Furthermore, one of the possible hydrolysis products (EA-2192) is both much less reactive toward further nucleophilic attack and nearly as toxic as VX itself.6

[Fig. 5. Mechanism of iodosylcarboxylate hydrolysis of p-nitrophenyl phosphate.6]

According to Munro et al., VX undergoes water and hydroxyl ion-catalysed hydrolysis, but is not subject to acid-catalysed hydrolysis. Hydrolysis of VX proceeds by numerous pathways and produces a number of degradation products. At pH values of < 6 and > 10, cleavage of the P-S bond predominates, resulting in formation of ethyl methylphosphonic acid (EMPA) and diisopropyl ethyl mercaptoamine (DESH). The latter compound can be oxidised to bis (2-diisopropylaminoethyl) disulfide (EA 4196) or react with the diisopropyl ethyleneimmonium ion, (CH2)2N+(C3H7)2, to form bis (2-diisopropylaminoethyl) sulphide. At neutral and alkaline pH values (7-10), the first pathway competes with de-alkylation of the ethoxy group (cleavage of the C-O bond), the latter pathway yielding the environmentally stable EA 2192 and ethanol.7 Work carried out by Wagner et al. came up with similar results.10

[Fig. 6. Primary hydrolysis pathways of VX in the environment.7 At pH < 6 or > 10: VX + H2O gives diisopropyl ethyl mercaptoamine and ethyl methylphosphonic acid. At pH 7-10: VX + H2O gives S-(2-diisopropylaminoethyl) methyl phosphonothioate (EA 2192) and ethanol.]

Noradoun and colleagues at the University of Idaho used the insecticide malathion as a chemical analogue for VX. They carried out their study using oxygen activation at room temperature and pressure. Their proposed degradation scheme exhibits the desirable characteristics of green oxidation, i.e. environmentally innocuous reagents, solvents and products under mild reaction conditions.17

As is known, activation of molecular oxygen is of importance for catalysis. Monooxygenase systems found in nature that are capable of efficiently oxidising organic molecules using activated O2 under near-RTP conditions include cytochrome P450 and methane monooxygenase (MMO), both of which contain active iron centres. Cytochrome P450 enzymes require reducing equivalents to activate molecular oxygen to a state formally equivalent to that of H2O2 during the initial oxidation process. Therefore, peroxides have often been substituted for the reductive activation of O2 in abiotic studies mimicking cytochrome P450 oxidation. Biological and abiotic systems participating in the partial reduction of molecular oxygen create reactive oxygen species that may consist of superoxide ions and/or hydrogen peroxide. An extremely reactive oxygen-containing species, the hydroxyl radical (•OH), is a product of the Fenton reaction (see equation 1 below), which is the reduction of H2O2 by a suitable iron centre.17

Equation 1: $\mathrm{Fe^{II} + H_2O_2 \rightarrow Fe^{III} + OH^- + OH^{\bullet}}$

Noradoun and her group had previously examined the



Fig. 7. Structures of VX, Malathion and Malaoxon showing similarities in phosphorus moiety.17

use of zero valent iron, EDTA and air (ZEA) to create radical species in situ. The ZEA system is capable of deep oxidation even under mild reaction conditions. They also discovered that the ZEA reaction was capable of degrading chlorinated phenols to produce low molecular weight carboxylates.18 The ZEA system has advantages over other systems that have been investigated for the detoxification of organophosphorus compounds, such as hydrolysis, palladium-based catalysis, UV-induced photolysis, and chemical and enzyme-assisted oxidation. These include milder reaction conditions, inexpensive reagents, no precious metal catalysts and no need for special pressurized reactors. Additionally, the ZEA reaction proceeds at room temperature, under one atmosphere and in aqueous solution.17 The group then applied this knowledge to the degradation of malathion, chosen because of its similarity to VX in the phosphorus moiety. Malathion (S-1,2-bis(ethoxycarbonyl)ethyl O,O-dimethyl phosphorodithioate) has a relatively low toxicity. In comparison, malaoxon (S-1,2-di(ethoxycarbonyl) ethyl O,O-dimethyl thiophosphate) is far more toxic because it binds more efficiently to the enzyme acetylcholinesterase (AChE), inhibiting its control over the central nervous system. In this regard it is similar to organophosphorus nerve agents. The proposed ZEA system is capable of oxidizing both malathion and malaoxon to low molecular weight acids. This is a strong indication that the ZEA system can be used in the detoxification of organophosphorus nerve agents such as VX.17 Upon completion of the study, Noradoun et al found that the ZEA system was able to degrade both malathion and malaoxon. This is particularly significant because both share structural features with VX as shown above.
The diagram below shows the proposed degradation scheme for malathion, with the two non-polar intermediates, malaoxon and DES, and the final reaction products, low molecular weight acids. Iminodiacetic acid has previously been identified as a degradation product of EDTA and has therefore been left out of the proposed scheme.


[Fig. 8 scheme: malathion and malaoxon oxidised to low molecular weight carboxylic acids, CO2, HCO3- and C2O42-.]

Fig. 8. Malathion degradation scheme showing harsh oxidation of malathion and malaoxon to low molecular weight acids.17

The phosphorus–carbon and sulphur–carbon bonds of malathion are cleaved during the oxidation process, leading to 17% recovery of the sulphur as sulphate and 4.5% recovery of the phosphorus as phosphate after 24 hours, as determined by ion chromatography. Control studies showed no loss of product through adsorption onto the iron surface during the course of the reaction. According to Noradoun, the ZEA reaction has advantages over present oxidation technologies. Methods based on chemical oxidants such as bleach or H2O2 are attractive, though both have limitations such as long-term storage requirements or possible safety hazards. For bleach-based oxidations, reaction conditions must be properly maintained, e.g. VX destruction by bleach requires a sizable excess of HOCl/OCl– to achieve complete chemical oxidation. Furthermore, the use of copious amounts of chlorine has provoked environmental concerns over the carcinogenic chlorinated organic compounds that can be produced, such as those produced in paper production. Methods using H2O2 in conjunction with peroxide activators such as bicarbonate and molybdate have been looked at as a greener alternative. Although the products are less toxic, their release may still pose environmental concern. The downsides to using large excesses of highly concentrated H2O2 or bleach are the difficulties in transportation, long-term storage and operator safety, as well as limited shelf life. The ZEA system would only require the storage of zero valent iron (ZVI) particles and EDTA, both of which have good stability. ZEA degradation of malathion has shown the system to be capable of degrading the phosphorus–sulphur groups. Previous studies have shown that the ZEA system is capable of degrading organics down to carbonates and simple carboxylates.18 Relative to other systems, another outstanding feature of the ZEA system is the use of mild reaction conditions, i.e. room temperature and atmospheric pressure. This characteristic, combined with inexpensive and stable reagents, establishes the ZEA system as a strong candidate for a field-portable organophosphorus remediation system.17

Conclusion

Chemical warfare has been in existence since World War I, but in recent times it has become a more real threat owing to the increase in terrorism across the globe. Detoxification of chemical warfare agents is therefore of vital importance. Catalytic destruction of these agents under ambient conditions is of particular significance because, should an attack occur, the problem must be dealt with as quickly as possible in order to minimise the destruction caused. Also, as one cannot predict when or where a chemical attack will happen, it is key that the detoxification process works efficiently under ambient conditions. In the journal articles that I read during the course of this essay, a recurring theme with regard to catalytic destruction of the nerve agents was the use of enzymes (biological catalysts) for detoxification. Another issue that arose was the environmental aspect of the destruction of the agents. With climate change such a buzzword amongst scientists (and indeed everyone) today, protection of the environment is of the utmost importance when carrying out any kind of experimental scientific research. Noradoun and her colleagues carried out a number of studies into the destruction of chemical warfare agents and, in particular in their work on the destruction of malathion (an analogue for VX), kept both the environmental aspect and the need for ambient conditions to the forefront of their research.



References

1. Duffy, M. 2009; Vol. 2009.
2. Okumura, T.; Takasu, N.; Ishimatsu, S.; Miyanoki, S.; Mitsuhashi, A.; Kumada, K.; Tanaka, K.; Hinohara, S. Annals of Emergency Medicine 1996, 28, 129-135.
3. Nations, U. 2008; Vol. 2009.
4. D’Agostino, P. A.; Chenier, C. L. Analysis of Chemical Warfare Agents: General Overview, LC-MS Review, In-House LC-ESI-MS Methods and Open Literature Bibliography, Defence R&D Canada, 2006.
5. Swamy, R. V.; Sugendran, K.; Ganesan, K.; Malhotra, R. C. Defence Science Journal 1999, 49, 117-121.
6. Smith, B. M. Chemical Society Reviews 2008, 37, 470-478.
7. Munro, N. B.; Talmage, S. S.; Griffin, G. D.; Waters, L. C.; Watson, A. P.; King, J. F.; Hauschild, V. Environmental Health Perspectives 1999, 107, 933-974.
8. Noradoun, C. E.; Cheng, I. F. Environmental Science & Technology 2005, 39, 7158-7163.
9. Wagner, G. W.; Yang, Y.-C. Industrial & Engineering Chemistry Research 2002, 41, 1925-1928.
10. Wagner, G. W.; Koper, O. B.; Lucas, E.; Decker, S.; Klabunde, K. J. The Journal of Physical Chemistry B 2000, 104, 5118-5123.
11. Yang, Y. C.; Baker, J. A.; Ward, J. R. Chemical Reviews 1992, 92, 1729-1743.
12. Raushel, F. M. Current Opinion in Microbiology 2002, 5, 288-295.
13. Singh, B. K.; Walker, A. FEMS Microbiology Reviews 2006, 30, 428-471.
14. Harvey, S. P.; Kolakowski, J. E.; Cheng, T.-C.; Rastogi, V. K.; Reiff, L. P.; DeFrank, J. J.; Raushel, F. M.; Hill, C. Enzyme and Microbial Technology 2005, 37, 547-555.
15. Russell, A. J.; Erbeldinger, M.; DeFrank, J. J.; Kaar, J.; Drevon, G. Biotechnology and Bioengineering 2002, 77, 352-357.
16. Morales-Rojas, H.; Moss, R. A. Chemical Reviews 2002, 102, 2497-2522.
17. Noradoun, C. E.; Mekmaysy, C. S.; Hutcheson, R. M.; Cheng, I. F. Green Chemistry 2005, 7, 426-430.
18. Noradoun, C.; Engelmann, M. D.; McLaughlin, M.; Hutcheson, R.; Breen, K.; Paszczynski, A.; Cheng, I. F. Industrial & Engineering Chemistry Research 2003, 42, 5024-5030.



Chemistry

Sea Sponges cure Cancer? Synthesis of novel Glycolipid Analogues of α-Galactosyl Ceramide

Roisin O’Flaherty

1. Abstract

Isolated from a marine sponge, Agelas mauritianus, α-Galactosyl Ceramide (α-GalCer) is a bioactive glycolipid that has therapeutic effects against bacteria, viruses, parasites and autoimmune diseases.1 Two α-GalCer analogues, 2 and 6, were successfully synthesised in the laboratory in 2 and 4 steps, with yields of 24% and 10% respectively. A glycosyl halide 1 was prepared with excellent α-anomeric selectivity by reaction of 33% HBr in HAc with 1,2,3,4,6-penta-O-acetyl-galactopyranose, a relatively cheap starting material for glycosylation reactions. The glycoside 2 was then synthesised by reacting the glycosyl halide 1 with a long chain alcohol, 1-docosanol. A peptide unit 3 was prepared by reacting Boc-L-Serine with 1-tetradecylamine and NEt3, using the peptide coupling reagent TBTU in the presence of the racemisation suppressant HOBt. A hemiacetal sugar 4 was formed by removing the acetyl group at the anomeric carbon of 1,2,3,4,6-penta-O-acetyl-galactopyranose with benzylamine at 50°C, using THF as a solvent. This product 4 was then reacted with trichloroacetonitrile in the presence of sodium hydride, using CH2Cl2 as a solvent, to yield a reactive imidate product 5. Compounds 3 and 5 were then reacted with 0.04 N TMSOTf in CH2Cl2 at 0°C to give the α-GalCer analogue 6. Structural elucidation was carried out on compounds 1, 2, 3, 4, 5 and 6. Two glycolipid compounds were therefore synthesised, one containing a long lipidic chain and one containing a medium lipidic chain, that could have potential therapeutic applications against disease and cancer.

2. Introduction & literature review

2.1. Biology

CD1 molecules are β2-microglobulin associated antigen-presenting proteins.2 The ability of CD1 proteins to perform their role is most likely due to their ability to act as lipid binding proteins, which trap hydrophobic alkyl chains within a deep hydrophobic pocket in the protein. β2-Microglobulin is a component of MHC (major histocompatibility complex) class I molecules, which are one of two classes of MHC molecules. The MHC is a large gene family found in most vertebrates and plays important roles in the immune system and autoimmunity. A specific subset of CD1 molecules are CD1d molecules. The primary function of these CD1d proteins is to recognize and bind glycolipid antigens through lipid-protein interactions with receptors on Natural Killer T (NKT) cells.2, 3 NKT cells are a subset of T cells combining properties of NK cells and T cells. These NKT cells produce cytokines such as interferon-γ (IFN-γ) and interleukin-4 (IL-4), which result in Th1 and Th2 immune responses, respectively.4 The role of cytokines is to signal immune cells such as NKT cells to travel to the site of infection, autoimmune disease or cancer. Th1 and Th2 cells are T helper cells, a subset of lymphocytes, which are a type of white blood cell that also aids in fighting disease. Several studies have demonstrated the importance of NKT cells in immunoregulation, tumour immunity, and the prevention of autoimmune diseases in mice.5 It has also been suggested that activation of NKT cells induces secondary immune effects, including the activation of T cells and NK cells, although this hypothesis remains to be confirmed.6 These secondary effects could be critical in fighting diseases and cancers. Bioactive glycolipids can be recognised and bound by CD1d-restricted NKT cells.
A specific CD1d-restricted lymphoid subset is the human Vα24+Vβ11+ NKT cell, which is potently activated by α-Galactosyl Ceramide (α-GalCer) presented by CD1d on antigen-presenting cells.6, 7

2.2 The importance of α-Galactosyl Ceramide

A family of ceramide-like glycolipids was first isolated from a marine sponge, Agelas mauritianus.8 From these, α-GalCer was chosen to undergo clinical trials in the search for a compound that could inhibit liver metastases in mice.2, 9 The liver is rich in NKT cells and CD1d molecules, which enables α-GalCer to aid the attack on diseases and cancer.9 The structure of α-GalCer is


[Structure drawing of α-GalCer: galactose head group with C25H51 and C14H29 chain labels.]

Fig. 2.1. Structure of immunostimulant α-GalCer.

shown in Figure 2.1. These compounds are only known to occur naturally in marine sponges but can be synthesised in the laboratory. A plausible reason for this is that they contain an α-anomeric linkage rather than a β-anomeric linkage. This distinguishes them from the ceramides that commonly occur in mammalian tissues, which contain β-anomeric sugars.2 The glycolipid α-GalCer is a potent immuno-regulator and has been shown to have therapeutic effects against cancer and autoimmune diseases including multiple sclerosis, alopecia areata, Crohn’s disease, rheumatoid arthritis and systemic lupus erythematosus (SLE).10, 11, 24 α-GalCer can have different effects on diseased cells. With certain types of disease, the injection of α-GalCer can boost the adaptive immunity to the infection and promote healing. In other cases, such as autoimmune diseases, α-GalCer activates NKT cells to suppress tissue disruption and lessen the degree of disease. α-GalCer would be expected to be more effective in the organs in which NKT cells primarily reside, including the liver (as outlined previously), spleen, and other lymphoid organs. Many synthetic routes for creating structural analogues of α-GalCer have been investigated. It has been shown that these analogues differ from α-GalCer in both activity and efficiency. Alterations in the structure of α-GalCer have been shown to have several effects: a change in binding ability, a change in activity/potency, a change in the type of cell that presents the compound efficiently, or a change in the outcome of NKT cell activation in terms of the types of cytokines produced. These factors could influence the biological impact against autoimmune diseases or cancer. We will investigate some effects in relation to alterations in the lipid tail, the carbohydrate head and the hydrophilic spacer group. Firstly, alterations in the lipid unit of α-GalCer are investigated.
There seems to be conflicting evidence in the literature about whether variations in the lipid chain can influence the binding of α-GalCer analogues to the surface glycoproteins on T-cell membrane surfaces. LaBell and Jackobson provided evidence that a long hydrocarbon chain is necessary to influence the binding of α-GalCer to an HIV cell surface glycoprotein, gp120.12 In contrast, Silberger and Bhat showed that variations in the lipid length had little effect on the binding efficiencies of α-GalCer analogues.13 The length of the lipid chain is a crucial factor in the activity of α-GalCer and its structural analogues with varying lengths, as shown by Colombo et al and Cateni et al.14, 15 The issue regarding the saturation of the lipid tail has not been fully investigated. Although the effect of saturation is yet to be confirmed, the synthetic design allows for the synthesis of both saturated and unsaturated compounds. However, in this project only saturated lipid chains are investigated. Secondly, alterations in the carbohydrate head are investigated. The carbohydrate group is indeed necessary in the production of synthetic analogues of α-GalCer. LaBell and Jackobson outlined the need for a carbohydrate head in the binding efficiencies of these compounds.12 Zing et al investigated attaching a sulfatide group to the C-3 position in the sugar moiety of α-GalCer.16 No significant effects on NKT cell stimulation were reported. Various papers discuss the synthesis of α-GalCer analogues using protecting groups such as acetoxy groups in place of the hydroxyl groups on the sugar moiety.2, 15 However, when the biological properties of these compounds, such as activity and binding efficiencies, are analysed, they are usually deprotected. The protected α-GalCer analogues 2 and 6 synthesised here, which differ in chain length, are shown in Figure 2.2. Alterations in the hydrophilic spacer group are described in many papers but detailed analysis of their effects has yet to be carried out.

Fig. 2.2. Structure of glycolipid analogues of α-GalCer 2, 6.

2.3 Reaction Involving the Sugar Moiety

2.3.1. SN2 Reactions

The method of synthesis of the glycolipid 6 follows an SN2 reaction. The SN2 mechanism was first described by Edward David Hughes and Sir Christopher Ingold in the 1930s. It is a type of nucleophilic substitution in which a lone pair on a nucleophile attacks an electrophilic site, expelling a leaving group in just one step, as in


$\mathrm{Nu^- + R{-}X \rightarrow Nu{-}R + X^-}$

Scheme 2.1. Reaction mechanism of the SN2 reaction, where R = alkyl group and X = leaving group.

Scheme 2.1. In an SN2 mechanism, the rate of the reaction depends on the concentrations of both the substrate and the nucleophile. It is therefore a second-order reaction overall, and its rate also depends on the carbon skeleton and the leaving group.
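For reference, the kinetics of a one-step bimolecular substitution can be written as a rate law. The expression below is the standard textbook form rather than anything derived in this project; the rate constant $k$ and the bracketed concentration symbols are introduced here purely for illustration.

```latex
\text{rate} = -\frac{d[\mathrm{R{-}X}]}{dt} = k\,[\mathrm{Nu^-}]\,[\mathrm{R{-}X}]
```

Because both concentrations appear to the first power, doubling either the nucleophile or the substrate concentration would be expected to double the rate, which is the sense in which the reaction is second order overall.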

2.3.2. Koenigs-Knorr Reaction

The Koenigs-Knorr reaction is a well known glycosylation reaction that utilises the SN2 reaction outlined in Section 2.3.1. It is named after Wilhelm Koenigs and Edward Knorr. In 1893 Emil Fischer published the first preparation of alkyl glucosides as anomeric mixtures.17 These showed mainly α-anomeric selectivity. Shortly afterwards, Koenigs and Knorr developed a procedure giving mainly β-anomeric selectivity.18 In general this reaction involves the use of glycosyl halides, such as the compound 1 synthesised in this project, as glycosyl donors. However, modified Koenigs-Knorr reactions can be performed with alternative promoters. Kunz and Harreus outlined promoters suitable for this reaction.19 The imidate product 5 in this project is another example of a glycosyl donor used in a modified Koenigs-Knorr reaction.

2.4 Activation and coupling for Peptide Bond Formation

Peptide coupling reactions have advanced significantly owing to the development of new peptide coupling reagents in organic synthesis. Activation involves the attachment of a leaving group to increase reactivity for a subsequent reaction. Activation of the carbonyl group of a carboxylic acid is essential in peptide bond formation to form an amide. This is because amines form salts when reacted with a carboxylic acid, so very high temperatures are needed to form the subsequent amides, which could potentially decompose the sample.20 Therefore we shall discuss various amide coupling reagents.

2.4.1. Carbodiimide Reagents

Carbodiimide reagents have been used for many years due to their moderate activity and reasonable price. DCC (N,N’-dicyclohexylcarbodiimide) is one of the most


Fig. 2.3. Structure of DCC, DIC, EDCI.

well known carbodiimide reagents and was first reported by Sheehan in 1955.21 It is particularly useful in synthesizing active esters and symmetrical anhydrides. The by-product of DCC, however, is quite insoluble in many organic solvents. Since the by-product of DIC (1,3-diisopropylcarbodiimide) is readily soluble in chloroform, it is a more attractive reagent. Another commonly used carbodiimide is EDCI (1-(3-(dimethylamino)propyl)-3-ethylcarbodiimide hydrochloride). The structures of DCC, DIC and EDCI are shown in Figure 2.3. The use of carbodiimide reagents in peptide synthesis has increased with the use of various additives such as HOBt (1-hydroxybenzotriazole) and HOAt (1-hydroxy-7-azabenzotriazole). These additives have aided in reducing racemisation of products and increasing reaction rates.20

2.4.2. Phosphonium and Uronium reagents

The most famous phosphonium reagent is BOP (benzotriazol-1-yloxytris(dimethylamino)phosphonium hexafluorophosphate), shown in Figure 2.4, which was introduced in 1975. It was the first serious alternative to carbodiimide reagents. However, there are serious disadvantages in using this coupling reagent. There is no selectivity in racemisation and a co-product formed in the reaction is toxic.21 Other phosphonium salts include PyCloP (chlorotri(pyrrolidino)phosphonium hexafluorophosphate), PyBroP (bromotri(pyrrolidino)phosphonium hexafluorophosphate), and PyBOP (benzotriazol-1-yloxytri(pyrrolidino)phosphonium hexafluorophosphate). Due to the problems associated with BOP, the use of uronium reagents such as TBTU and HBTU has increased significantly. HBTU was first introduced in 1978 by Gross.22 TBTU and HBTU work similarly and


Figure 2.4. Structure of BOP

both have ambiguity regarding their structure. These coupling reagents cannot strictly be called uronium reagents because in the crystalline state they are actually guanidinium derivatives.21 At present, these compounds are considered to be some of the most effective amide coupling reagents available; however, they are

Figure 2.5. Structures of TBTU and HBTU respectively.


quite expensive. The structures of these compounds are shown below in Figure 2.5.

2.4.3 Acid halogenating reagents

A paper in 1903 by Fischer first demonstrated the use of acid halides in peptide synthesis.21 In the case of extremely hindered amino acids, acid halides are ideal peptide coupling reagents. Their high reactivity also complements their use in synthesis. However, a drawback is that racemic peptide products are formed. One commonly used acid halogenating reagent is oxalyl chloride, shown in Figure 2.6.

2.4.4 Racemisation suppressants

The most commonly used racemisation suppressant in peptide synthesis is HOBt. As illustrated in Figure 2.7, a certain level of hydrate is present in this compound. It was first reported to act as an additive in peptide synthesis in 1970 by Koenig and Geiger.21 As outlined earlier in Section 2.4.1, such additives act as racemisation suppressants and reactivity enhancers in peptide reactions. Other common examples not outlined above are HODhbt (3,4-dihydro-3-hydroxy-4-oxo-1,2,3-benzotriazine) and N-hydroxytetrazole.

2.5 Overall aims of the project

Given the potent activity of α-GalCer and its promising therapeutic effects on autoimmune diseases and cancer, there is obvious interest in creating structural analogues. On the basis of the evidence outlined above, the overall aim was to prepare structural analogues of α-GalCer with varying lipid lengths (from C14 to C22) using coupling reagents. In this paper, the synthesis of analogues 2 and 6 is described. The glycolipidic compounds also have a potential application as nutraceutical compounds due to the presence of lipid groups.23

3. Experimental

3.1. General Procedures

3.1.1. Reagents

Reagents used were of AR grade and all solvents for synthesis, extraction and column chromatography were distilled and dried before use, if necessary.

3.1.2. Equipment

1H NMR and 13C NMR spectra were recorded with a Bruker Avance 300 MHz spectrometer operated at 300 MHz for 1H and 75 MHz for 13C at 298 K. NMR spectra were obtained using CDCl3 as solvent; chemical shifts are expressed as δ units (ppm) relative to tetramethylsilane (TMS) as internal standard. The abbreviations s, d, dd, t, q, m and sb refer to singlet, doublet, doublet of doublet, triplet, quartet,



Fig. 2.6. Oxalyl Chloride

Fig. 2.7. Structure of HOBt (hydrate, xH2O)

multiplet and broad singlet signal, respectively. 2D NMR experiments were also recorded with a Bruker Avance 300 MHz spectrometer at 298 K. The MS-CI spectra were measured with a Profile Kratos spectrometer. Optical rotations were measured on an AA-100 polarimeter. The concentrations (c) quoted with the [α]D values are given in g/mL. Column chromatography was performed using standard grade silica gel. Analytical thin layer chromatography (TLC) was carried out using commercial silica-coated aluminium sheets; compound spots were visualized by staining with a suitable dye, either 5% sulphuric acid in ethanol or 0.5% ninhydrin in ethanol, and charring. Infra-red spectra were obtained as films in the region 4000–400 cm−1 on a Nicolet Impact 400D spectrophotometer. Evaporation under reduced pressure was always effected with the bath temperature kept below 40 °C.
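For reference, the specific rotations quoted in the compound data follow the standard polarimetry definition. The expression below is the textbook relation rather than anything stated in the procedures here; the observed rotation $\alpha_{\mathrm{obs}}$ and the cell path length $l$ are symbols introduced for illustration, and the 20 °C and sodium D-line conditions are inferred from the $[\alpha]^{20}_{D}$ notation.

```latex
[\alpha]^{20}_{D} = \frac{\alpha_{\mathrm{obs}}}{l \cdot c}
\qquad
\begin{aligned}
\alpha_{\mathrm{obs}} &: \text{observed rotation in degrees} \\
l &: \text{path length in dm (assumed cell length, not given in the text)} \\
c &: \text{concentration in g/mL}
\end{aligned}
```

With c expressed in g/mL as stated above, a very dilute sample gives a small observed rotation, so the quoted specific rotation is sensitive to the reading error of the polarimeter.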


Fig 3.2. 2,3,4,6-tetra-O-acetyl-1-bromo-α-D-galactopyranose 1 (ROF 3.1)

HBr 33% in HAc (5 mL, 0.138 moles) was added to 1,2,3,4,6-penta-O-acetyl-galactopyranose (2 g, 5.124 mmol) in an ice-water bath and the mixture was left to stir overnight at rt. The reaction was monitored closely by TLC (ethyl acetate: hexane 2:1). It was evaporated and the reaction mixture was dissolved in ethyl acetate (50 mL) and a



Fig 3.3. 2,3,4,6-tetra-O-acetyl-1-O-docosanyl-β-D-galactopyranose 2 (ROF 8.1)

red/orange solid was removed by filtration. The reaction mixture was washed with CH2Cl2 (50 mL) and saturated sodium carbonate (30 mL x 3) followed by brine (30 mL). The organic layer was dried over MgSO4, evaporated, high-vacuum dried and stored in the fridge to give compound 1 (ROF 3.1) as a yellow/brown oil (1.575 g, 74.76 %).

Rf = 0.55 (hexane/ethyl acetate 2:1). [α]20D = +142.96 (c 0.028, CH2Cl2).

1H-NMR (CDCl3, 300 MHz): δ = 6.74-6.73 (d, J=3.9 Hz, 1H; H-1), 5.52-5.51 (d, J=3.2 Hz, 1H; H-4), 5.41-5.36 (dd, J=3.2 Hz, J=10.6 Hz, 1H; H-3), 5.06-5.01 (dd, J=3.9 Hz, J=10.6 Hz, 1H; H-2), 4.54-4.49 (t, J=6.4 Hz, 1H; H-5), 4.24-4.09 (m, 2H; H-6, H-7), 2.16-1.99 (4 x s, 12H; 4 x -OC=OCH3).

13C-NMR (CDCl3, 75 MHz): δ = 169.92-169.35 (4 x s; 4 x OC=OCH3), 88.35 (d; C-1), 71.05 (d; C-5), 67.76 (d; C-3), 67.54 (d; C-4), 66.8634 (d; C-2), 60.73 (d; C-6), 20.37-20.23 (3 x q; 3 x -OC=OCH3).

MS-CI: 82.97 (CH2OAc), 149.00, 324 (sugar with loss of CH2OAc group)

IR (film from CH2Cl2): 3483.07 (OH stretch), 3060.73 (CH stretch), 2964.31 (CH stretch), 2726.69 (CH stretch), 2422.43 (CH stretch), 2121.16, 1974.54, 1748.96 (C=O stretch), 1648.37, 1432.22 (CH2 bend), 1219.92 (C-O stretch), 737 (C-Br stretch) cm-1.

Dry CH2Cl2 (35 mL) was added to a mixture of 1-docosanol (510 mg, 1.56 mmol) and silver carbonate (381 mg, 1.38 mmol) in the dark under argon at rt and was left stirring for 10 minutes. Molecular sieves were heated, ground and added to the mixture, which was stirred for a further 30 minutes. Iodine (18 mg, 0.14 mmol) was then added, followed by a solution of 1 (ROF 3.1) (283 mg, 1.38 mmol) in dry CH2Cl2 (5 mL), which was added dropwise over a period of 40 minutes. The reaction was left to stir overnight at room temperature. The reaction mixture was then vacuum filtered through Celite, washing with CH2Cl2 (30 mL). The filtrate was washed with brine (3 x 30 mL) and the organic layer was dried over MgSO4. It was then high-vacuum dried to give an oil that was purified by column chromatography (hexane/ethyl acetate 4:1) to give 2 (ROF 8.1) as white crystals (107 mg, 23.69 %).


Rf = 0.26 (hexane/ethyl acetate 4:1). [α]20D = -8.25 (c 0.004, CH2Cl2).

1H-NMR (CDCl3, 300 MHz): δ = 5.39-5.39 (d, J=1 Hz, 1H; H-4), 5.38-5.17 (m, 1H; H-2), 5.04-4.99 (dd, J=3.4 Hz, 10.4 Hz, 1H; H-3), 4.46-4.43 (d, J=7.8 Hz, 1H; H-1), 4.19-4.09 (m, 2H; OCH, H-6), 3.91-3.87 (m, 2H; OCH, H-7), 3.48-3.45 (t, J=6.9 Hz, 1H; H-5), 2.15 (s, 3H; OCOCH3), 2.05 (s, 6H; 2 x OCOCH3), 1.98 (s, 3H; -OCOCH3), 1.25 (s, 40H; -(CH2)20CH3), 0.90-0.86 (t, J=6.5 Hz, 4H; -CH3).

13C-NMR (CDCl3, 75 MHz): δ = 170.39 (s; -OC=OCH3), 170.29 (s; -OC=OCH3), 170.20 (s; -OC=OCH3), 169.35 (s; -OC=OCH3), 101.37 (d; C-1), 70.99 (d; C-3), 70.57 (d; C-5), 70.31 (t; C-6), 68.96 (d; C-2), 67.09 (d; C-4), 61.29 (t; OCH2), 31.92 (t; -CH2CH2CH3), 29.62 (t; -OCH2CH2), 29.35 (t; -OCH2CH2(CH2)16), 25.82 (t; -CH2CH2CH2CH3), 22.68 (t; -CH2CH3), 20.67 (q; -OC=OCH3), 20.74 (q; -OC=OCH3), 20.59 (q; -OC=OCH3), 14.11 (q; -CH2CH3).

MS-CI: 331 (fragment from sugar unit), 309 (C22H45), 229, 169

IR (film from CH2Cl2): 3483.92 (OH stretch from starting material), 3059.64 (CH stretch), 2919.21 (CH stretch), 2850.95 (CH stretch), 2119.04, 1751.25 (C=O stretch) cm-1.
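The isolated yields quoted for 1 and 2 can be checked from the reported masses. The sketch below is illustrative only: it assumes 1:1 stoichiometry with the sugar starting material as the limiting reagent, and the molar masses are not given in the text but are computed here from the molecular formulas of the named compounds. The function name `percent_yield` is hypothetical.

```python
def percent_yield(mass_start_g, mw_start, mass_product_g, mw_product):
    """Isolated yield (%) assuming 1:1 stoichiometry and that the
    starting material passed in is the limiting reagent."""
    mol_start = mass_start_g / mw_start           # moles of limiting reagent
    theoretical_mass_g = mol_start * mw_product   # mass at 100 % conversion
    return 100.0 * mass_product_g / theoretical_mass_g


# Molar masses in g/mol: assumptions computed from molecular formulas,
# not values taken from the experimental section.
MW_PENTAACETATE = 390.34  # C16H22O11, penta-O-acetyl-galactopyranose
MW_BROMIDE_1 = 411.20     # C14H19BrO9, glycosyl bromide 1
MW_GLYCOSIDE_2 = 656.90   # C36H64O10, docosanyl glycoside 2

# Masses as reported: 2 g pentaacetate -> 1.575 g of 1;
# 283 mg of 1 -> 107 mg of 2.
yield_1 = percent_yield(2.0, MW_PENTAACETATE, 1.575, MW_BROMIDE_1)
yield_2 = percent_yield(0.283, MW_BROMIDE_1, 0.107, MW_GLYCOSIDE_2)

print(f"yield of 1: {yield_1:.2f} %")
print(f"yield of 2: {yield_2:.2f} %")
```

Under these assumptions the two values should land within rounding of the reported 74.76 % and 23.69 %; small differences would come from the atomic masses used.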


Fig. 3.4. N-tert-butoxycarbonyl-L-serine tetradecyl amide 3 (ROF 6.1)

DMF (5 mL) was added to a mixture of Boc-L-Serine (250 mg, 1.22 mmol), TBTU (430 mg, 1.34 mmol) and HOBt (181 mg, 1.34 mmol) under nitrogen and was left to stir for 10 minutes. 1-Tetradecylamine (286 mg, 1.34 mmol) and NEt3 (0.2 mL, 1.46 mmol) were then added under nitrogen to the reaction mixture, which was left to stir overnight. CH2Cl2 (25 mL), brine (30 mL) and 0.2 N hydrochloric acid (30 mL) were added. The aqueous layer was extracted with CH2Cl2 (3 x 25 mL), and the combined organic layers were washed with saturated sodium hydrogen carbonate solution (30 mL). The organic layer was dried over MgSO4. The solvent was then evaporated under reduced pressure and the residue vacuum dried to give an oil that was purified by column chromatography (hexane/ethyl acetate 1:1) to give 3 (ROF 6.1) as a white solid (337 mg, 57.89 %).



Fig. 3.5. 2,3,4,6-tetra-O-acetyl-α,β-galactopyranose 4 (ROF 11.1)

Rf = 0.30 (hexane/ethyl acetate 2:1). [α]20D = -8.58 (c 0.013, CH2Cl2).

1H-NMR (CDCl3, 300 MHz): δ = 6.79 (s, 1H; -NHCH2), 5.69-5.67 (d, J=7.5 Hz, 1H; -OH), 4.14-4.04 (m, 2H; Hα, Hβ), 3.65-3.59 (m, 2H; Hβ’, -NHC=O), 3.25-3.23 (m, 2H; -NHCH2), 1.45 (s, 12H; -OC(CH3)3), 1.25 (s, 24H; -NHCH2(CH2)12CH3), 0.90-0.86 (t, J=6.42 Hz, 3H; -CH2CH3).

13C-NMR (CDCl3, 75 MHz): δ = 171.28 (s; -OC=O), 156.32 (s; -C=ONHCH2), 80.48 (s; -C(CH3)3), 77.45-76.61 (m, CDCl3, -CH2OH), 62.90 (t; NHCH2-), 54.84 (d; -NHCHCH2OH), 39.53 (t; -C=ONHCH2CH2), 31.91-26.85 (t x 10; -NHCH2CH2(CH2)10), 28.29 (q; -OC(CH3)3), 22.68 (t; -CH2CH3), 14.10 (q; -CH2CH3).

MS-CI: 556.00, 345, 240 (C=ONHC14H29), 212 (-NHC14H29), 160 (-CH(CH2OH)NBoc), 57 (C(CH3)3)

IR (film from CH2Cl2): 3319.60 (OH stretch), 3107.63 (NH stretch), 2924.92 (CH stretch), 2853.95 (CH stretch), 1706.4 (C=O stretch), 1650.17 (NH bend), 1174.77 (CO stretch) cm-1.

Benzylamine (0.307 mL, 2.81 mmol) was added to a solution of 1,2,3,4,6-penta-O-acetyl-galactopyranose (1.224 g, 2.34 mmol) in dry THF (30 mL) and it was stirred overnight at 50°C in a silicone oil bath. CH2Cl2 (30 mL) was added followed by 0.4 N HCl (20 mL) and the organic layer was washed with brine (3 x 20 mL). The organic layer was dried over MgSO4 and was evaporated under reduced pressure to


Fig. 3.6. 2,3,4,6-tetra-O-(2,2,2-trichloro-acetimidoyl)-α-D-galactopyranose 5 (ROF 12.1)


give 4 (ROF 11.1) as a brown oil (1.345 g, 122.93 %). This was used without further purification. Rf =0.47 (hexane /ethyl acetate 1:1). 4 (ROF 11.1) (0.7 g, 1.06 mmol) was dissolved in dry CH2Cl2 (5 mL) under nitrogen at rt. Trichloroacetonitrile (765 mg, 5.3 mmol) was added to the reaction mixture. Sodium hydride (17 mg, 0.707 mmol) was then added under nitrogen and the reaction mixture was left to stir for 40 minutes. The solvent was then evaporated under reduced pressure to give an oil that was purified by column chromatography (2:1 hexane/ ethyl acetate) to give 5 (ROF 12.1) as white crystals (310 mg, 79.32 %). Rf = 0.71 (hexane /ethyl acetate 1:1). [α]20D = +32.88 (c 0.658, CH2Cl2). H-NMR (CDCl3, 300 MHz): δ = 8.67 (s, 1H; -C=NH ), 6.61-6.01 (d, J=3.3 Hz, 1H; H-1), 5.57-5.57 (d, J=1.2 Hz, 1H; H-3); 5.56-5.38 (m, 2H; H-2, H-4), 4.47-4.42 ( t, J= 6.6 Hz, 1H; H-5), 4.20-4.05 (m, 2H; H-6, H -7) , 2.17 (s, 3H; -OC=OCH3 ), 2.03-2.01 (3 x s, 9H; -OC=OCH3 ). 1

13C-NMR (CDCl3, 75 MHz): δ = 170.29-169.96 (4 x s; -OC=OCH3), 160.95 (-OC=(NH)CCl3), 93.54 (d; C-1), 90.77 (s; -CCl3), 68.99 (d; C-5), 67.51-68.91 (3 x d; C-2, C-3, C-4), 61.25 (d; C-6), 20.65-20.54 (4 x q; 4 x -OC=OCH3).

MS-CI: 170 (-OCNHCCl3), 331 (sugar without anomeric substituent), 347.

IR (film from CH2Cl2): 3479.62 (NHstretch), 3319.88 (NHstretch), 2964.29 (CHstretch), 2922.87, 2851.67 (CHstretch), 1751.35 (C=Ostretch), 1677.40 (C=Ostretch), 1617.98 (CHstretch) cm-1.

Fig. 3.7. 2,3,4,6-tetra-O-acetyl-1-O-[2-tert-butoxycarbonylamino-2-tetradecylcarbamoyl-(S)-ethyl]-β-D-galactopyranose 6 (ROF 14.1)

Dry CH2Cl2 (7 mL) was added to a mixture of 5 (ROF 12.1) (188 mg, 0.38 mmol) and 3 (ROF 6.1) (129 mg, 0.38 mmol) under nitrogen in an ice-water bath. Over a period of 10 minutes a 0.04 N TMSOTf solution (0.95 mL) was added dropwise to the reaction mixture. It was then left to stir overnight at room temperature. The reaction was quenched with sodium hydrogen carbonate and washed with brine (3 x 30 mL). The organic layer was dried over MgSO4 and evaporated under reduced pressure to give an oil that was purified by column chromatography (hexane/ethyl acetate 3:1 to 2:1 to 1:1 to 1:1.5) to give 6 (ROF 14.1) as yellow-white crystals (30 mg, 10.36%). Rf = 0.76 (hexane/ethyl acetate 1:1). [α]20D = -1.08 (c 0.004, CH2Cl2).

1H-NMR (CDCl3, 300 MHz): δ = 6.36 (bs, 1H; -NHCH2), 5.40-5.39 (dd, J=0.9 Hz, J=3.4 Hz, 2H; H-4, NHCH), 5.21-5.15 (dd, J=7.9 Hz, J=10.5 Hz, 1H; H-2), 5.04-4.98 (dd, J=3.4 Hz, J=10.5 Hz, 1H; H-3), 4.57-4.54 (d, J=7.9 Hz, 1H; H-1), 4.27 (bs, 1H; Hα), 4.17-4.14 (t, J=1.9 Hz, 2H; H-6, H-7), 4.07-3.97 (m, 2H; H-5, Hβ), 3.76-3.70 (dd, J=7.9 Hz, J=10.4 Hz, 1H; Hβ’), 3.28-3.23 (dd, J=5.4 Hz, 11.0 Hz, 2H; -NHCH2), 2.16-1.98 (4 x s, 12H; -OC=OCH3), 1.45 (s, 9H; -(C=O)OC(CH3)3), 1.25 (s, 24H; -(CH2)12CH3), 0.90-0.86 (t, J=6.5 Hz, 3H; -CH2CH3).

13C-NMR (CDCl3, 75 MHz): δ = 170.45 (s; -(C=O)OC(CH3)3), 102.00 (d; C-1), 170.08 (s; -C=ONHCH2), 170.01 (s; -OC=OCH3), 169.58 (s; -OC=OCH3), 169.28 (s; -OC=OCH3), 71.06 (d; C-5), 70.70 (d; C-3), 68.70 (d; C-2), 66.97 (d; C-4), 61.24 (d; C-6), 39.69 (t; -NCH2), 31.92 (t; -NHCH2CH2), 29.64-29.35, 28.29-26.83 (5 x t; -CH2CH2(CH2)10CH2CH3), 29.28 (q; -(C=O)OC(CH3)3), 22.68 (t; -CH2CH3), 20.76-20.54 (3 x q; 3 x -C=OCH3), 14.11 (q; -CH2CH3).

MS-CI: 240 (C=ONHC14H29), 331 (sugar), 390. IR (film from CH2Cl2) : 3335.00 (NHstretch), 2925.52 (CHstretch), 2854.47 (CHstretch), 1754.88 (C=Ostretch), 1660.45 (C=Ostretch), 1527.79, 1467.57 (CHbend), 1368.16 (CHbend), 1225.46 (COstretch), 1172.18 (CNstretch) cm-1.

4. Results and Discussion

4.1. Synthesis of glycolipid 2

A retrosynthetic pathway for the glycolipid analogue 2 of α-GalCer is depicted in Scheme 4.1. The glycosyl halide 1 reacts with an alcohol in an SN1 substitution in the presence of Ag2CO3 and gives the glycoside 2 by a Koenigs-Knorr reaction.17 The glycosyl halide 1 in its protected form is a key intermediate that can be obtained by reaction of 33% HBr in acetic acid with 1,2,3,4,6-penta-O-acetyl-galactopyranose to give a 75% yield of the glycosyl halide 1. The synthetic route followed is shown in Scheme 4.2.

Scheme 4.1. Retro-synthesis of α-GalCer analogue 2.

Scheme 4.2. Synthesis of glycolipid derivatives 2. Reagents and conditions: (a) HBr/HAc, 0°C; (b) 1-Docosanol, CH2Cl2, Ag2CO3, I2, rt, N2.

4.1.1. Synthesis of glycosyl halide 1

The glycosyl halide 1 was prepared by reacting 1,2,3,4,6-penta-O-acetyl-galactopyranose with a 33 % solution of HBr in acetic acid. Purification was attempted by recrystallisation from methanol but decomposition of compound 1 was observed. Therefore, the glycosyl halide 1 was prepared at 0 °C without further purification by recrystallisation and was stored in the fridge. It was characterised as a crude product and used in a subsequent reaction as in Scheme 4.2.

position     δ (ppm)                    multiplicity   J (Hz)
H-1          6.74                       d              3.9
H-4          5.52                       d              3.2
H-3          5.41                       dd             3.2, 10.6
H-2          5.06                       dd             3.9, 10.6
H-5          4.54                       t              6.4
H-6, H-7     4.24                       m              n/a
OC=OCH3      2.16, 2.11, 2.05, 1.99     4 x s          n/a

Table 4.1. 1H NMR Chemical Shift Assignments for glycosyl halide 1 in CDCl3.

Structural elucidation of the glycosyl halide 1 was performed through a detailed analysis of the COSY data (Appendices 1 (ROF 3.1)). Starting from the anomeric hydrogen signal (δ = 6.74, d, J=3.9 Hz), the sugar moiety spin system from C-1 to C-6 was assigned as in Table 4.1. The anomeric hydrogen (H-1) has a small coupling constant (J = 3.9 Hz), indicating that the bromide leaving group occupies the α position on the galactopyranose sugar moiety. The anomeric hydrogen is coupled to a proton at C-2 resonating at δ = 5.06 (dd, H-2), which in turn couples to a proton at C-3 (δ = 5.41, dd, H-3). This signal is coupled to a proton at C-4 (δ = 5.52, d, H-4) which is coupled to another proton at C-5 resonating at δ = 4.54 (t, H-5). Sequential assignments from this proton allowed us to assign the protons of C-6 (δ = 4.24, m, H-6, H-7). The signals of C=OCH3 were found to be resonating at δ = 2.16, 2.11, 2.05 and 1.99 respectively. The 13C NMR spectrum contains 4 quaternary signals at δ = 169.92-169.35 for the carbonyl carbons of the protecting acetoxy groups. The anomeric carbon is at δ = 88.35, which couples to the anomeric proton in the HSQC spectrum. The sugar skeleton is elucidated from C-2 through to C-6 from the HSQC and DEPT-135 spectra. The methyl groups of the acetoxy groups are at the lowest chemical shift of δ = 20.37-20.23. The mass spectrum obtained for this compound shows fragments with masses of 82.97, 149.00 and 324 respectively. A source such as an electron ionization source produces many fragments, so it comes as no surprise that the total mass of compound 1 is not present. The molecular weight of the glycosyl halide 1 is 411.20 and, with the loss of the CH2OAc fragment at C-5, it has a mass of 324. This mass is indeed present in the mass spectrum. The CH2OAc fragment corresponds to the peak with a mass of 82.97, thereby verifying the structure of the glycosyl halide 1. 
The IR spectrum was also a useful tool in determining the presence of the glycosyl halide 1, with characteristic CHstretch bands present at 3060.73, 2964.31, 2726.69 and 2422.43 cm-1 and a C-Brstretch at 737 cm-1. The [α]20D yielded a value of +142.96, indicating the presence of chiral centres. The stereochemical outcome of the glycosyl donor 1 is determined by the anomeric effect. Substituents on a six-membered ring typically adopt the equatorial position to minimize steric interaction with the ring. In some instances, however, thermodynamic considerations favour the axial position, which is known as the anomeric effect.25 This effect occurs for electronegative substituents such as oxygen, fluorine, chlorine and bromine at the anomeric centre of the sugar. When there is an electronegative substituent (X) in an axial position at the anomeric centre, the oxygen atom of the ring has one of its lone pairs of electrons antiperiplanar to the C-X bond. This lone pair can partially donate electrons into the anti-bonding (σ*) orbital of the C-X bond as in Scheme 4.3. This induces electron delocalisation around the ring and is therefore stabilising. Thus the anomeric substituent is in the α position, as determined from the NMR data (Table 4.1).

Scheme 4.3. Partial donation of the O lone pair (n orbital) into the antibonding orbital (σ*) in a pyranose ring.
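The link between the small H-1/H-2 coupling in Table 4.1 and the α configuration can be illustrated with a Karplus-type relation, J = A cos²θ + B cos θ + C, which relates a vicinal coupling constant to the H-C-C-H dihedral angle θ. The sketch below is illustrative only: the coefficients A, B and C are assumed generic values, not fitted to this work, and the function names and the 6 Hz threshold are hypothetical. It shows that a gauche arrangement (about 60°, axial/equatorial protons) predicts a small coupling, whereas an antiperiplanar arrangement (about 180°, diaxial protons) predicts a large one.

```python
import math

# Illustrative Karplus coefficients in Hz (assumed values, not fitted to this work).
A, B, C = 7.8, -1.0, 1.4


def karplus_j(dihedral_deg: float) -> float:
    """Estimate a vicinal 3J(H,H) coupling (Hz) from the H-C-C-H dihedral angle."""
    theta = math.radians(dihedral_deg)
    return A * math.cos(theta) ** 2 + B * math.cos(theta) + C


def anomer_from_j12(j12_hz: float, threshold_hz: float = 6.0) -> str:
    """Rule of thumb for galactopyranosides (the threshold is an assumption):
    a small J(H-1,H-2) points to a gauche H-1/H-2 pair (alpha substituent),
    a large one points to diaxial H-1/H-2 (beta substituent)."""
    return "alpha" if j12_hz < threshold_hz else "beta"


if __name__ == "__main__":
    print(f"gauche (60 deg): {karplus_j(60):.1f} Hz")
    print(f"anti (180 deg):  {karplus_j(180):.1f} Hz")
    # J(H-1,H-2) of glycosyl halide 1 from Table 4.1
    print("J = 3.9 Hz ->", anomer_from_j12(3.9))
```

With these assumed coefficients the gauche coupling comes out at a few hertz and the antiperiplanar coupling near 10 Hz, which is the qualitative pattern the assignment relies on; the exact values depend on the substituents and should not be read as predictions for compound 1.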

4.1.2. Synthesis of glycolipid 2

Once it was established that high α selectivity was attainable as in glycosyl bromide donors 1, β-selective glycosylation strategies were investigated.17 The neighbouring group at C-2 in the glycosyl donor 1 is protected with an ester group, which is well documented to provide good anchimeric assistance.26 The presence of this neighbouring group at C-2 results in the formation of a β substituent in the glycoside 2 as in Scheme 4.2, instead of a mixture of stereoisomers. Our initial approach to the glycoside 2 was to react the glycosyl halide 1, as in the literature, with 1-docosanol in the presence of AgOTf at low temperature to activate the halide leaving group, but very little sugar was present in the NMR spectrum, showing that the reaction did not proceed.27 Ag2CO3, which is believed to be a better activator than AgOTf,28 was then used in its place. Thus the glycosyl bromide donor 1 was reacted with the long-chain alcohol 1-docosanol, with Ag2CO3 activating the bromide leaving group and I2 acting as a catalyst, to give the glycoside 2 in a poor yield of 24%. Possible reasons for this low yield are the formation of side-products, for example by saponification, or thermal decomposition of the product 2. Because water was excluded from the reaction, direct saponification of the starting compounds is unlikely. However, the thermostability of this compound is yet to be investigated.

position        δ (ppm)   multiplicity   J (Hz)
H-2             5.38      m              n/a
H-4             5.39      d              1
H-3             5.04      dd             3.4, 10.4
H-1             4.46      d              7.8
H-6             4.19      m              n/a
H-7             3.91      m              n/a
H-5             3.48      t              6.9
-OCH            4.19      m              n/a
-OCH            3.91      m              n/a
-OCOCH3         2.15      s              n/a
-OCOCH3         2.05      2 x s          n/a
-OCOCH3         1.98      s              n/a
-(CH2)20CH3     1.25      s              n/a
-CH3            0.90      t              6.5

Table 4.2. 1H NMR Chemical Shift Assignments for glycoside 2 in CDCl3.

Structural elucidation (Appendices 2 (ROF 8.1)) of the glycoside 2 was again performed through detailed analysis of the 1H NMR, 13C NMR and 2D NMR techniques such as COSY and HSQC. The anomeric proton was found to be a doublet at δ = 4.46 (Table 4.2) with a coupling constant of 7.8 Hz. A galactose sugar moiety containing a β substituent contains trans-diaxial hydrogen atoms which are aligned with a dihedral angle of 180° and give large J values. Therefore this large coupling constant indicates that the substituent on the sugar moiety in the glycoside 2 is in the β stereochemical arrangement. H-2 was found to be a multiplet at δ = 5.38 with an integration of 1. It was verified to be coupled to the anomeric hydrogen via the COSY data. This in turn was found to be coupled to a proton at C-3 (δ = 5.04, dd, H-3), which was coupled to a proton at C-4 (δ = 5.39, d, H-4). The signal at δ = 3.48 belongs to the H-5 proton (t, 1H) but it does not seem to couple with the H-4 proton in the COSY spectrum. This is probably because the proton at the C-4 position is equatorial while the proton at the C-5 position is axial. Thus the dihedral angle is 60° and the axial/equatorial coupling is very small and may not be observed in the COSY spectrum. Sequential assignments from this proton allowed us to assign the protons of C-6, with H-6 present at δ = 4.19 as part of a multiplet and H-7 present at δ = 3.91 as part of another multiplet. One proton of the OCH2 group was found to be resonating in a multiplet with the H-6 proton and the other proton of the OCH2 group was found to be resonating within the multiplet shared with the H-7 proton. The protons of the acetoxy groups were found to be resonating at δ = 2.15, 2.05 and 1.98 respectively. The signal for the long hydrocarbon chain was found to be resonating at δ = 1.25


with an integration of 40 as expected. The methyl group CH3 at the end of the chain was found to be resonating at δ = 0.90. From the COSY 2D NMR the sugar moiety was assigned with relative ease. It was more difficult to assign the other protons such as the OCH2 protons. However it did become apparent from the COSY NMR that the 2 protons in the OCH2 group are in different environments. Take the signal at δ = 4.19 as an example containing one of the protons of the CH2 group (OCH) and the H-6 signals. If you look closely at this position, it can be seen that there is coupling of the OCH to the long hydrocarbon chain (CH2)20 at δ = 1.25 and coupling between H-6 and H-7 at δ = 3.91. A similar deduction can be made about the signal at δ = 3.91. The signal for the long hydrocarbon chain was elucidated with ease due to the low chemical shift (δ = 1.25) and the large integration value of 40. Similarly the triplet at δ = 0.90 was found to be the methyl group at the end of the hydrocarbon chain. On analysis of the 13C NMR, HSQC and DEPT-135, the structure of the glycolipid 2 was further verified. The quaternary carbons of the acetoxy groups were found to be singlets at δ = 170.39, 170.29, 170.20, and 169.35 respectively which is in direct agreement with the DEPT-135 NMR as these signals are no longer present. The high chemical shifts of these signals are indeed indicative of carbonyl substituents. The signal at δ = 101.37 was found to be the anomeric carbon on the sugar moiety. This signal was deduced from analysis of the HSQC spectrum as it is coupled to the anomeric proton at δ = 4.46 on the 1H NMR. It appears at a chemical shift characteristic of an anomeric proton also. DEPT-135 further verifies this deduction as the peak appears in the negative direction. The signals C-3, C-5, C-6, C-2, C-4 were found to be resonating at δ = 70.99, 70.57, 70.31, 68.96, 67.09 respectively in the 13C NMR. 
In the HSQC spectrum, coupling between the various carbons and protons verified the assignment of these signals. DEPT-135 was in direct agreement with the HSQC data also, with the CH peaks in the negative direction and the CH2 peak in the positive direction. The remainder of the signals were assigned to the carbon atoms in the long hydrocarbon chain and in the protecting groups. A heteronuclear NOE experiment irradiating protons would be helpful in the total determination of the structure. Since, however, the structure of 2 is already elucidated by 1H NMR and 13C NMR, we can waive such measurements. The IR absorption bands for this compound simply verify the various functional groups and do not yield sufficient information to characterise this compound. These absorption bands were found at 3483.92 (OHstretch), 3059.64 (CHstretch), 2919.21 (CHstretch), 2850.95 (CHstretch), 2119.04, 1751.25 (C=Ostretch) cm-1. The absorption band of interest is present at 3483 cm-1 as it highlights the possible presence of OH-containing starting material. Acetylation of galactose with acetic anhydride and pyridine at room temperature produces the fully acetylated compound in Scheme 4.4 as a mixture of stereoisomers at the anomeric centre, called anomers. It is well known that free sugars, as seen in Scheme 4.4, exist as an equilibrium mixture of anomers. The product in Scheme 4.4 was the starting material in the production of the glycolipid 2 as seen in Scheme 4.2. From the IR spectrum, it is clear that further purification of the glycolipid compound 2 is necessary. The mass spectrum showed fragments at 331, 309, 229 and 169. The fragment at 331 is in direct agreement with the sugar moiety without the -OC22H45 substituent, while the C22H45 fragment has a mass of 309, which is present in the mass spectrum with a very high relative abundance, in good agreement with the proposed structure.

Scheme 4.4. Synthesis of 1,2,3,4,6-penta-O-acetyl-galactopyranose. Reagents and conditions: (a) Ac2O, py, rt.
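The fragment assignments for glycoside 2 can be cross-checked by computing monoisotopic masses from molecular formulae. The sketch below is a minimal illustration under stated assumptions: the formulae C14H19O9 (acetylated sugar without its anomeric substituent) and C22H45 (docosyl chain) are inferred here from the structures in Scheme 4.2, the isotope masses are standard rounded values, the mass of the lost electron is ignored, and the helper name is hypothetical.

```python
import re

# Monoisotopic masses (u) of the most abundant isotopes, rounded.
MONOISOTOPIC_MASS = {"C": 12.0, "H": 1.007825, "N": 14.003074, "O": 15.994915}


def formula_mass(formula: str) -> float:
    """Monoisotopic mass of a simple formula such as 'C14H19O9' (no brackets)."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += MONOISOTOPIC_MASS[element] * (int(count) if count else 1)
    return total


if __name__ == "__main__":
    # Formulae inferred from Scheme 4.2 (assumption, see lead-in).
    fragments = {
        "sugar without -OC22H45": "C14H19O9",
        "C22H45 chain": "C22H45",
    }
    for label, formula in fragments.items():
        print(f"{label:24s} {formula:10s} {formula_mass(formula):8.2f}")
```

Both computed values fall at the nominal masses quoted in the text (331 and 309), which supports the assignments but does not by itself exclude other fragments of the same nominal mass.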

4.2. Synthesis of amino acid glycolipid 6

Firstly, the amide derivative of N-Boc serine 3 suitable for glycosylation was synthesised as shown in Scheme 4.5. This compound was reacted with the imidate product 5, synthesised as shown in Scheme 4.6, in the glycosylation reaction shown in Scheme 4.7. The amide derivative 3 was chosen because serine-based lipids have been reported to exhibit bioactivity similar to that of ceramide mimics such as α-GalCer.29 The whole synthetic process took 4 steps to synthesise the glycolipid 6 in 10% yield.

4.2.1. Synthesis of amide derivative 3

In early attempts at synthesising the amide derivative 3, 1-tetradecylamine, DIC and NEt3 were reacted, a system that was used successfully in previous reports with DIC acting as an amide coupling reagent.30 The product 3 was formed in a small yield, as confirmed by TLC analysis, but unfortunately the separation of the urea by-product and the desired product 3 proved to be very difficult as both the urea and the amide derivative 3 were readily soluble in chloroform. Our focus was therefore turned to another carbodiimide coupling reagent, EDCI, in a reaction under similar conditions. The advantage of using EDCI over DIC as a carbodiimide coupling reagent is that the urea by-product formed from EDCI is water soluble whereas the urea formed from DIC is not. Purification of the product would therefore be simpler using EDCI as the amide coupling reagent. However, this reaction did not proceed to completion, as was evident in the preliminary 1H NMR results. TLC results showed no product formation either, confirming the failed reaction. Our focus then turned to a different synthetic route where oxalyl chloride was reacted with 1-tetradecylamine, but again we were met with failure. This unsatisfactory result led us to search for a more practical synthetic route to the compound 6, thus reverting to a uronium coupling reagent. TBTU seems to be one of the best amide coupling reagents, succeeding in difficult sterically hindered couplings and giving minimal racemisation where racemisation is a risk.31 However, one disadvantage of the TBTU coupling reagent is the cost. HOBt is used to reduce the chances of racemisation and side reactions by generating an active ester in situ. Therefore TBTU/HOBt were reacted with 1-tetradecylamine and NEt3 to synthesise the amide derivative 3.

Scheme 4.5. Synthesis of amide derivative 3. Reagents and conditions: (a) TBTU/HOBt, 1-tetradecylamine, NEt3, DMF, rt.

Scheme 4.6. Synthesis of sugar donor 5. Reagents and conditions: (a) Benzylamine, THF, 50°C; (b) Cl3CCN, NaH, CH2Cl2, rt.

Scheme 4.7. Synthesis of glycolipid 6. Reagents and conditions: (a) TMSOTf, CH2Cl2, 0°C.

Structural elucidation (Appendices 3 (ROF 6.1)) of the amide derivative 3 was performed primarily through detailed analysis of the 1H NMR, 13C NMR and also through 2D NMR techniques such as COSY and HSQC. The low-field signal at δ = 6.79 was assigned to the amide proton attached to the long hydrocarbon chain (-NHCH2), appearing as a broad singlet which is shown to be coupled to NHCH2 resonating at δ = 3.25 in the COSY NMR. The proton of the hydroxyl group (OH) is at a chemical shift of δ = 5.69, appearing as a doublet due to one of the diastereotopic protons in close proximity. This signal is shown to couple to both these protons in the COSY NMR, which are present at δ = 4.14 appearing as a multiplet (Hα, Hβ), and these protons are in turn coupled to the Hβ’ proton at δ = 3.65. The NH proton of the N-Boc protecting group (NHC=O) is present at a chemical shift of δ = 3.65 along with the Hβ’ proton, appearing as a multiplet with an integration of 2. These protons should couple to one another but this is unclear in the COSY data because of the small separation between the signals of the respective protons. The protons of the methyl groups of the protecting Boc group (-OC(CH3)3) of the serine derivative are present at δ = 1.45 with an integration of 9. 
The integration of the total peak is 12, indicating that there is still a small amount of impurity present in the sample. At a high-field chemical shift of δ = 1.25 the -(CH2)12CH3 protons are assigned and are shown to be coupled to the methyl end-chain protons present at δ = 0.90 as a triplet. Thus there is clear evidence of the presence of the amide derivative 3 (Table 4.3).

position               δ (ppm)   multiplicity   J (Hz)
-OH                    5.69      d              7.5
-NHCH2                 6.79      s              n/a
Hα                     4.14      m              n/a
Hβ                     4.14      m              n/a
Hβ’                    3.65      m              n/a
-NHC=O                 3.65      m              n/a
-NHCH2                 3.25      m              n/a
-OC(CH3)3              1.45      s              n/a
-NHCH2(CH2)12CH3       1.25      s              n/a

Table 4.3. 1H NMR Chemical Shift Assignments for amide derivative 3 in CDCl3.


On analysis of the 13C NMR, the quaternary carbons are present at high chemical shift at δ = 171.28 (s; -OC=O), 156.32 (s; -C=ONH) and 80.48 (s; -C(CH3)3) respectively. These signals are confirmed to be quaternary by their disappearance in the DEPT-135 NMR. The signal present at δ = 77.45 is characteristic of a CDCl3 solvent peak.32 As evident from the HSQC NMR and DEPT-135 NMR spectra, the carbon attached to the diastereotopic protons (Cβ) at δ = 77.22 (t; CH2OH) was structurally elucidated. At δ = 62.90 (t; NHCH2-) a CH2 group is present, as evident in the DEPT-135 NMR. The 13C NMR was analysed similarly for the chemical shifts δ = 54.84 (d; -NHCHCH2OH), 39.53 (t; -C=ONHCH2CH2), 31.91-26.85 (t x 10; -NHCH2CH2(CH2)10), 28.29 (q; -OC(CH3)3), 22.68 (t; -CH2CH3) and 14.10 (q; -CH2CH3). The presence of the various functional groups was confirmed, with carbon signals indicating the existence of the long hydrocarbon chain and the protecting Boc group. These were further confirmed and verified with the HSQC and DEPT-135 NMR data. On analysis of the mass spectrum, many fragments of the compound 3 were detected, such as the C=ONHC14H29 fragment with a mass of 240. There were also fragments with masses of 212 (-NHC14H29) and 160 (-CH(CH2OH)NHBoc). An important fragment was one with mass 57 (C(CH3)3). The relative abundance of this fragment was very high. There was a fragment present with a mass of 556. This mass is far too high for the compound 3 in question, hinting at the possibility of impurities. The IR spectrum simply conveyed the presence of the various functional groups, such as the hydroxyl group present at 3319.60 cm-1 (OHstretch) and the amide group with bands present at 3107.63 (NHstretch), 1706.4 (C=Ostretch), 1650.17 (NHbend) and 1174.77 (COstretch) cm-1. Therefore compound 3 was indeed synthesised with relative success. The low yield of this reaction would be the main problematic consideration. 
Possible improvements on this synthesis would include the use of HATU as the amide coupling reagent in place of the TBTU which again would be used in conjunction with the HOBt previously used.33 A slightly different solvent system, such as the use of different ratios of the solvent used or using different polarity solvents for the column chromatography might help in better separation to avoid impurities of the amide derivative 3.

4.2.2. Synthesis of imidate sugar donor 5

The imidate sugar donor 5 was synthesised from the reaction of a crude hemiacetal sugar 4 with trichloroacetonitrile and sodium hydride in CH2Cl2 under anhydrous conditions as in Scheme 4.6. The reaction proceeded via an addition reaction in which the hydroxyl substituent at the anomeric carbon of the hemiacetal sugar 4 attacked the electrophilic carbon of the trichloroacetonitrile. The sodium hydride acted as a base, aiding the formation of the α imidate product 5. The preparation of the imidate product 5 to act as a sugar donor as in Scheme 4.6 consisted of a two-step reaction with excellent yields of 90 % and 79 % respectively. It was interesting to see the effects of using different glycosyl donors, as in compounds 1 and 5, and to see whether the subsequent glycosylation reactions resulting in the formation of glycolipids 2 and 6 were improved as a result. Literature results have found that trichloroacetimidate-mediated glycosylation, as an alternative to the classical Koenigs–Knorr procedure, now appears to be one of the most ideal glycosylation protocols.17, 34 The first step in the synthesis of the reactive imidate 5 was the production of a hemiacetal sugar 4 from the same starting material (1,2,3,4,6-penta-O-acetyl-galactopyranose) used in the synthesis of the glycosyl halide 1. This starting material was reacted with benzylamine in THF at 50 °C to give the hemiacetal sugar 4 as in Scheme 4.6. Structural elucidation was carried out on compound 4 through analysis of 1H NMR and 13C NMR spectra, but due to time constraints no COSY, DEPT-135 or HSQC NMR experiments were carried out. Purification of the hemiacetal sugar 4 by column chromatography was not attempted, also because of time constraints. The yield of 90% is therefore somewhat misleading, as impurities are almost certain to be present. Column chromatography purification was carried out on the imidate sugar 5 and structural elucidation was carried out by the aforementioned NMR techniques (Appendices 5 (ROF 12.1)). Starting from the anomeric hydrogen signal (δ = 6.61, d, J=3.3 Hz), the sugar moiety spin system from C-1 to C-6 was elucidated as in Table 4.4. The anomeric proton (H-1) at δ = 6.61 is a peak of significant interest and through the 2D COSY NMR was shown to be coupled to the proton at the C-2 position (δ = 5.56, H-2). All the other signals for the protons on the sugar moiety were present and coupled as expected. The coupling constant of the anomeric proton was found to be 3.3 Hz, indicating that, as in the glycosyl halide 1, the imidate substituent of 5 is in the α position as predicted by the anomeric effect.35 Another peak of interest is one at δ = 8.67 (C=NH) appearing as a singlet. Due to the highly electronegative N atom this proton is present at a high chemical shift. The protons of the protecting groups are present at low chemical shifts at δ = 2.17 (s, 3H; -OC=OCH3) and 2.03-2.01 (3 x s, 9H; -OC=OCH3).

position      δ (ppm)   multiplicity   J (Hz)
H-1           6.61      d              3.3
C=NH          8.67      s              n/a
H-3           5.57      d              1.2
H-2           5.56      m              n/a
H-4           5.56      m              n/a
H-5           4.47      t              6.6
H-6           4.20      m              n/a
H-7           4.20      m              n/a
-OC=OCH3      2.17      s              n/a
-OC=OCH3      2.03      3 x s          n/a

Table 4.4. 1H NMR Chemical Shift Assignments for imidate sugar donor 5 in CDCl3.

The 13C NMR spectrum displays 4 quaternary signals (δ = 170.29-169.96) corresponding to the 4 carbonyl groups in the acetoxy protecting groups. Another quaternary carbon is present, indicating the carbonyl carbon of the imidate group (δ = 160.95). The anomeric carbon is at a relatively high chemical shift (δ = 93.54) and from the HSQC spectrum it can be seen to couple to the anomeric proton. It is also shown to be a CH peak in the DEPT-135 as the signal is facing in the positive direction. The quaternary carbon of the imidate group (-CCl3) bonded to the three Cl atoms is present and, as expected, shows no coupling in the HSQC. The C-5 carbon is present at δ = 68.99, which is a high chemical shift due to the presence of the neighbouring electronegative oxygen. The signals for C-2, C-3 and C-4 are found at δ = 67.51-68.91, but it is difficult to assign each signal to a carbon because the cross peaks in the HSQC lie very close together, which makes these signals difficult to discriminate. These CH signal assignments do agree with the DEPT-135. The C-6 carbon is present at δ = 61.25 and is in agreement with both HSQC and DEPT-135 spectra. At a low chemical shift, four signals are present at δ = 20.65-20.54, corresponding to the methyl groups in the acetoxy protecting groups. Thus it may be concluded that the 1H, COSY, 13C and DEPT-135 NMR data are in good agreement in the structural elucidation of compound 5. From the mass spectrum, further structural characterisation can be made from the presence of the fragments at 170 (-OCNHCCl3) and 331 (sugar without anomeric substituent). The presence of the sugar moiety and the imidate group is sufficient evidence that the compound 5 is present. The IR spectrum shows NH stretches, CH stretches and carbonyl stretches, which is consistent with the proposed structure.
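The DEPT-135 reasoning used for compound 5 can be summarised as a simple lookup. The sketch below assumes the common phasing convention in which CH and CH3 carbons appear positive, CH2 carbons negative and quaternary carbons are absent; a spectrum phased the opposite way simply swaps the two signs. The shifts and carbon types are taken from the assignments for imidate 5 above, while the function and dictionary names are hypothetical.

```python
def dept135_phase(attached_h: int) -> str:
    """Expected DEPT-135 appearance for a carbon bearing `attached_h` hydrogens,
    assuming CH/CH3 are phased positive (a convention, not a measured result)."""
    if attached_h == 0:
        return "absent"    # quaternary carbons give no DEPT signal
    if attached_h == 2:
        return "negative"  # CH2
    if attached_h in (1, 3):
        return "positive"  # CH and CH3
    raise ValueError("expected 0 to 3 attached hydrogens")


# Selected carbons of imidate 5: label -> (shift in ppm, attached hydrogens).
carbons_5 = {
    "C=O (acetyl)": (170.29, 0),
    "C=NH": (160.95, 0),
    "C-1": (93.54, 1),
    "CCl3": (90.77, 0),
    "C-5": (68.99, 1),
    "C-6": (61.25, 2),
    "CH3 (acetyl)": (20.65, 3),
}

if __name__ == "__main__":
    for label, (shift, n_h) in carbons_5.items():
        print(f"{label:14s} {shift:7.2f} ppm  {dept135_phase(n_h)}")
```

Under this convention C-6 is the only listed sugar carbon expected to point the opposite way to C-1 and C-5, which is the distinction the DEPT-135 argument depends on.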

4.2.3. Synthesis of glycolipid 6

As in Scheme 4.7, the glycolipid 6 was synthesised by reacting the sugar imidate 5 with the amide derivative 3 in CH2Cl2 at 0 °C in the presence of a freshly made solution of TMSOTf. The TMSOTf acted as a Lewis acid to activate the imidate leaving group of the reactive sugar 5 to form the glycolipid 6. As for the glycolipid 2, a β substituent is expected in this glycosylation reaction. This glycosylation reaction of 5 with 3 afforded a low yield (10%) of glycolipid 6. Analysis of the 1H NMR and COSY spectra (Appendices 6 (ROF 14.1)) of this glycolipid 6 allowed assignment of the following protons. For the sugar the best entry point is the anomeric carbon (δ ≈ 100), to which the proton signal at δ = 4.57 (d, J=7.9 Hz, 1H; H-1) can be assigned using the HSQC plot. This carbon signal is not seen in either the 13C NMR or the DEPT-135 due to the poor signal/noise ratio. H-1 has one coupling partner at δ = 5.21 (dd, H-2). This H-2 cross peak in the COSY spectrum leads to H-3 (dd) at a chemical shift of δ = 5.04 as well as to H-1. The signal of H-6 (δ = 4.17, t) is very close to that of H-7, located at the same chemical shift, so the cross peak associated with J (6,7) is very close to the diagonal and difficult to detect. Nevertheless this signal is shown to be coupling with H-5 at δ = 4.07. The coupling constant between H-1 and H-2 (J=7.9 Hz) is relatively large, showing that H-2 is in an anti-periplanar orientation with respect to H-1. Thus, H-1 is in the axial position and the peptide substituent is in the equatorial position (β anomer) as expected. The signal of H-4 in the 1H NMR is seen at a chemical shift of δ = 5.40 as a doublet on top of a broad multiplet. The broad multiplet at this chemical shift is due to NHCH. As both signals are relatively close to one another, the coupling constants of H-4 with respect to H-3 and H-5 cannot be determined reliably. Therefore it cannot be known for sure whether H-4 is in the axial position. The significant differences between the coupling constants of the protons on the sugar are a strong indication that the molecule contains a six-membered ring, rather than a flexible, pseudo-rotating five-membered one. This is confirmed by the fact that vicinal coupling constants with magnitudes of approximately 10 Hz are found; such values only appear for fixed antiperiplanar proton orientations. Since the substance is readily soluble in chloroform, and in view of the existence of the four methyl peaks at δ = 2.16, it is reasonable to assume that the saccharide part of the molecule is peracetylated. Thus it may be concluded that the sugar moiety is present in the glycolipid 6. 
It is not possible to determine the connectivity of the amino acid groups in the peptide part of this molecule because 1H,1H coupling across an amide group is too small to be detected by a standard COSY experiment. However, the proton peaks were found at δ = 6.36 (NHCH2) and 5.40 (NHCH) respectively. The chiral centre Cα of the serine derivative makes the methylene Hβ and Hβ’ diastereotopic, i.e., in the 1H NMR their chemical shifts are in principle different, producing more complicated multiplets than originally anticipated. However, a connectivity network can be established starting from the signal at δ = 4.07 (Hβ, m) to a signal at δ = 3.76 (Hβ’, dd), indicating the coupling of these two protons. Hα is found at a chemical shift of δ = 4.27, which correlates with the –OCH2 carbon at δ = 61.24 in the HSQC data. The CH2 groups of the long hydrocarbon chain are present as a singlet at δ = 1.25 and are shown to be coupled to the terminal methyl group of the long chain at δ = 0.90. The overall 1H NMR results are shown in Table 4.5.

Position            δ (ppm)   Multiplicity   J (Hz)
NHCH                5.40      dd             0.9, 3.4
NHCH2               6.36      bs             n/a
H-4                 5.40      dd             0.9, 3.4
H-2                 5.21      dd             7.9, 10.5
H-3                 5.04      dd             3.4, 10.5
H-1                 4.57      d              7.9
Hα                  4.27      bs             n/a
H-6                 4.17      t              1.9
H-7                 4.17      t              1.9
H-5                 4.07      m              n/a
Hβ                  4.07      m              n/a
Hβ’                 3.76      dd             7.9, 10.4
-NHCH2              3.28      dd             5.4, 11.0
-OC=OCH3            2.16      4 x s          n/a
-(C=O)OC(CH3)3      1.45      s              n/a
-(CH2)12CH3         1.25      s              n/a
-CH2CH3             0.90      t              6.5

Table 4.5. 1H NMR Chemical Shift Assignments for glycolipid 6 in CDCl3.

The 13C NMR can be divided into two groups: the signals belonging to the sugar moiety and the signals corresponding to the peptide group. The latter have different combinations of neighbour groupings. For the sugar moiety, three signals are reported (δ = 170.01, 169.58, 169.58) for the protecting acetoxy groups. These are not present in the DEPT-135 spectrum. The acetoxy protecting groups on C-2 and C-3 are equivalent because both are in the equatorial position, while the C-4 protecting group is in the axial position. On approximation, the protecting group of the C-5 carbon would have the lowest chemical shift of these signals due to the presence of the neighbouring CH2 group. The carbons on the skeleton are assigned with the aid of the HSQC spectrum. C-1, as described earlier, is present at δ ≈ 100. C-5 (δ = 71.06) has a relatively high chemical shift due to the presence of the neighbouring electronegative O. The C-3, C-2, C-4 and C-6 signals are at δ = 70.70, 68.70, 66.97 and 61.24 respectively. The assignment of the methyl groups of the protecting groups is straightforward from the HSQC spectrum; they appear at δ = 20.76–20.54. The assignment for the peptide part of the molecule was more complicated due to overlapping signals. The NHCH2 signal was easily identified at δ = 39.69 from the HSQC spectrum and from its negative direction in the DEPT-135 (CH2 group). Owing to the closely spaced signals in the region δ ≈ 61–70 and the low signal-to-noise ratio, assignment is difficult, which left Cβ unassigned, but it is my strong opinion that it overlaps with the C-6 signal, as there seem to be two signals at that position and this is in agreement with the HSQC spectrum provided. At δ = 31.92–29.35, four signals are present for the CH2 groups of the long hydrocarbon chain, as confirmed by the HSQC spectrum. At δ = 29.28 the methyl groups of the Boc protecting group are present. At δ = 28.29 and 26.83 two further peaks are present for the long hydrocarbon chain, and at δ = 14.11 the terminal methyl group of the lipid is present. All 1H and 13C chemical
shifts can be verified by calculations involving increment rules.3 The mass spectrum was analysed with the sugar fragment present at 331. The fragment at C=ONHC14H29 was present with a mass of 240 with a small relative intensity. The IR spectrum is again in agreement verifying various functional groups with OH bands, NH bands and CH stretches.
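The two fragment masses quoted above can be checked with simple nominal-mass arithmetic. The sketch below is an illustration under stated assumptions: the formula C14H19O9 for the sugar fragment (a tetra-O-acetylated hexosyl cation) is an assumption and is not stated in the report, whereas C15H30NO is simply the quoted fragment C=ONHC14H29 written as a sum formula. The class and method names are hypothetical, and only integer nominal masses (C = 12, H = 1, N = 14, O = 16) are used.

```java
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Illustrative only: nominal (integer) mass of a simple sum formula such as C15H30NO. */
final class FragmentMass {

    // Nominal masses of the most abundant isotopes of the elements needed here.
    private static final Map<String, Integer> NOMINAL_MASS =
            Map.of("C", 12, "H", 1, "N", 14, "O", 16);

    // An element symbol followed by an optional count, e.g. "C14" or "N".
    private static final Pattern TOKEN = Pattern.compile("([A-Z][a-z]?)(\\d*)");

    static int nominalMass(String formula) {
        Matcher matcher = TOKEN.matcher(formula);
        int total = 0;
        int position = 0;
        while (matcher.find()) {
            if (matcher.start() != position) {
                throw new IllegalArgumentException("Cannot parse formula: " + formula);
            }
            Integer mass = NOMINAL_MASS.get(matcher.group(1));
            if (mass == null) {
                throw new IllegalArgumentException("Unsupported element: " + matcher.group(1));
            }
            int count = matcher.group(2).isEmpty() ? 1 : Integer.parseInt(matcher.group(2));
            total += mass * count;
            position = matcher.end();
        }
        if (position != formula.length()) {
            throw new IllegalArgumentException("Cannot parse formula: " + formula);
        }
        return total;
    }

    public static void main(String[] args) {
        // Assumed sugar fragment: 14*12 + 19*1 + 9*16 = 331
        System.out.println("C14H19O9 -> " + nominalMass("C14H19O9"));
        // Amide fragment C(=O)NHC14H29: 15*12 + 30*1 + 14 + 16 = 240
        System.out.println("C15H30NO -> " + nominalMass("C15H30NO"));
    }
}
```

Both values agree with the peaks reported at 331 and 240, which is consistent with the assignments made in the text but does not by itself prove them.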

4.3. General Discussion
For the sake of clarity, integration steps are omitted in my elucidation of the structures when discussing 1H NMR. In most cases the number of hydrogens corresponding to a given signal is obvious; if there is any doubt, comments are provided. The melting points of these compounds were not recorded, as many of the compounds synthesised were obtained as oils and foams. Exact measurement of a melting temperature would be difficult, so analysis of the melting point would be inconclusive and yield no structural information. The [α]20D measurements were carried out, but simply verified that the compounds in question were optically active.

5. Conclusion
In conclusion, expedient methods for the synthesis of α-GalCer analogues 2, 6 with β-anomeric selectivity have been devised. These compounds differ significantly in structure from the previously studied and well documented α-GalCer.2, 8, 9 These glycolipids are potentially useful for immunotherapy and cancer treatment. Glycosyl donors 1, 5 were synthesised with α-anomeric selectivity, along with an amide derivative 3. The glycosyl donors 1, 5 could potentially be used in the synthesis of different analogues of α-GalCer in the future. A number of important properties have been suggested (Section 2.2) that would make analogues of α-GalCer superior agents to α-GalCer for a variety of applications in the prevention and treatment of disease.37, 38 In this project, these considerations were taken into account to synthesise unique compounds with different activities and binding abilities. For future development of the work, biological testing of compounds 2, 6 in the deprotected form would verify whether these analogues are immunologically active. Synthesis of the α analogues of compounds 2, 6 would also be of interest, to compare the effects of the anomeric configuration on the activity and binding ability of the potential drug.
Possible improvements in this project would be the purification of products 2, 6 by HPLC, which would remove some impurities not removed during previous column chromatography.



References
1 Hung, L.C.; Lin, C.C.; Hung, S.K.; Wu, B.C.; Jan, M.D.; Liou, S.H.; Fu, S.L. Biochem. Pharm. 2007, 73, 1957–1970.
2 Fan, G.T.; Pan, Y.S.; Lu, K.C.; Cheng, Y.P.; Lin, W.C.; Lin, S.; Lin, C.H.; Wong, C.H.; Fang, J.M.; Lin, C.C. Tetrahedron 2005, 61, 1855–1862.
3 Plettenburg, O.; Bodmer-Narkevitch, V.; Wong, C.H. J. Org. Chem. 2002, 67, 4559–4564.
4 Goldsby, R.A.; Kindt, T.J.; Osborne, B.A. Immunology, Freeman, New York, 2000, 287–298.
5 Godfrey, D.I.; Hammond, K.J.; Poulton, L.D.; Smyth, M.J.; Baxter, A.G. Immunol. Today 2000, 21, 573–583.
6 Nieda, M.; Okai, M.; Tazbirkova, A.; Lin, H.; Yamaura, A.; Ide, K.; Abraham, R.; Juji, T.; Macfarlane, D.J.; Nicol, A.J. Blood 2004, 103, 383–389.
7 Kawano, T.; Cui, J.; Koezuka, Y.; Youra, I.; Kaneko, Y.; Motori, K.; Yeno, H.; Nakagawa, R.; Sato, H.; Kondo, E.; Koseki, H.; Taniguchi, M. Science 1997, 278, 1626–1629.
8 Nickel, M.; Brummer, F. J. Biotechnol. 2003, 100, 147–159.
9 Elewaut, D.; Kronenberg, M. Semin. Immunol. 2000, 12, 561–568.
10 Wu, D.; Fujio, M.; Wong, C.H. Bioorg. Med. Chem. 2008, 16, 1073–1083.
11 Miyamoto, K.; Miyake, S.; Yamamura, T. Nature 2001, 413, 531–534.
12 LaBell, R.Y.; Jacobsen, N.E.; Gervay-Hague, J.; O’Brien, D.F. Bioconjugate Chem. 2002, 13, 143–149.
13 Bhat, S.; Spitalnik, S.L.; Gonzalez-Scarano, F.; Silberberg, D.H. Proc. Natl. Acad. Sci. U.S.A. 1991, 88, 7131–7134.
14 Colombo, D.; Franchini, L.; Toma, L.; Ronchetti, F.; Nakabe, N.; Konoshima, T.; Nishino, H.; Tokuda, H. Eur. J. Med. Chem. 2005, 40, 68–70.
15 Cateni, D.; Bonivento, P.; Procida, G.; Zacchigna, M.; Gabrielli-Favretto, L.; Scialino, G.; Banfi, E. Bioorg. Med. Chem. 2007, 15, 815–826.
16 Xing, G.W.; Wu, D.; Poles, M.A.; Horowitz, A.; Tsuji, M.; Ho, D.D.; Wong, C.H. Bioorg. Med. Chem. 2005, 13, 2907–2916.
17 Fischer, E. Ber. Dtsch. Chem. Ges. 1893, 26, 2400–2412.
18 Koenigs, W.; Knorr, E. Chem. Ber. 1901, 34, 957–981.
19 Kunz, H.; Harreus, A. Liebigs Ann. Chem. 1982, 41–48.
20 Jones, J., II; Amino Acid and Peptide Synthesis; Davies, S.G., Compton, R.G., Evans, J., Gladden, L.F., Eds.; Oxford: New York, 2002, p. 25.
21 Han, S.Y.; Kim, Y.A. Tetrahedron 2004, 60, 2447–2467.
22 Dourtoglou, V.; Ziegler, J.C.; Gross, B. Tetrahedron Lett. 1978, 1269–1272.
23 Espín, J.C.; García-Conesa, M.T.; Tomás-Barberán, F.A. Phytochemistry 2007, 68, 2986–3008.
24 Sullivan, B.A.; Kronenberg, M. J. Clin. Invest. 2005, 115, 2328–2329.
25 Juaristi, E.; Cuevas, G. Tetrahedron 1992, 24, 5019–5087.
26 Shvily, R.; Müller, T.; Apeloig, Y.; Mandelbaum, A. J. Chem. Soc. 1997, 2, 1221–1234.
27 Corzana, F.; Busto, J.F.; Engelsen, S.B.; Jiménez-Barbero, J.; Asensio, J.L.; Peregrina, J.M.; Avenoza, A. Chem. Eur. J. 2006, 12, 7864–7871.
28 Nakata, T.; Nomura, S.; Matsukura, H. Tetrahedron Lett. 1996, 37, 213–216.
29 Fukunaga, K.; Yoshida, M.; Nakajima, F.; Uematsu, R.; Hara, M.; Inoue, S.; Kondo, H.; Nishimura, S.I. Bioorg. Med. Chem. Lett. 2003, 13, 813–815.
30 Joshi, B.P.; Park, J.; Kim, J.M.; Lohani, C.H.; Cho, H.; Lee, K.H. Tetrahedron Lett. 2008, 49, 98–101.
31 Lloyd-Williams, D.; Albericio, F. Chemical approaches to the synthesis of peptides and proteins; Giralt, E., Eds.; CRC Press: Boca Raton, 1997, pp. 115–127.
32 Gottlieb, H.E.; Kotlyar, V.; Nudelman, A. J. Org. Chem. 1997, 62, 7512–7515.
33 Carpino, L.A.; Abdel-Maksoud, A.A.; Mansour, E.M.E.; Zewail, M.A. Tetrahedron Lett. 2007, 48, 7404–7407.
34 Nakajima, N.; Saito, M.; Kudo, M.; Ubukata, M. Tetrahedron 2002, 58, 3579–3588.
35 Pretsch, E.; Buhlmann, P.; Affolter, C. Tables of Spectral Data for Structural Determination of Organic Compounds, Springer, 2003, 200–320.
36 Lin, C.C.; Shimazaki, M.; Heck, M.P.; Aoki, S.; Wang, R.; Kimura, T.; Ritzen, H.; Takayama, S.; Wu, S.-H.; Weitz-Schmidt, G.; Wong, C.H. J. Am. Chem. Soc. 1996, 118, 6826–6840.
37 Cheng, Y.P.; Chen, H.T.; Lin, C.C. Tetrahedron Lett. 2002, 43, 7721–7723.
38 Demchenko, A.V.; Rousson, E.; Boons, G.J. Tetrahedron Lett. 1999, 40, 6523–6526.



CLASSICS PANEL

Judging panel
Prof. Brian McGing (Trinity College Dublin) – Chair

Judges’ commentary
Athena is one of the most fascinating of the Olympian deities. In mythology she was said to have been born from the head of Zeus, who, in the oldest form of the myth, had become pregnant with Athena after swallowing Metis (which means ‘advice’ or ‘cunning intelligence’ in Greek). While this world of the gods has, for the modern reader at least, strange, fairytale elements, in Homer’s great poem of honour and death, the Iliad, it is made more familiar by its familial setting. It is, admittedly, an odd and rather dysfunctional family (Hera, Zeus’ wife, is also his sister; their son, Ares, is detested by his father; Athena is the product of a distinctly odd birth; and Zeus was a serial adulterer), and it was by no means an easy task to create a family out of the motley collection of Olympians that in mythology had many independent (and non-familial) roles. The assigning of gender roles was crucial for Homer’s creation of the Olympian family. Zeus, for example, has to be the ‘father’, Hera the nagging and jealous wife. It is the relationship between the core members of this family that Melanie analyses in this paper with great skill, and, in particular, how the ambiguous, androgynous Athena fits in. The circumstances of her birth are reflected in her two main roles as both a loving daughter. And there remains something of Metis in her too, most obviously in her friendship with the wily Odysseus in the Odyssey, but also in the intelligent advice she gives Achilles at the beginning of the Iliad. Melanie disentangles these various relationships, and their consequences, with clarity, verve and considerable sophistication of analysis. The paper is well planned, written and referenced; it makes excellent, and critical, use of modern scholarship and ideas on both Homer and gender.
To identify and problematize a topic, bring to bear ancient and modern evidence in a sophisticated manner, and produce a rational and ordered set of arguments in support of an independently devised case – these are some of the vital transferable skills we seek to impart to students in studying the ancient world. This essay is an excellent demonstration of those skills, and Melanie is to be warmly congratulated on a first class piece of work.


CLASSICS

The Son he never had: Zeus’ parthenogenetic creation of a surrogate son?
Melanie Hayes

‘The image of the Olympian family is determined by the image of the father.’1

This statement establishes the premise of this discourse, that Homer in his creation of a patriarchal society on Olympus defined the characterisations of the gods, though more noticeably the goddesses, in terms of their relationships with the patriarch. This treatment will claim that in subverting traditional aspects of the goddesses’ mythological personas, Homer highlights those characteristics that assist his creation of gender paradigms; characterisations reflecting gender concerns of contemporary Greek society. Positing the claim that an examination of the relationships that develop within this divine clan reveals a dichotomous divide, I will harness the exemplum of the divine coterie that emerges to highlight the gender concerns established by patriarchal relationships. One goddess, through her conflict with the patriarch, illustrates male concerns about the destructive power of female sexuality; the other, through her very existence, signifies male appropriation of female reproductive functions. This disquisition will examine the validity of the title statement, that in subverting the female contribution, Zeus created an image of himself, a surrogate son in the androgynous Athene, who owed her loyalty to the male alone.

As a mythopoetic creation, Homer’s Olympic family is not representative of any established system of Greek mythology.2 Homer goes to great lengths to portray a picture of a divine family all inhabiting the Olympic home of ‘father Zeus’. In establishing Zeus as the dominant male, the lord of the gods ruling over a collective community, this form of patriarchal society reflects that of Mycenaean Greece. Homer’s goddesses in turn act as mirrors to their mortal counterparts in their dependence on the dominant male for self-definition. These characterisations are developed at the expense of traditional aspects of these goddesses’ cult and mythic personas. Hera, for example, the earth goddess and protector of marriage as Zeus’s consort, is characterised as the archetypal nagging wife who keeps a jealous watch over her husband’s movements. The powerful nymph Thetis, champion of Zeus and Hephaestus, is cast as a helpless but devoted mother, supplicating ‘father Zeus’ on behalf of her son. Aphrodite, an ancient fertility goddess born from the seed of Uranos,3 becomes the rather ‘pathetic’ daughter of Zeus and Dione, whose relationship is juxtaposed against that of beloved and powerful Athene, the parthenogenetic daughter of Zeus.4

As a result of this androcentric community a subtle sort of theomachy occurs on Olympus, a cold war between the goddesses, each competing for the attention of Zeus, seeking his favour to grant their desires, vying for prominence and respect amongst the gods. This tension results in what Louden refers to as ‘catfights’5 between the goddesses: bitchy retorts are exchanged to undermine the opposition and elevate the goddesses in Zeus’ eyes. Take Athene’s attempt to humiliate her half-sister Aphrodite (5.418-430), calling to Zeus’ attention Aphrodite’s lack of warrior prowess and her crushing defeat at the hands (but at the bidding of Athene) of the mortal Diomedes.

1  Kerényi, C., C. Holme (trans.), Zeus and Hera: archetypal image of father, husband, and wife (London: Routledge & Kegan Paul, 1975), 59.
2  Kerényi 1975: 42.
This scene emphasises the parodic nature of Aphrodite’s role as protector of Paris and Aeneas when set against Athene’s as mentor to Achilles.6 More importantly it illustrates the divide that emerges between the goddesses, with Hera and Athene, as wife and ‘beloved daughter’, occupying the inner sanctum closest to Zeus and jealously guarding their influential positions from the other goddesses (5.418-420):

‘But Hera and Athene glancing aside at her began to tease the son of Kronus, Zeus, in words of mockery: the goddess grey-eyed Athene began to talk among them’

The other side of the coin is represented in book 21. Having been scolded by Hera for her insolence in taking on the ‘august consort of Zeus’ (21.481), Artemis turns telltale. Her pride and ears sore from the boxing, she flees to Olympus in tears, laying her troubles before Zeus (21.510) and implicating his wife as her abuser, who had ‘done this rash thing to her.’ The goddess Thetis inspires the jealousy of both Hera and Athene for her position of influence with Zeus. Noting the secret encounter between Zeus and Thetis (1.536-546), Hera rebukes Zeus for plotting behind her back, feeling that as his wife she deserved to be privy to his counsels. Athene confides in Hera her jealous fears that Thetis has usurped her influential position in Zeus’ heart (8.370, 8.374):

‘Yet now Zeus hates me, and is bent to the wishes of Thetis… Yet time shall be when he calls me again his dear girl of the grey-eyes.’

3  Blundell, S., Women in ancient Greece (Harvard: Harvard University Press, 1995), 36. Blundell notes that Homer’s most frequent epithet for Aphrodite is the ‘Cyprian’; in recalling her place of birth in mythological tradition Homer may be subtly acknowledging the alternative tradition which he has manipulated.
4  While Homer doesn’t explicitly discuss the tradition of Athene’s birth, she is referred to repeatedly as ‘daughter of Zeus’, suppressing the tradition of her mother Metis’ conception of her.
5  Louden, B., ‘The Gods in epic or the divine economy’ in J.M. Foley (ed.), A companion to ancient epic (Malden, 2005), 95.
6  Louden 2005: 97.

So what unites this ‘divine coterie’7 against the others? Homer, in a brief but controversial allusion to the ‘Judgement of Paris’, offers for some the uniting element: their hatred for (24.27-30) ‘Priam and his people, because of the delusion of Paris who insulted the goddesses when they came to him in the courtyard and favoured her who supplied the lust that led to disaster.’ O’Brien posits that it is the injured pride of the two goddesses, their jealousy at Paris’ preference for the allure of Aphrodite, that inspires the combined wrath of Hera and Athene against the Trojans. While this theory fits with the Homeric characterisation of Hera as a jealous and vindictive goddess (one could believe that ‘this petty affront’, as Slater puts it, could be the ‘source of [Hera’s] devastating rage’), it does not sit so easily with the portrait of Athene, the obliging asexual goddess. Shearer, rejecting the theory that Athene’s endless scheming and her energetic action on the battlefield was because she was ‘miffed’ over Paris’ choice, posits rather that Athene, like the other gods, took part in this tug of war between sides for sheer sport and love of battle.8 What this debate does highlight is the contrasting characterisations of these two goddesses; their alternative methods of reaching their common goal, to win the favour of Zeus to achieve the destruction of Troy. This duo juxtaposes, in their relationship to Zeus, the nagging wife against the dutiful daughter; the vindictive matron and the helpful virgin.
Highlighting the importance of sexuality in female characterisations, the sexual power of a female in her reproductive role correlates to negativity and antagonism, while ‘benevolence among goddesses is highly correlated to virginity.’9 ‘Virginal and boyish Athene… [is] the most helpful female deity in the Greek pantheon’; Hera, maternal and sexual, is the ‘most vindictive and persecutory’.10 Finley describes Hera as the ‘most complete female… whom the Greeks feared a little and did not like at all.’11 Homer’s portrait of the Olympian family would appear to place Hera in this light. Referred to as a ‘brazen-faced mother’ by her son12 (18.395) and as ‘reckless of word’13 by her brother-in-law (8.209), she is honoured among the gods only as Zeus’s consort, as one ‘who lie[s] in the arms of Zeus’ (14.213). Zeus continually fails to grant Hera her due respect as his wife, by his constant infidelities and in excluding her from his counsel. He exerts his position of dominance (1.565) by threatening to whip her, reminding her of previous punishments at his hands (15.17-20). Zeus is ‘lord of Hera’ (7.411), but while he reverts to displays of brute strength to subordinate her, Hera is characterised as utilizing her inherently devious and deceitful female nature, harnessing her sexuality to assert her will. Through her seduction of Zeus in book 14 (14.153-360), linked to the tale of her delusion of Zeus (19.91-138) by the refrain dolophrosunê (guile[ful]), Hera is cast as the manipulative female. ‘Hera seeks to manipulate, to outwit her spouse with calculating charm, deceit (apatê), and superior intelligence.’14 While this picture of the ‘scheming wife’ does not present a positive image of women, it does highlight the strength of Hera’s character, fighting her subordinate position to the patriarch. Blundell describes her ‘strong and vigorous personality’;15 Hera herself reminds Zeus of her dynastic claims to power (18.364-5):

‘As for me then, who claim I am highest of all the goddesses both ways, since I am eldest born and am called your consort.’

Though Hera’s claims rely on a patriarchal framework of male control, according to Pomeroy ‘the domination of Zeus over Hera … is constantly threatened.

7  O’Brien, J.V., The transformation of Hera: a study of ritual, hero, and the goddess in the Iliad (Lanham: Rowman & Littlefield Publishers, Inc., 1993), 91.
8  Shearer, A., Athene: image and energy (London: Viking Arkana, Penguin, 1996), 9.
9  Slater, P.E., The glory of Hera: Greek Mythology and the Greek family (Princeton: Princeton University Press, 1968), 66.
Hera, as her husband’s sister, is his equal, and is never totally subjugated.’16 While Zeus does not appear to take Hera’s claims for honour seriously (he ‘rebukes his wife-sister when it serves his purposes (e.g. 1.544-50)’17), he does fear her wrath. When being supplicated by Thetis, Zeus’ main concern is the reaction of his wife (1.518-19): ‘This is a disastrous matter when you set me in conflict with Hera, and she troubles me with recriminations.’

10  Ibid.
11  Slater 1968: 66.
12  This portrait highlights Hera’s role as the archetypal ‘bad mother’, thus undermining her maternal contribution and stressing the importance of patrilineal succession.
13  ‘Reckless’ typifies the gender bias of characterising females as inherently uncontrolled and wild, something to be subdued by the dominant male.
14  O’Brien 1993: 179.
15  Blundell 1995: 34.
16  Pomeroy, S.B., Goddesses, Whores, Wives and Slaves: Women in Classical Antiquity (New York: Schocken Books, 1975), 7.
17  O’Brien 1993: 84.



While the threat of Hera’s wrath is a cause of anxiety for her spouse (15.18; note Zeus’ desire that Thetis ‘go away for fear she see us’, 1.523), ‘the strongest of all the immortals’ does not concede to the wishes of his wife. For all her ‘forcefulness the pattern of male domination’ on Olympus remains intact.18 Athene’s close relationship with her father throws the fraught interaction between Zeus and his wife into greater relief. Among the Olympians it is his virginal daughter that Zeus seems to admire and trust most. ‘Athene is warrior, judge, and giver of wisdom, but she is masculinised and denied sexual activity and motherhood.’19 Homer’s characterisation of the goddess Athene encapsulates the attributes of her father, to the detriment of many of her cultic attributes. She is the ‘destroyer of cities, divider of booty, goddess of spoil, marshaller of the host’; bellowing, she joins the action on the battlefield (20.48). Adopting aggressive male stances, she dons the aegis of her father (5.736). She is not, however, a goddess of uncontrolled violence, as Hera is characterised; rather her disciplined nature identifies her with the controlled male, ruled by the head instead of passions. Athene is known for schemes and strategy: she is Athene polyboulos (5.260), and like her father she is known for sophia (wisdom) and practical intelligence. Homer contrasts the controlled anger (4.20-25) of ‘the clever and supportive Athene, the child of the father’20 with Ares, who in his uncontrolled menos is associated with his mother.21 As an honorary male and obvious favourite, is Athene the son Zeus never had? Zeus’ disdain for his son Ares is categorically stated, ‘to me you are the most hateful of all the gods’ (5.890), but what of strong-minded Apollo?
Though he is ‘beloved Phoibos’ (15.221), Apollo, ‘the worthiest son of Zeus’, is only once referred to as Zeus’ son (7.37); like Ares he is linked more with his mother, Leto.22 Athene is the child, as far as Homer is concerned, of Zeus alone. Athene, it would appear, owes her position in the Olympian family as a surrogate son to her parthenogenetic birth, Zeus’ creation of a child in the image of her father. Athene is the son he never had… thank goodness. For while Homer suppresses any direct mention of the alternative traditions of Athene’s birth, of her mother Metis, in order to present Zeus as the sole parent, usurping the female reproductive role, this tradition can in many ways account for the bond between father and daughter. It was the act of swallowing Metis that assured Zeus’s triumphal dominance of the Olympian order; ‘he put her in his belly, fearing that she would give birth to something else mightier than the thunder-bolt.’23 Thus the patriarch prevented the prophesied birth of a son to Metis, one destined to overthrow him as the supreme Olympian. ‘Zeus can rule forever because he has prevented the birth of an heir.’24 Athene, the result of a union between Zeus and Metis, born subsequently from Zeus’ head, is as a female no threat to Zeus’ hegemony. The circumstances of her birth dictate her ambiguous nature. Athene is the androgynous daughter who owes her ‘allegiance to her father and to the males he favours.’25 This ambiguity seems to be encapsulated in her patriarchally determined virginity;26 being ‘born to her father’ and desiring to belong to him has produced, according to Kerényi, ‘the virginity of the father’s daughter’.27 The feminine aspects of Athene’s character, not overtly emphasised by Homer, reveal themselves in tender moments between father and daughter. She is ‘Tritogenia, dear daughter’ whom Zeus has hurt with his disregard for her wishes; but she must ‘take heart’, his ‘meaning toward [her] is kindly meant’ (22.183-185). Zeus addresses Athene with the epithet ‘bright’ or ‘grey eyes’, his ‘favourite appellation for his favourite daughter’; one that may indicate a level of intimacy.28 Athene acknowledges the special relationship and the signalling of intimacy with this epithet, predicting that the enmity between the pair caused by Thetis’ request will subside and he will ‘again call me his dear girl of the grey eyes’ (8.373). As the favourite child Athene has special privileges, but ‘because her words have a special way of reaching her father’s ear, there’s an especially delicate balance to be kept.’29 Note Zeus’s indignation with Athene (8.406-8), more so than with Hera, for the goddesses’ attempt at thwarting his plan; he is used to this behaviour from his wife, ‘it is his daughter’s defiance that has hurt and enraged him.’30 In turn, accustomed to getting her way with her father, Athene sulks when rebuked (8.460): her ‘wicked’ father has ‘crossed [her] high hopes.’ So Athene as both beloved daughter and surrogate son walks an ambiguous line.

18  Blundell 1995: 34.
19  Pomeroy 1975: 8.
20  Blundell 1995: 34.
21  O’Brien 1993: 85.
22  Kerényi 1975: 55.
23  Doherty, L.E., Siren Songs: gender, audiences, and narrators in the Odyssey (Michigan: The University of Michigan Press, 1995), 1-2. This fragment of text cannot be confidently assigned to a specific work.
Blundell proposes that Athene ‘transverses and transcends the boundary between masculine and feminine roles’ in many of her activities. Doherty states that ‘Athene is consistently portrayed as male-identified, most notably in her role as patron of warriors.’31 To Blundell, however, this role as ‘kourotrophos’ to young men evokes a feminine quality: the role of helper, depicted in 18.204 when Athene arrays Achilles for battle, is in its subordinate status often associated with the female. The tradition of Athene’s birth which relates that she emerged from Zeus’ head dressed in battle gear sits comfortably with Homeric Athene, the archetypal warrior goddess. Shearer, however, describes a lesser-known aspect of this tale, that when Athene had ‘gained some distance from her father Zeus’s head …she was able to take off her armour’, ‘the fundamental elements returned to their accustomed ways’,32 her feminine nature restored. Is Athene then truly ‘just for the male’? Returning to the non-Homeric tradition of Athene’s conception, we should consider the fact that Metis remained inside Zeus, becoming his attribute, as proclaimed by the epithet Metieta, meaning ‘counsellor’ or ‘deviser’.33 Metis is cleverness personified; ‘Zeus the deviser’ uses metis to consolidate his rule over the gods. Athene in turn absorbs this attribute, one which seems to associate her with her father, but could also be construed rather as coming from her mother. That Homer was aware of this tradition is proclaimed by Athene herself… ‘I among the gods am renowned for my metis and wiles’ (Odyssey 13.299).34 Metis as ‘the personification of prudence’35 transfers this attribute to her daughter; a quality Athene reflects in her first act of the Iliad (1.197-222), where in her prudent intervention she appeals to Achilles’ reason. A link between mother and daughter remains: Metis is held captive in Zeus’ belly acting as advisor, while Athene, who is ‘ostensibly free and powerful’, ‘is likewise an advisor to males’36 and controlled ultimately by the patriarch Zeus. Athene, half of the divine coterie that emerged on androcentric Olympus, is a female whose primary ties are to males. She is the favourite child of Zeus who owes her loyalty to her father. Homer, in suppressing the role of Metis, claims Athene for Zeus alone, demonstrating that it is the father as sole parent and the dominant male who defines her character.

24  Doherty 1995: 7.
25  Ibid.
26  Blundell 1995: 26.
27  Kerényi 1975: 53.
28  Clay, J.S., The wrath of Athene: Gods and men in the Odyssey (Lanham/London: Rowman & Littlefield Publishers, Inc., 1997), 206. A form of this epithet is used by Odysseus at an intimate moment between mortal and goddess in the Odyssey.
29  Shearer 1996: 15. According to Shearer ‘she alone knows where he keeps the key to his thunder-bolt storeroom and has permission to use that mighty arsenal.’
30  Ibid.
31  Doherty 1995: 174n.39. Clay (1997, 181) notes that Zeus and Athene share the same sphere of influence among mortals; like Zeus, Athene supports kings and warriors.
The special relationship between father and daughter, a vivid contrast to the conflict between patriarch and his consort, is explained by the circumstances of her birth; one which has ‘secured the stability of Zeus’ political regime, and has at the same time validated patriarchal control within the family.’37 ‘Yes, with all my heart I am my father’s child’ (Eumenides 739); Homer has succeeded in establishing a patriarchally determined characterisation of Athene, which, like those of the other Olympian goddesses, would define her persona throughout antiquity; portrayals reflective of broader gender concerns. However, in the tender relationship between father and daughter, and in the oblique references to Athene’s association not just with the male but also with the female, her mother Metis, Homer portrays Athene as an androgynous ‘tom-boy’ daughter rather than the son he never had; doted on by her father, her virginal status negating her feminine threat to the patriarch’s hegemony.

32  Shearer 1996: 5.
33  Ibid.
34  Clay 1997: 199.
35  Bell, R.E., Women of classical mythology: a biographical dictionary (New York/Oxford: OUP), 306.
36  Doherty 1995: 8.
37  Blundell 1995: 28.



COMPUTER SCIENCE PANEL

JUDGING PANEL
Dr. Fred Cummins (University College Dublin) – Chair
Dr. John Mc Kenna (Dublin City University)
Dr. Mikael Fernstrom (University of Limerick)
Dr. Rem Collier (University College Dublin)

Judges’ Commentary
The paper selected as the winner of the Computer Science category is Beetlz – BON Software Model Consistency Checker for Eclipse by Eva Darulova. This paper describes an integrated approach to the specification and design of Java software artefacts based on a combination of the Business Object Notation (BON) and the Java Modeling Language (JML). The core work carried out is the identification of a mapping between BON and JML that allows BON specifications to be automatically translated into JML, which is then used to generate template Java code. Building on this, the report describes how tool support based on the core work has been developed and integrated into one of the most popular Java development environments, Eclipse. In summary, the report describes an outstanding piece of computer science work. It is very well written, easy to read, and has good coverage and depth of all the relevant background topics. The core contributions of the work combine both theoretical and practical components that have been leveraged to deliver a high quality finished product that has been made available for use.



Computer Science

Beetlz – BON software model consistency checker for Eclipse
Eva Darulová

Abstract
Development of a software project usually involves, to some extent, both modelling and specification languages. Although both are useful in their own right, using them together in an interconnected way brings many benefits to all stages of the development process. Work can proceed on both the model and the implementation concurrently. However, this approach requires tool support that keeps the two versions consistent and updates them when necessary. This report discusses the theoretical and practical considerations of such a combination between the Business Object Notation and Java, together with the Java Modelling Language. It defines and discusses relations between individual concepts and presents their implementation in the automatic consistency checking Eclipse IDE plugin and tool ‘Beetlz’.

Introduction
Why another Eclipse tool?
The term Software Engineering covers a broad field in computer science that in general can be described as researching ways to produce high-quality, cost-efficient and reliable software [1]. As software itself varies greatly across applications, so do the techniques that are used to produce it. Software engineering, although the name may imply so, is not an exact science, and building a faultless software product is, by its very nature, very difficult [2]. Examples of popular approaches to building reliable and manageable software are object-oriented programming (OO) [3] and formal methods


[4]. While the first is widely known through programming languages like C++ [5], Java [6, 7] and Eiffel [8], formal methods remain fairly unknown to the everyday programmer. Their main goal is to write a formal specification of a software system, that is, a precise and complete description of exactly what the software is supposed to do, which can potentially also be used for verification purposes. To achieve the needed precision a mathematical notation is required; however, the extent of formalism depends on the program, its purpose and also user preference: a more rigorous example is the Z notation [9] (which can be somewhat off-putting for the mathematical layman), but it is also possible to specify a system only partially or to use an easier, if less powerful, notation. In whatever form, though, once applied formal methods can highlight problems at an early stage, reduce testing cost and increase overall quality.

Closely tied to OO programming is an approach called ‘Design by Contract’ (DbC) [10]: here a ‘contract’ is set up between a client (a class) that uses the services of a supplier (a function of another class). This specifies the minimum requirements the client has to satisfy before it can call a function (the precondition) and the minimum that the supplier guarantees in return (the postcondition). Additionally, each class specifies an invariant, which is basically a set of predicates that have to hold true at any point in time. By this contract, the responsibilities are clearly assigned and each piece of software concentrates on fulfilling its designated role.

Another technique currently applied throughout software development is software modeling. It is used to describe the product at an abstract level, which depicts the program in a more understandable way than, for example, source code, so that all team members can be included in discussions, from the designer and the project manager to the actual programmer and tester.
Examples include the popular modeling language UML (Unified Modeling Language) [11, 12] and the lesser-known Business Object Notation (BON) [13, 14]. Both are object-oriented and thus easily applied together with common implementation languages. The result is that, ideally, one has a high-level model to facilitate communication between the team members and a formal specification that ensures correctness of the product. In reality, however, since updating the model to depict the ever-changing software requires effort, it is mostly abandoned and the resultant benefits are lost. If one provides tool support that helps in this updating process, the model is more likely to be maintained for the duration of the software development.

This project joins the modeling language BON with the implementation language Java to show how such a combination is realizable. The mappings, or mathematically speaking the relations, are implemented by the ‘Beetlz’ tool, which takes input from BON and Java/JML and provides feedback on where the two artefacts are inconsistent. A brief introduction to BON and JML is given in section i, on which section ii then builds to describe the theoretical relations between BON on one side and Java and JML on the other. The report closes with conclusions and future work. For space reasons, this report is only an abbreviated version of the original. For the full report and the implemented tool please see [15, 16].


i. Background

i.i BON – a modelling method
BON [14] is a modeling method for the design and analysis of object-oriented programs and puts into practice the core of the object-oriented paradigm. BON specifies both a notation and a process for OO systems. Here, only the static textual notation will be investigated, but details about other possible descriptions and about the process can be found in the book “Seamless Object-Oriented Software Architecture” [14]. Only the formal description is considered in this case, since it most closely corresponds to implemented source code. The basic element of the formal description is the class. Since BON is strongly typed, a class also represents a type. Each class consists of a name, a header, generic parameters, an inheritance clause, a set of features (corresponding to Java methods and fields) and an invariant (see the example below).

    effective class KEEPER
    indexing
      about: “A type of personnel that looks after the animals.”
    inherit
      PERSONNEL
    feature
      name: STRING
      redefined getID: VALUE
      feedAnimal: BOOLEAN
        -> an: ANIMAL
        ensure
          delta animalsToLookAfter;
          an.hungry = false;
        end
    feature{PERSONNEL}
      animalsToLookAfter: SET[ANIMAL]
    invariant
      animalsToLookAfter.count < 50;
    end

The name is unique in the system and therefore serves as an unambiguous identifier. Class headers convey more detailed information about the intended use of the class [14]:
• deferred: Class not (fully) implemented.
• effective: Class is implementing a deferred class or reimplementing an interface.
• root: A process can start from here. There must be exactly one root class in each system [17].


• interfaced: All features are visible to all classes.
• reused: Class is reused from a library.
• persistent: Class instances are potentially persistent.
• generic: Class is parametrized. The allowed types can be further constrained.

Any class providing some sort of functionality also has a set of features. A feature closely corresponds to a Java method or a Java field, as BON does not have a special concept for attributes. Each feature has a class-unique name, return and parameter types, a renaming clause and a pre- and postcondition. BON categorises features into queries and commands. While the first returns some value but does not change the state of an object, the latter does not return anything but may change object state. This distinction is important as only queries, which are side-effect free, may be used in assertions. Note that BON does not allow ‘hybrids’, that is, features which both return a value and change state.

Design by Contract is realised in BON by means of assertions. These constitute class invariants and features’ preconditions and postconditions and are introduced by the keywords invariant, require and ensure respectively. They are also all inherited by child classes and must be obeyed. When redefining features, the covariant rule applies: preconditions may only be weakened, and postconditions strengthened, as prescribed by DbC [10]. In addition, the covariant rule applies to feature signatures as well, to ensure correct and sensible use of inheritance relations. Assertions are written in first-order predicate logic, which is extended by keywords that are necessary to allow for reasoning about features.

Inheritance, marked by the keyword inherit, can be simple (one parent class), multiple (several parent classes) or repeated (multiple inheritance from the same class). The first two types are common to most OO languages, while repeated inheritance is less common since it poses some challenges due to possible naming conflicts.
BON also provides a way of grouping classes into clusters, which are similar to Java packages. However, since they ultimately can be expanded into class relations only, we will confine ourselves to looking at fully expanded static charts with classes only.

i.ii JML – a specification language
Assuming the reader is fairly familiar with Java, we continue directly with its formal specification language JML [18]. It is particularly accessible, even for beginners, since it mostly uses an extended Java syntax and can be included directly in Java source code in the form of annotation comments. An example of such annotations is given below.

    /**
     * A type of personnel that looks after the animals.
     */
    public /*@ nullable_by_default @*/ class Keeper implements Personnel {
        private /*@ spec_protected @*/ Set<Animal> animalsToLookAfter;
        private /*@ spec_public @*/ String name;

        //@ invariant animalsToLookAfter.size() < 50;
        //@ constraint name == \old(name);

        @Override
        public /*@ pure @*/ int getID() { ... }

        //@ assignable animalsToLookAfter;
        //@ ensures an.hungry == false;
        public boolean feedAnimal(Animal an) { ... }

        //@ assignable \nothing;
        public List<String> getAnimalNames() { ... }

        /**
         * A Mop is an ‘integral’ part of each Keeper.
         */
        public static class Mop { }
    }

JML is a behavioural interface specification language: it describes both the interface and the behaviour of a Java module, such as a class or a method, by extending the interface with a specification. The interface is the standard Java declaration (in the case of a method this is the method declaration) and the behaviour is specified by an annotation comment. Each class can be annotated with one or more invariants, history constraints (constraints for short), and initially clauses [19]. Each method or constructor can be annotated with specification cases. Cases are inherited from super types; each case describes the behaviour that must be satisfied by the method or constructor and can be given in light-weight or heavy-weight form. The former only supplies the minimal information needed or wanted, whereas the latter is intended to supply a full specification. For all specifications and assertions it holds that only side-effect free, or pure, expressions may be used. Individual annotations and predicates are written in standard Java syntax extended by some keywords and operators necessary for proper reasoning. Sometimes the variables and methods declared in a Java


public interface do not provide enough detail and flexibility to describe a class’ behaviour. For this case, JML provides the model identifier, which lets the user define additional fields, methods, constructors and even types that can be used as part of the specification, but not part of the Java API [19]. Ghost fields are similar to model fields in that they abstract values for specifications, but they differ in that their values are not given by existent Java fields but by explicit initializations.
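The difference between model and ghost fields can be sketched in a few lines. The class below is a hypothetical example, not taken from the Beetlz report, and the annotations follow the JML reference manual [19] as far as we recall it; they have not been run through a JML checker. Because JML lives in comments, the file compiles with a plain Java compiler.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical enclosure class used only to illustrate model and ghost fields. */
class Enclosure {
    // Model field: a specification-only abstraction whose value is derived
    // from the existing private Java field via the represents clause.
    //@ public model int animalCount;
    //@ private represents animalCount = animals.size();

    // Ghost field: specification-only state that is not derived from any
    // Java field; it changes only through explicit 'set' statements.
    //@ public ghost boolean everOpened = false;

    private final List<String> animals = new ArrayList<>();

    //@ ensures animalCount == \old(animalCount) + 1;
    void admit(String animal) {
        animals.add(animal);
    }

    //@ ensures everOpened;
    void open() {
        //@ set everOpened = true;
    }

    /*@ pure @*/ int size() {
        return animals.size();
    }
}
```

In this sketch the specification of admit talks about animalCount rather than the private list, so the public contract stays readable without exposing the representation.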

ii. Relations
Now that the most important concepts have been defined, we turn to a detailed comparison and try to define relations for the individual elements. It should be kept in mind that Java is a possible refinement of BON. It is not the refinement though, since the two languages were developed independently. Therefore, it is in general the case that the BON model gives the minimum of information whereas the Java implementation will be much more expressive. It follows that information may be added when converting a BON model into Java source code and information may be lost when going in the other direction. This is in general not a deficiency, merely an abstraction, and so all places in the following section where an element is said to be ignored should be read with this in mind. Additionally, not all elements are relevant: the implementation will naturally have elements that serve merely as helpers, for instance, private fields or accessor methods. These elements are implementation details and are excluded from comparison. Therefore, defining a relation between BON and Java consists of identifying the relevant elements and relating those individually.

Both BON and Java are object-oriented, with the class as the basic unit and functionality provided by features. Since OO programming is inherently modular, the comparison can be made in such a fashion as well. Thus, a comparison will be structured on a per-class and per-feature basis. This section defines the relations between BON, Java and JML, first for Java-only elements and then relating BON’s assertion language to JML.

Given two projects, the first task is to find which classes correspond to each other. Since class names are unique, they can be used for such an identification. Given the different naming conventions, DANGEROUS_ANIMAL and zoo.animal.DangerousAnimal can be regarded as equal. All classes in BON are public; therefore Java’s class visibility is regarded as an implementation detail. It should also be noted that private classes are not part of the public API and thus are not included in comparisons.
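The name matching described above can be sketched as follows. This is a minimal illustration under our own assumptions, not the algorithm used by Beetlz: the class ClassNameMatcher and its methods are hypothetical, and the sketch assumes that BON names are upper case with underscores and that Java names use conventional camel case.

```java
import java.util.Locale;

/** Hypothetical helper; not taken from the Beetlz source code. */
final class ClassNameMatcher {
    private ClassNameMatcher() { }

    /** Converts a BON class name such as DANGEROUS_ANIMAL to DangerousAnimal. */
    static String bonToJavaSimpleName(String bonName) {
        StringBuilder sb = new StringBuilder();
        for (String part : bonName.split("_")) {
            if (part.isEmpty()) {
                continue; // tolerate doubled or leading underscores
            }
            sb.append(part.substring(0, 1).toUpperCase(Locale.ROOT));
            sb.append(part.substring(1).toLowerCase(Locale.ROOT));
        }
        return sb.toString();
    }

    /** Strips the package prefix from a qualified Java class name. */
    static String simpleName(String qualifiedJavaName) {
        int lastDot = qualifiedJavaName.lastIndexOf('.');
        return qualifiedJavaName.substring(lastDot + 1);
    }

    /** True if the BON name and the Java name denote the same class under the assumed conventions. */
    static boolean matches(String bonName, String qualifiedJavaName) {
        return bonToJavaSimpleName(bonName).equals(simpleName(qualifiedJavaName));
    }
}
```

With these assumptions, ClassNameMatcher.matches("DANGEROUS_ANIMAL", "zoo.animal.DangerousAnimal") holds. Java names containing acronyms (for example XMLParser) would not match under this simple rule and would need a more tolerant comparison.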

ii.i BON – Java relations
This section examines the interesting parts of relations without assertion elements.


ii.i.i Class modifier
A BON model only declares simple classes, whereas Java distinguishes between classes, interfaces and enumerated types. The last two can be thought of as ordinary classes with certain restrictions. An interface only contains implicitly public static final constant fields and public abstract methods and can thus be viewed as a kind of abstract class. One possibility for relating an interface is to restrict the BON class to be deferred and to have only deferred and public features. However, this mapping breaks down when model fields and methods or ghost fields are introduced. These are not abstract and should also appear as not abstract in the corresponding BON class.

Enumerated types contain constants of their own type and these are implicitly distinct. Hence an equivalent class in BON must define a set (which has distinct elements by definition) of constants of its own type. However, this is still not enough to uniquely identify an enumerated type, as normal methods are allowed in Java enum classes as well. A naming convention for this set, such as enumeration, can be agreed upon to solve the problem.

ii.i.ii Inheritance
Since BON does not make a distinction between a class and an interface, inheritance and interfacing are resolved by the same mechanism. Hence, a BON super class may correspond to a super class or an interface in Java. As far as typing is concerned, these are equivalent. Multiple inheritance, which is not allowed as such in Java, is solved in a similar fashion. All super classes of a class must be compared against all interfaces combined with the super class. Issues will arise if the class inherits from multiple abstract classes. This clearly causes a compile error in Java. As described in subsection ii.i.i, an interface can relate to an entirely abstract class, in which case the conflict is solved. If it is not entirely abstract, there is no easy solution and an inconsistency remains.
One way, although restrictive, is to limit the number of inherited abstract classes in BON to one. Since this can lead to changeable and unclear constraints, it may be more desirable to accept these discrepancies as inherent to the differences in the two languages. By providing multiple inheritance and feature renaming, BON also allows repeated inheritance. This may introduce potential naming conflicts and requires a procedure for resolving them. In BON, this has to be done manually by renaming features ([17], section 2.1) and hence requires careful thought on the designer’s side. Java, by disallowing multiple inheritance, does not support this type of inheritance directly. However, it can be realised by implementing an interface repeatedly. This approach differs in that it does not introduce any naming clashes [20]. Interfaces only provide a method declaration, so that each implementing class will only have one copy of each repeatedly defined method and no conflict arises. Any repeated inheritance with non-interfaces ultimately violates Java’s syntax, so no relation is possible.
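Repeated inheritance through interfaces can be illustrated with a small example. The types below are hypothetical and chosen only for illustration; they show that a class reaching the same interface along two paths still ends up with a single method to implement.

```java
/** Hypothetical types for illustration only. */
interface Feedable {
    String feed();
}

interface Mammal extends Feedable { }

interface Predator extends Feedable { }

// Lion reaches Feedable along two paths (via Mammal and via Predator),
// yet there is only one feed() declaration to implement, so no clash arises.
class Lion implements Mammal, Predator {
    @Override
    public String feed() {
        return "lion fed";
    }
}
```

Had Mammal and Predator been abstract classes rather than interfaces, the declaration of Lion would not compile, which is the case the text above describes as having no relation.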


ii.i.iii Generics
Types in both languages are allowed to have generic parameters. The principles used are equivalent ([3], chapter 10): a class may have an arbitrary number of generic parameters, each of which can be restricted. In BON this restriction is limited to one type, whereas in Java one may list an arbitrary number of types, all of which, interfaces included, are listed with the keyword extends. The order of Java constraints is arbitrary, hence it is sufficient for a relation if the one BON constraint is also present in the Java list. As with inheritance, types that are not in the model, and are thus assumed to be implementation details, should be ignored. The situation can also be solved by combining multiple Java types into one by, for example, declaring one interface that extends all interfaces listed.

ii.i.iv Feature names
Just as class names have to be matched during a comparison of a BON project and a Java implementation, so do feature names. Unlike for classes, BON does not have explicit naming conventions, although feature names are generally written in lowercase [14]. This is consistent with Java, where the convention also is to have the first letter lowercase. The difficulty arises when overloaded methods are used, a mechanism not allowed in BON. However, disallowing overloading outright is likely to cause a lot of unnecessary errors. A middle-course solution is to adopt the convention that overloaded features in BON are distinguished by numbering: feed, feed1, feed2 ... , and to map them to Java methods on the basis of the count and type of their formal parameters. Since Java is a refinement of BON, any additional methods in Java are ignored. However, all BON features are part of the relation, so for a tool this means that additional BON features have to be flagged as errors when checking the relation.
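The numbering convention for overloaded features can be sketched as follows. This is a hypothetical illustration, not the Beetlz implementation: the names OverloadMatcher and Signature are our own, type names are assumed to have been normalised already, and parameter order is ignored on the assumption that BON does not fix it.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Hypothetical sketch of matching numbered BON features to Java overloads. */
final class OverloadMatcher {
    private OverloadMatcher() { }

    /** Minimal stand-in for a BON feature or a Java method signature. */
    static final class Signature {
        final String name;
        final List<String> parameterTypes;

        Signature(String name, List<String> parameterTypes) {
            this.name = name;
            this.parameterTypes = parameterTypes;
        }
    }

    /** Strips a trailing number: feed, feed1 and feed2 all yield feed. */
    static String baseName(String bonFeatureName) {
        return bonFeatureName.replaceAll("\\d+$", "");
    }

    /** Returns the Java overload with the same base name and parameter types, or null. */
    static Signature findJavaMatch(Signature bonFeature, List<Signature> javaMethods) {
        String base = baseName(bonFeature.name);
        List<String> wanted = sorted(bonFeature.parameterTypes);
        for (Signature candidate : javaMethods) {
            if (candidate.name.equals(base)
                    && sorted(candidate.parameterTypes).equals(wanted)) {
                return candidate;
            }
        }
        return null; // an unmatched BON feature would be flagged as an error
    }

    private static List<String> sorted(List<String> types) {
        List<String> copy = new ArrayList<>(types);
        Collections.sort(copy);
        return copy;
    }
}
```

A feature whose real name ends in a digit would be stripped as well, so a practical tool would need a stricter rule than this sketch uses.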
As part of encapsulation and information hiding, a common procedure in Java is to declare variables private and provide access with getter and setter methods. The BON model describes and specifies the state of an object and will thus work with exactly those private features. Hence, a mechanism for distinguishing mere accessor methods from fields for specification is needed. One possibility is to ignore all methods with the prefixes get-, has-, set- and is-. In reality, this proves to be too eager as it hides methods with proper functionality. If, however, two members share the same name except that one carries one of the above prefixes, the prefixed one can be assumed to be an accessor. In that case, the accessor method is ignored and the corresponding field is used for the relation instead. Private variables are commonly used for JML specifications instead of accessor methods by changing their specification visibility with the keywords spec_public and spec_protected. For the purpose of relating two features, this visibility is considered instead of the pure Java modifier. If JML annotations are disallowed, then the visibility modifier of the Java fields must be chosen appropriately to allow them to be used in specifications.
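The accessor heuristic above can be sketched in a few lines. The helper below is hypothetical and only one possible reading of the rule: a method counts as an accessor only if it carries one of the four prefixes and a field with the remaining name exists.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Set;

/** Hypothetical sketch of the accessor heuristic; not the Beetlz implementation. */
final class AccessorFilter {
    private static final List<String> PREFIXES = Arrays.asList("get", "set", "has", "is");

    private AccessorFilter() { }

    /**
     * Returns the name of the field that methodName gives access to,
     * or null if the method is not treated as an accessor.
     */
    static String accessedField(String methodName, Set<String> fieldNames) {
        for (String prefix : PREFIXES) {
            if (methodName.length() > prefix.length() && methodName.startsWith(prefix)) {
                String rest = methodName.substring(prefix.length());
                if (!Character.isUpperCase(rest.charAt(0))) {
                    continue; // e.g. "issue" is not "is" + "Sue"
                }
                String candidate = Character.toLowerCase(rest.charAt(0)) + rest.substring(1);
                if (fieldNames.contains(candidate)) {
                    return candidate;
                }
            }
        }
        return null;
    }
}
```

For a class with the fields name and hungry, getName and isHungry would be ignored in favour of their fields, whereas getID and feedAnimal would remain part of the comparison.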


ii.i.v Feature signature
In both languages the feature signature consists of an optional return type and a number of optional formal parameters. Due to the possibility of overloaded methods, the order of Java formal parameters is important. In BON this does not play a role, so a relation on the signature checks that the return types and the number of formal parameters match and that all types in the model are present in the implementation, in whichever order. It is assumed that all input has compiled correctly, so it need not be checked whether the types actually exist and/or are valid. Types are matched primarily based on name, except for types that are being declared in the model itself, which automatically have a mapping from class relations. Furthermore, basic types and some commonly used classes and interfaces from Java’s standard library should be recognised. Since Java 1.5 a covariant redefinition of return types is allowed [7], but BON applies the covariant rule for the whole signature. For parameter types a conflict thus arises that can only be solved if we require that BON uses redefinition for return types only.

ii.ii BON – JML relations
A BON model can be used with a standard Java implementation, that is, without use of JML, by simply ignoring assertions and all model and ghost annotated elements. It will then describe the program structure and relate directly to the Java API, but it will not provide feedback on specifications and thus correctness or reliability. JML is tailored to Java just as the BON assertion language is tailored to BON, so that it is possible to relate the two in a similar fashion as in section ii.i. Before looking into the relations, let us re-examine the way method specifications are written in JML.
Heavy-weight specifications provide a full description of a method’s behaviour, but often one does not want to bring out the ‘big guns’ and merely wishes to provide some formalism, in which case light-weight specifications are particularly useful. In the context of relations to BON’s assertion language, the latter are particularly interesting, since they follow a similar notation. For the sake of simplicity, and given that light-weight specifications can be written as heavy-weight ones and vice versa [21], the following relations always assume light-weight JML annotations. As described in section i.ii, model fields and methods and ghost fields are, for specification purposes, equal to fields and methods declared in Java. Since BON provides a specification only, it is natural to map all model and ghost elements to their BON counterparts, just like any Java declarations.

ii.ii.i Nullity
Every programmer who has encountered a NullPointerException in Java will acknowledge that it can make a big difference whether a reference can be null or not. As suggested by [22], JML now declares all reference types non_null by default. This stands in contrast with BON, where the opposite is the case. One


could choose to ignore this fact; however, it may have an effect on how other assertions are written. non_null can be expressed in BON with an assertion in the invariant (in the case of queries) or in the pre- and postcondition (in the case of return and parameter types). In the other direction, if one chooses to follow BON’s default, one can easily do so by annotating the Java class with nullable_by_default. However, it is not possible to declare a BON class non_null by default.
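The two nullity defaults can be sketched on a small class. The class Visitor is hypothetical, and the annotations are written as we understand them from the JML reference manual [19]; they are comments, so the example compiles with a plain Java compiler, and the runtime null check is added by hand rather than generated from the annotation.

```java
/** Hypothetical class following BON's default: references may be null unless stated otherwise. */
/*@ nullable_by_default @*/
class Visitor {
    // Explicitly non-null, overriding the class-level nullable default.
    private /*@ non_null @*/ String name;

    // Nullable under the class-level default; no annotation is needed.
    private String nickname;

    Visitor(/*@ non_null @*/ String name) {
        if (name == null) {
            throw new IllegalArgumentException("name must not be null");
        }
        this.name = name;
    }

    void setNickname(String nickname) {
        this.nickname = nickname; // null is allowed here
    }

    //@ ensures \result != null;
    /*@ pure @*/ String displayName() {
        return nickname != null ? nickname : name;
    }
}
```

Without the nullable_by_default modifier, JML would treat nickname as non-null as well, and a corresponding BON class would need an explicit assertion to express the same constraint.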

ii.ii.ii Queries and Commands
Since both BON and JML follow the Design by Contract theory, both allow only side-effect free expressions in assertions. BON ensures the correctness by strictly dividing its features into queries and commands and disallowing hybrids. In Java, no such assumptions are made by default and one has to specifically annotate a method as pure for it to be allowed in assertions. Note that Java fields are allowed by default as well. Hence they can be thought of as implicitly pure. Therefore, there is a relation between a BON query and a Java pure method or a field. On the other hand, Java allows methods that both modify the state of an object and have a return value, so that a BON command is not entirely equivalent to a non-pure method. In the opposite direction, a Java ‘hybrid’ method translates into a command that changes state and a query that provides the return value.

ii.ii.iii Frame condition
Although the frame condition in BON is part of the postcondition and in JML it is a separate clause, they both serve the same purpose and list locations that may be modified during execution. A difference exists in the default values. BON’s delta clause, when missing, translates to modifying nothing for a query and modifying everything for a command. This stands in contrast to JML’s default of assignable \everything. Therefore, a consistency checking tool should not only check explicit frame condition clauses, but also generate an error wherever the default values do not match.

Conclusion
Consistency checks are commonly performed during software development as they help to identify potential bugs and errors. The most common examples are compilers and type checkers for specific languages, but work has also been done on tools that check the consistency between different software models [23], different representations within the same notation [24] or fully integrated industrial solutions [12].
This report extends this work in that it presents a relation between the model language BON and the concrete implementation language Java. The successful partial implementation shows that an automatic consistency checking tool is indeed feasible and meaningful even if the relation is not always perfect. Surprisingly, mappings between BON and pure Java constructs prove to be more complex and more involved than relations on the assertion languages. It may also seem to the casual reader that those relations have many deficiencies, but in fact the details


they are concerned with mostly turn out to be exotic and rather rare special cases. The Beetlz tool has been designed to be tolerant in those instances so that it can be readily used in software development. The basic relations between BON and Java have been implemented in the application named ‘Beetlz’ [16]. It provides automatic tool support for consistency checking between a BON model and a Java implementation and is available in a command-line version as well as an Eclipse plugin for all major platforms (see the screenshot in Figure 1).

Potential future work on the tool itself includes support for additional JML elements, like multiple specification cases, heavy-weight specifications or further operators, as well as some extensions to BON that would close some of the current gaps with respect to Java and JML. On the theoretical side, formalising the relations between BON and Java, which at present only exist in structured English, would make a very interesting but also quite involved future project.

This project outlines how it is possible and advantageous to use a model of the software together with its implementation in a seamless manner. With added tool support, as the Beetlz tool demonstrates, updating of the model and/or the implementation is taken care of mostly automatically and thus can serve two purposes:
• Encourage the use of software models in software engineering.
• Reduce faults and misunderstandings resulting from poor communication with customers and/or team members.

Fig. 1.



References
1. I. Sommerville, Software Engineering. Pearson Education, 8th ed., 2007.
2. F. P. Brooks, “No Silver Bullet: Essence and Accidents of Software Engineering,” Computer, vol. 20, pp. 10–19, April 1987.
3. B. Meyer, Object-Oriented Software Construction. Prentice Hall PTR, 2nd ed., March 2000.
4. J. F. Monin and M. G. Hinchey, Understanding formal methods. Springer, 2003.
5. B. Stroustrup, The C++ Programming Language. Addison-Wesley Longman, 2000.
6. C. Horstmann, Java Concepts. Wiley, 5th ed., 2007.
7. J. Gosling, B. Joy, G. Steele, and G. Bracha, The Java(TM) Language Specification. Addison-Wesley Professional, 3rd ed., July 2005.
8. B. Meyer, Eiffel: The Language. Prentice Hall, 1991.
9. J. Spivey, The Z Notation: a reference manual, 2001.
10. B. Meyer, “Applying design by contract,” IEEE Computer, vol. 25, pp. 40–51, 1992.
11. M. Fowler, UML distilled: A Brief Guide to the Standard Object Modeling Language. Addison-Wesley, 3rd ed., 2004.
12. Rhapsody: UML Model-Driven Development, Lynuxworks™, http://www.lynuxworks.com/partners/show_product.php?ID=248.
13. R. F. Paige and J. S. Ostroff, “A Comparison of the Business Object Notation and the Unified Modeling Language,” Tech. Rep. 03, York University, 1999.
14. K. Walden and J.-M. Nerson, Seamless Object-Oriented Software Architecture. Prentice Hall, 1995.
15. Beetlz, full report: http://kind.ucd.ie/documents/proposals/reports/darulova09.pdf.
16. E. Darulova, F. Fairmichael, and J. Kiniry, Beetlz homepage, http://secure.ucd.ie/products/opensource/beetlz/.
17. R. F. Paige and J. S. Ostroff, “Precise and Formal Metamodeling with the Business Object Notation and PVS,” Tech. Rep. 03, York University, 2000.
18. G. T. Leavens, A. L. Baker, and C. Ruby, “Preliminary design of JML: a behavioral interface specification language for java,” SIGSOFT Softw. Eng. Notes, vol. 31, pp. 1–38, May 2006.
19. G. T. Leavens, E. Poll, C. Clifton, Y. Cheon, C. Ruby, D. Cok, P. Müller, J. Kiniry, P. Chalin, and D. M. Zimmerman, JML Reference Manual DRAFT, Revision: 1.231, 2008.
20. T. A. Gardner, Inheritance relationships for disciplined software construction. Springer, 2001.
21. A. D. Raghavan and G. T. Leavens, “Desugaring JML Method Specifications,” Tech. Rep. 00-03e, Computer Science, Iowa State University, 2005.
22. P. Chalin and F. Rioux, “Non-null references by default in the Java modeling language,” in SAVCBS ’05: Proceedings of the 2005 conference on Specification and verification of component-based systems, (New York, NY, USA), p. 9, ACM, 2005.
23. R. F. Paige, P. J. Brooke, and J. S. Ostroff, “Metamodel-based model conformance and multiview consistency checking,” ACM Trans. Softw. Eng. Methodol., vol. 16, no. 3, 2007.
24. B. Litvak, S. Tyszberowicz, and A. Yehudai, “Behavioral consistency validation of UML diagrams,” in Software Engineering and Formal Methods, 2003. Proceedings. First International Conference on, pp. 118–125, 2003.



DENTAL SCIENCES PANEL

Judging Panel
Prof. Donald Burden (Queen’s University Belfast) – Chair
Prof. Finbarr Allen (University College Cork)
Prof. Helen Whelton (University College Cork)
Prof. Brian O’Connell (Trinity College Dublin)

Judges’ commentary
Enlargement of the gums (gingival hyperplasia or gingival overgrowth) is a common side effect of the immunosuppressant drugs organ transplant patients are required to take to prevent transplant rejection. This is a significant clinical problem which has both aesthetic (psychosocial) and dental health implications for organ transplant patients. Gingival overgrowth may extend to interfere with mastication, occlusion and speech. Marked inflammatory changes can lead to the onset of gum disease (periodontal disease) and the potential loss of teeth. The Dental Sciences panel concluded that this paper represented an excellent evaluation of our current understanding of this complex area. The applicant conducted a comprehensive review of the relevant literature and followed this with a detailed and scholarly interpretation of how our understanding of this area has evolved with the advent of new studies. The paper provides an up-to-date analysis of the scientific studies which underpin our knowledge of the pathogenesis, risk factors and management of drug-induced gingival overgrowth. The Dental Sciences panel agreed that the quality of this paper is such that it would merit consideration for publication in an appropriate peer-reviewed professional journal.



Dental Sciences

The aetiology & management of gingival hyperplasia in organ transplant patients

Emer Walshe

Introduction

Medication-induced gingival enlargement is the most widespread side-effect of systemic medication on the periodontal tissues.1 Drugs associated with gingival overgrowth (GO) are broadly categorised according to their therapeutic actions, namely anticonvulsants, immunosuppressants and calcium channel blockers.2 Post-organ transplantation patients are medicated with immunosuppressants, most commonly cyclosporin-A (CsA), due to its selective action on the immune system. They may also be medicated with a calcium antagonist to attenuate CsA-induced nephrotoxicity.3 In 1983, Rateitschak-Pluss first reported cases of CsA-induced GO in humans,4 which was followed by reports of enlargement due to dihydropyridines in 1984.5 Gingival overgrowth associated with CsA is clinically indistinguishable from that elicited by the antiepileptic drug phenytoin.6 The GO usually begins as a papillary enlargement and is more pronounced in the anterior segments and on the labial surfaces of the teeth.7,8,9 Gingival overgrowth is usually confined to the attached gingiva but may extend coronally and interfere with mastication, occlusion and speech.2,10 There are marked inflammatory changes, demonstrable by bleeding upon probing.10 The onset of gingival overgrowth occurs within one to three months after initiation of CsA therapy.8,11


Definition of Terms

Many terms have been used in the literature to describe clinically apparent enlargement of the papillary and marginal gingivae. It has been suggested that GO is a more general term that better reflects the current lack of understanding of the pathogenesis of the condition.12 Gingival hyperplasia is “an abnormal increase in the number of normal cells in a normal arrangement in an organ or tissue, which increases in volume.”13 According to Seymour and Jacobs,10 “the disagreement with regard to fibroblast numbers indicates that cyclosporin-induced gingival enlargement may not be a true hyperplasia”, which suggests that gingival overgrowth is the more appropriate terminology.

Literature Review

Cyclosporin A is a successful immunosuppressant drug, derived from the fungal species Trichoderma polysporum and Cylindrocarpon, and is widely used in the prevention of organ transplant rejection.6 The drug has been the immunosuppressant of choice since 1978.14 Cyclosporin-A preferentially suppresses cell-mediated immune reactions.15 The drug binds to cyclophilin, a cytoplasmic protein that is important in T-cell responses to cytokines.16,17 This complex inhibits calcineurin, resulting in a decrease in interleukin-2, which is the stimulus for increasing the number of T-lymphocytes.17 It has been suggested that the intracellular concentrations of these proteins may be related to the sensitivity of T-lymphocytes.18 The prevalence of significant gingival changes in dentate patients medicated with CsA alone is approximately 25% to 30%.6,19,20,21

Calcium Channel Blockers

Organ transplant patients may be additionally medicated with calcium channel blockers (CCBs) to attenuate CsA-induced nephrotoxicity3 and for their antihypertensive action.22,23 Calcium channel blockers may be classified chemically as dihydropyridines (nifedipine, isradipine, amlodipine), phenylalkylamine derivatives (verapamil) and benzothiazepine derivatives (diltiazem).12 Calcium channel blockers can independently induce GO, although with a lower prevalence than CsA.24,25,26 Nifedipine is the most widely used CCB and has been implicated as a cause of GO.5,24,27,28,29,30,31,32,33 The severity of nifedipine-induced GO is associated with poor oral hygiene, pre-existing gingival inflammation and combination drug therapy.32 Gingival overgrowth has been reported in 15% to 83% of patients medicated with nifedipine.24,29,30,34 Gingival overgrowth may be associated with higher doses of nifedipine,29 although several studies failed to show any relationship between dose or plasma levels of nifedipine and GO.24,35,36,37,38,39

Risk Factors for DIGO

Age and Demographic Variables

Age has been shown to be an important risk factor for CsA-induced GO, with adolescents being more susceptible to GO.40,41 Age is not a relevant risk factor for CCBs, as these drugs are mainly used in middle-aged patients. It is suggested that fibroblast sensitivity to the drug may be influenced by a hormonal component.1 Increased circulating androgens and androgen metabolism in adolescents were found to be a major factor in causing CsA- and nifedipine-induced GO.42 This may stimulate a selected sub-population of fibroblasts to increase collagen synthesis or decrease collagenase activity.42

Gender

Studies relating to CsA and nifedipine suggest that males are at greater risk of GO than females and that the severity of changes is greater in males than in females.22,39,43

Drug Variables

Controversy remains regarding the relationship between drug variables and the expression of drug-induced gingival overgrowth (DIGO). There is agreement that a baseline or threshold concentration of the drug is required to induce gingival changes, although this may vary between individuals.40,44 Some studies have shown that GO is related to high doses of CsA,45,46,47 while others show that drug dosage is a poor predictor of the gingival changes.22,24,36,40,48,49 A positive correlation has been reported between CsA blood concentration and the prevalence of GO.41,50 Correlations were also found between drug plasma concentration and the severity of GO,11,50 although other studies failed to show a correlation between serum trough concentrations of CsA and the severity of GO.23,40 A positive correlation has been found between the salivary concentration of CsA and the extent of GO.20,40,41 Nimmi et al51 reported that dental plaque might act as a reservoir for CsA, which is released by stimulated salivary flow. Whole salivary concentrations of CsA are higher in patients taking the liquid form of the drug compared to the capsule form.52 Studies have revealed that patients exhibiting significant gingival changes had sequestration of both nifedipine and amlodipine.35,53 Thomason et al54 found that despite high levels of nifedipine sequestered in the gingival crevicular fluid (GCF), only the plasma concentration of nifedipine was identified as a risk factor for the severity of gingival changes.

Duration of Therapy

Although several studies have found no correlation between GO and the duration of therapy,21,23,55 Thomason et al54 found a positive correlation between therapy duration and severity of GO.

Concomitant Medication

It has been suggested that combined therapy may increase the prevalence but not the severity of GO.56 Combination therapies of nifedipine and CsA can produce more GO than either drug used independently.22,36,39,57,58


Periodontal Variables

Plaque

High plaque scores and gingival inflammation exacerbate the expression of DIGO.44 A positive correlation was found between GO scores and plaque scores,20,36,37,40 although others suggest gingival changes were unrelated to plaque scores.11,24,49 Most of the evidence that correlates the presence of bacterial plaque and GO is derived from cross-sectional studies, which is a significant limitation. It remains unclear whether plaque is a contributory factor or a consequence of the gingival changes,9,44 although the most recent classification system for periodontal diseases acknowledges plaque as a cofactor in the aetiology of drug-associated gingival enlargement.59

Oral Hygiene

Oral hygiene has been identified as a risk factor for development and expression of GO.21,22,23,39,43,50 Longitudinal studies have supported the suggestion that the severity of DIGO increases in the presence of poor plaque control and gingival inflammation.8,20,60,61 Plaque control and removal of gingival irritants are beneficial for gingival health but do not inhibit the development of CsA-induced GO.49 Stone et al62 found that improved oral hygiene alone would not prevent GO but may reduce the severity of CsA-induced GO.

Gingival Inflammation

A positive correlation was found between GO and gingival inflammation.36 There remains a lack of agreement as to whether gingival inflammation is a determinant of CsA-induced GO. Varga et al63 revealed that patients with hyperplastic gingivitis prior to transplant surgery were highly likely to develop severe gingival changes post surgery. These findings may indicate susceptibility of the gingival tissues or fibroblasts to both plaque-induced inflammatory changes and CsA.44

Pathogenesis of DIGO

The pathogenesis of DIGO is uncertain. It is likely to be associated with direct and indirect effects of CsA on fibroblasts and the extracellular components of the lamina propria, as well as targeted activation of growth factors.12 Disturbances in collagen metabolism of gingival tissue, rather than an increase in the number of fibroblasts, have been considered to be a possible mechanism in the pathogenesis of DIGO.64 Brown et al65 proposed, first, that there is an interaction between the drug and gingival inflammation secondary to bacterial irritation and, second, that the drug alters the cascade of biochemical events, resulting in increased gingival connective tissue production. Genetic factors are important in GO expression as they determine the heterogeneity of the gingival fibroblast. Functional heterogeneity exists among phenotypically stable fibroblasts in response to various stimuli.66 CsA and its major metabolite M-17 (ref. 67) could react with a phenotypically distinct subpopulation of gingival fibroblasts, stimulating deoxyribonucleic acid (DNA) synthesis and proliferation of gingival fibroblasts.67,68,69 This is maintained in the presence of lipopolysaccharide (LPS), which normally inhibits these cells, suggesting a role for plaque in the pathogenesis of GO.70,71 The increase in cell number coupled with a reduction in the breakdown of gingival connective tissue6 has been speculated to cause excessive extracellular matrix accumulation in CsA-associated GO.72 Barclay et al24 suggested that CsA and CCBs reduce cytosolic free calcium in recruited T-lymphocytes and gingival fibroblasts. This impairs T-cell proliferation and collagenase synthesis.24 Studies of human leukocyte antigen (HLA) have found that expression of HLA-DR1 (ref. 73) and HLA-B37 (ref. 9) has a protective role against CsA-induced GO, while patients expressing HLA-DR2 had an increased risk of developing GO.73

Connective Tissue Homeostasis

Gingival fibroblasts control collagen production by synthesis and release of metalloproteinases and tissue inhibitor of metalloproteinases.74 An impairment of collagenase synthesis will result in poor collagenolysis, which may contribute towards GO.36 In vitro studies have shown that CsA causes a significant increase in collagen synthesis.75

Inflammatory Cytokines and Growth Factors

Studies show that human gingival fibroblasts treated with drugs associated with GO increase production of fibroblast cytokines and prostaglandin E2 (PGE2).76,77 CsA does not induce PGE2 formation in gingival fibroblasts, but potentiates the response to tumour necrosis factor (TNF) alpha.77 Increased gingival levels of platelet-derived growth factor (PDGF) may be responsible for fibroblast proliferation and production of extracellular matrix constituents in GO.74 Hassell and Hefti6 proposed a scenario of cytokine-mediated regulation of fibroblast growth and protein synthesis. CsA inhibits production of interferon-γ (IFN-γ), leading to an imbalance of homeostasis in favour of enhanced proliferation and collagen production. Cytokine-dependent alterations in extracellular matrix metabolism appear important to GO and could result in abnormal differentiation of cells, resulting in accumulation of fibroblasts with a range of proliferative phenotypes.78

Management of DIGO

Dental management should be initiated pre-transplantation in conjunction with the patient’s medical team. A thorough clinical and radiographic examination should be carried out during early medical planning.79 Assessment of the oral hard and soft tissues, diet analysis and professional cleaning of teeth should follow transplant surgery.79 Treatment of DIGO is achieved by rigorous oral hygiene, debridement and surgical excision in cases where aesthetics, function or speech is compromised.80


Substitution/Withdrawal

Drug withdrawal or substitution is an obvious solution in the management of DIGO. New-generation immunosuppressants (tacrolimus and mycophenolate mofetil) are alternative medications to the traditionally used CsA.2,81 Tacrolimus (FK506) has been shown to have potential as an alternative to CsA.82,83 Some authors reported that the drug is not associated with GO,81,82 while others suggested that the prevalence and severity of GO are lower with tacrolimus than with CsA.84,85 A reduction of GO has been reported following a change in therapeutics from CsA to tacrolimus,86,87,88 and following a change to a same-class CCB.89 Azithromycin, an antimicrobial agent, has been shown to improve CsA-induced GO90,91,92 and does not modify CsA levels.92 It has been suggested that the drug blocks CsA-induced cell proliferation and collagen synthesis.64 A review of clinical trials on systemic use of azithromycin suggests complete regression or amelioration of CsA-induced GO is possible.93 If a transplant patient is medically stable and the side effects of CsA are controlled, the medical team may be reluctant to alter the therapeutic regimen.94

Non-Surgical Management

Selection of a treatment modality depends on the severity of the DIGO. Elimination of local factors, plaque control and regular periodontal maintenance therapy may ameliorate but not prevent DIGO in a susceptible patient.95 Plaque control should always be a first-line measure in treatment.79,96 There is evidence that good oral hygiene and plaque removal decrease the degree of GO and improve periodontal health.10,97 The use of chlorhexidine digluconate mouthwash (0.1% w/v) has reportedly reduced the incidence of DIGO recurrence following gingival surgery.98

Surgical Management

Surgical treatment is only advocated where GO is severe.99 Gingival overgrowth may be assessed using the method described by Seymour et al. (1985).100 The index measures the degree of GO in a labio-lingual and apico-coronal direction. Surgical interventions have been suggested with GO index scores in excess of 30%.101 If drug therapy is likely to be continued for life, psychosocial considerations must be taken into account in an effort to reduce the frequency and extent of surgical intervention.2 Factors to be considered when deciding on appropriate treatment include the extent of the surgical area, the presence of periodontitis, the presence of osseous defects combined with gingival enlargement lesions and the position of the bases of the pocket in relation to the existing mucogingival junction.96 The classic external bevel gingivectomy is a viable treatment option in small areas (up to six teeth) with no evidence of attachment loss.96 The periodontal flap is indicated in situations with larger areas of GO, or areas where attachment loss combined with osseous defects is present.96 It was found that no difference exists between flap surgery and conventional gingivectomy with respect to recurrence of GO.102 Provided clinical guidelines based on research are adhered to, evidence supports the biological compatibility of electrosurgery to excise papillary enlargement.103 The carbon dioxide (CO2) laser has been advocated in surgical management of DIGO due to decreased surgical time and rapid postoperative haemostasis.2 A comparative split-mouth crossover study conducted by Mavrogiannis et al (2006) revealed less recurrence of DIGO within a 6-month period with laser excision than with conventional gingivectomy.102 The use of CO2 lasers in combination with conventional gingivectomy has been advocated for dual DIGO.104

Treatment Outcomes and Recurrence Rate

Reduction in pocket depth achieved by flap surgery may be sustained for longer periods than that achieved by the gingivectomy technique.105 The combination of CsA and dihydropyridine medication, as well as gingival inflammation, was found to be a significant risk factor for an increase or recurrence of GO following periodontal treatment.56

Discussion and Conclusion

The concomitant use of calcium channel blockers with CsA has been demonstrated to increase the prevalence and severity of CsA-induced GO,39,84 although these drugs can independently induce GO.24,25,26 The clinical presentation of DIGO is the likely result of a complex interaction between growth factors and cytokines.14 It has been demonstrated that the pathogenesis of DIGO has a multifactorial aetiology.106 Studies suggest that the incidence and severity of DIGO in patients treated with CsA depend on plaque control, the level of gingival inflammation and extent of periodontal destruction, the dosage and duration of therapy, plasma and tissue concentrations of the drug, as well as the age of the patient and perhaps the underlying medical condition.10 Drug variables, plaque-induced inflammatory changes in the gingivae and genetic factors appear to be most significant in the expression of GO.

Multidisciplinary treatment plays a pivotal role in the overall management of organ transplant patients. The provision of a preventive periodontal programme before initiation of drug therapies implicated in DIGO has been recommended.79,107 This seeks to achieve effective control of local inflammatory factors (plaque and calculus) and may minimise the severity of gingival changes. The role of oral hygiene as a risk factor for expression of GO has been identified.21,22,23,39,43,50 Dental care guidelines recommend frequent patient recall and prophylaxis, as well as daily antibacterial mouthrinses.107 Meticulous oral hygiene,61 chlorhexidine digluconate mouthrinse98 and professional cleaning24 can be significant in reducing the rate and degree of GO recurrence.95

Withdrawal or substitution of the offending medication has proven successful.86,87,88 Tacrolimus is a new-generation immunosuppressant which some studies report is not associated with GO.81,82 Although tacrolimus has potential as an alternative immunosuppressant, many of the studies81,82,85 lack long-term follow-up results and have relatively small sample sizes. Azithromycin is a safe, cost-effective and long-lasting treatment for GO, without needing any change in the dose and monitoring of CsA and avoiding the need for repeated surgical procedures.92

Surgical intervention may be considered after non-surgical measures fail to reduce the GO to an aesthetically acceptable appearance. Although gingivectomy remains a viable treatment option, the resulting wound may be painful and require considerable post-operative precautions to prevent infection,2 which may lead operators to consider alternatives including a total or partial internal bevel gingivectomy.2 It is projected that the use of medications with the potential to cause GO will increase in the future.2,44 Early assessment of patients and initiation of preventive programmes will be important in the life-long management of organ transplant patients.



References

1. Seymour, R. A. (2006). Effects of medications on the periodontal tissues in health and disease. Periodontology 2000. 40: 120-129.
2. Marshall, R. I., Bartold, P. M. (1999). A clinical review of drug-induced gingival overgrowths. Australian Dental Journal. 44 (4): 219-232.
3. Feehally, J., Walls, J., Mistry, N., Horsburgh, T., Taylor, J., Vietch, P. S. & Bell, P. R. F. (1987). Does nifedipine ameliorate cyclosporine A nephrotoxicity? British Medical Journal. 295: 310.
4. Rateitschak-Pluss, E. M., Hefti, A., Lortscher, R. & Thiel, G. (1983). Initial observation that cyclosporine-A induces gingival enlargement in man. J Clin Periodontol. 10: 237-246.
5. Lederman, D., Lumerman, H., Reuben, S., Freedman, P. (1984). Gingival hyperplasia associated with nifedipine therapy. Oral Surg Oral Med Oral Pathol. 57: 620-622.
6. Hassell, T. M., Hefti, A. F. (1991). Drug Induced Gingival Overgrowth: Old Problem, New Problem. Crit Rev Oral Biol Med. 2: 103-137.
7. Daley, T. D., Wysocki, G. P., Day, C. (1984). Cyclosporin therapy. Its significance to the periodontist. J Periodontol. 55: 708-712.
8. Tyldesley, W. R. & Rotter, E. (1984). Gingival hyperplasia induced by cyclosporin-A. Br Dent J. 157: 305-309.
9. Thomason, J. M., Kelly, P. J., Seymour, R. A. (1996). The distribution of gingival overgrowth in organ transplant patients. J Clin Periodontol. 23: 367-371.
10. Seymour, R. A. & Jacobs, D. J. (1992). Cyclosporine and the gingival tissues. J Clin Periodontol. 19: 1-11.
11. Seymour, R. A., Smith, D. G. & Rogers, S. R. (1987). The comparative effect of azathioprine and cyclosporin on some gingival health parameters of renal transplant patients. J Clin Periodontol. 14: 610-613.
12. Marshall, R. I., Bartold, P. M. (1998). Medication induced gingival overgrowth. Oral Diseases. 4: 130-151.
13. McCullough, K., ed. (1982). Dorland’s Pocket Medical Dictionary. W.B. Saunders Company: Sydney.
14. Calne, R. Y., Thiru, S., McMaster, P., Craddock, G. N., White, D. J. G., Evans, D. B., Dunn, D. C., Pentlow, B. D. & Rolles, K. (1978). Cyclosporin-A in patients receiving renal allografts from cadaver donors. Lancet. 1: 1323-1327.
15. Boltchi, F. E., Rees, T. D. & Iacopino, A. M. (1999). Cyclosporine A-induced gingival overgrowth: a comprehensive review. Quintessence International. 30: 775-783.
16. Trevor, A. J., Katzung, B. G., Masters, S. B. (2005). Katzung & Trevor’s Pharmacology: Examination & Board Review. Lange. 7th Ed. 474-483.
17. Howland, R. D. & Mycek, M. J. (2006). Immunosuppressive Drugs. Lippincott’s Illustrated Reviews: Pharmacology. Lippincott, Williams & Wilkins. 3rd Ed. 485-494.
18. Hess, A., Colombani, P. (1987). Mechanism of action of cyclosporine: a unifying hypothesis. Adv Exp Med Biol. 213: 309-330.
19. Wysocki, G. P., Gretzinger, H. A., Laupacis, A., Ulan, R. A. & Stiller, C. R. (1983). Fibrous hyperplasia of the gingiva: a side effect of cyclosporine A therapy. Oral Surgery Oral Medicine Oral Pathology. 55: 274-278.
20. McGaw, T., Lam, S. & Coates, J. (1987). Cyclosporine-induced gingival overgrowth: correlation with dental plaque scores, gingivitis scores, and cyclosporine levels in serum and saliva. Oral Surgery Oral Medicine Oral Pathology. 64: 293-297.
21. Pernu, H. E., Pernu, L. M. H., Huttunen, K. R. H., Nieminen, P. A. & Knuuttila, M. L. E. (1992). Gingival overgrowth among renal transplant recipients related to immunosuppressive medication and possible local background factors. J Periodontol. 63: 548-553.
22. Thomason, J. M., Seymour, R. A., Ellis, J. S., Kelly, P. J., Parry, G., Dark, J., Wilkinson, R., Idle, J. R. (1996). Determinants of gingival overgrowth severity in organ transplant patients. An examination of the role of HLA phenotype. J Clin Periodontol. 23: 628-634.
23. King, G. N., Fulinfaw, R., Higgins, T. J., Walker, R. G., Francis, D. M. A. & Wiesenfeld, D. (1993). Gingival hyperplasia in renal allograft recipients receiving cyclosporin-A and calcium antagonists. J Clin Periodontol. 20: 286-293.
24. Barclay, S., Thomason, J. M., Idle, J. R., Seymour, R. A. (1992). The incidence and severity of nifedipine-induced gingival overgrowth. J Clin Periodontol. 19: 440-441.
25. Miller, C. S. & Damm, D. D. (1992). Incidence of verapamil-induced gingival hyperplasia in a dental population. J Periodontol. 63: 453-456.
26. Jorgensen, M. G. (1997). Prevalence of Amlodipine-Related Gingival Hyperplasia. J Periodontol. 68: 676-678.
27. Van der Wall, E. E., Tuinzing, D. B., Hes, J. (1985). Gingival hyperplasia induced by nifedipine, an arterial vasodilating drug. Oral Surg Oral Med Oral Pathol. 60: 38-40.
28. Lucas, R. M., Howell, L. P., Wall, B. A. (1985). Nifedipine-induced gingival hyperplasia: a histochemical and ultrastructural study. J Periodontol. 56: 211-215.
29. Barak, S., Engelberg, I. S., Hiss, Z. (1987). Gingival hyperplasia caused by nifedipine: histopathological findings. J Periodontol. 58: 639-642.
30. Slavin, J. & Taylor, J. (1987). Cyclosporin, nifedipine, and gingival hyperplasia [Letter]. Lancet. 2: 739.
31. Hancock, R. H., Swan, R. H. (1992). Nifedipine-induced gingival overgrowth: report of a case treated by controlling plaque. J Clin Periodontol. 19: 12-14.
32. James, J. A., Linden, G. J. (1992). Nifedipine-induced gingival hyperplasia. Dental Update. 19: 440-441.
33. Harel-Raviv, M., Eckler, M., Lalani, K., Raviv, E., Gornitsky, M. (1995). Nifedipine-induced gingival hyperplasia. A comprehensive review and analysis. Oral Surg Oral Med Oral Pathol Oral Radiol Endod. 79: 715-722.


34. Fattore, L., Stablein, M., Bredfelt, G., Semla, T., Moran, M., Doherty-Greenburg, J. (1991). Gingival hyperplasia: A side effect of nifedipine and diltiazem. Spec Care Dent. 11: 107-109.
35. Ellis, J. S., Seymour, R. A., Monkman, S. C., Idle, J. R. (1993). Disposition of nifedipine in plasma and gingival crevicular fluid in relation to drug-induced gingival overgrowth. J Periodontal Res. 28: 373-378.
36. Thomason, J. M., Seymour, R. A. & Rice, N. (1993). The prevalence and severity of cyclosporine and nifedipine-induced gingival overgrowth. J Clin Periodontol. 20: 37-40.
37. Bullon, P., Machuca, G., Martinez-Sahuquillo, A., Rios, J. V., Rojas, J., Lacalle, J. R. (1994). Clinical assessment of gingival hyperplasia in patients treated with nifedipine. J Clin Periodontol. 21: 256-259.
38. Nery, E. B., Edson, R. G., Lee, K. K., Pruthi, V. K. & Watson, J. (1995). Prevalence of nifedipine-induced gingival hyperplasia. J Periodontol. 66: 572-578.
39. Thomason, J. M., Seymour, R. A., Ellis, J. S., Kelly, P. J., Parry, G., Dark, J., Idle, J. R. (1995). Iatrogenic gingival overgrowth in cardiac transplantation. J Periodontol. 66: 742-746.
40. Daley, T. D., Wysocki, G. P., Day, C. (1986). Clinical and Pharmacologic correlations in cyclosporine-induced gingival hyperplasia. Oral Surg Oral Med Oral Pathol. 62: 417-421.
41. Hefti, A. F., Eshenaur, A. E., Hassell, T. M., Stone, C. (1994). Gingival overgrowth in cyclosporine A treated multiple sclerosis patients. J Periodontol. 65: 744-749.
42. Sooriyamoorthy, M., Gower, D. B., Eley, B. M. (1990). Androgen metabolism in gingival hyperplasia induced by nifedipine and cyclosporin. J Periodont Res. 25: 25-30.
43. Ellis, J. S., Seymour, R. A., Steele, J. G., Robertson, P., Butler, T. J., Thomason, J. M. (1999). Prevalence of gingival overgrowth induced by calcium channel blockers: a community based study. J Periodontol. 70: 63-67.
44. Seymour, R. A., Ellis, J. S., Thomason, J. M. (2000). Risk factors for drug-induced gingival overgrowth. J Clin Periodontol. 27: 217-223.
45. Adams, D., Davies, G. (1984). Gingival hyperplasia associated with cyclosporine-A. Br Dent J. 157: 89-90.
46. Rostock, M. H., Fry, H. R. & Turner, J. E. (1986). Severe gingival overgrowth associated with cyclosporine therapy. J Periodontol. 57: 294-299.
47. Thomason, J. M., Seymour, R. A., Ellis, J. S. (2005). Risk factors for gingival overgrowth in patients medicated with ciclosporin in the absence of calcium channel blockers. J Clin Periodontol. 32: 273-279.
48. Seymour, R. A. & Heasman, P. A. (1988). Drugs and the periodontium. J Clin Periodontol. 15: 1-16.
49. Seymour, R. A. & Smith, D. G. (1991). The effect of a plaque control programme on the incidence and severity of cyclosporin-induced gingival changes. J Clin Periodontol. 18: 107-110.
50. Somacarrera, M. L., Hernandez, G., Acero, J. & Moskow, B. S. (1994). Factors related to the incidence and severity of cyclosporine-induced gingival overgrowth in transplant patients. A longitudinal study. J Periodontol. 65: 671-675.
51. Nimmi, A., Tohnai, I., Kaneda, T., Takouchi, M. & Nagura, H. (1990). Immunohistochemical analysis of effects of cyclosporine A on gingival epithelium. Journal of Oral Pathology and Medicine. 19: 397-403.
52. Modeer, T., Wondimu, B., Larsson, E., et al (1992c). Levels of cyclosporine-A in saliva in children after oral administration of the drug in mixture or in capsule form. Scand J Dent Res. 100: 366-370.
53. Seymour, R. A., Ellis, J. S., Thomason, J. M., Monkman, S. & Idle, J. R. (1994). Amlodipine-induced gingival overgrowth. J Clin Periodontol. 21: 281-283.
54. Thomason, J. M., Ellis, J. S., Kelly, P. J. & Seymour, R. A. (1997). Nifedipine pharmacological variables as risk factors for gingival overgrowth in organ transplant patients. Clinical Oral Investigations. 1: 35-39.
55. Thomas, D. W., Baboolal, K., Subramian, N. & Newcombe, R. G. (2001). Cyclosporin A-induced gingival overgrowth is unrelated to allograft function in renal transplant recipients. J Clin Periodontol. 28: 706-709.
56. Pernu, H. E., Pernu, L. M., Knuttila, M. L. (1993). Effect of periodontal treatment on gingival overgrowth among cyclosporine A-treated renal transplant recipients. J Periodontol. 64: 1098-1100.
57. O’Valle, F., Mesa, F., Aneiros, J., Gomez-Morales, M., Moreno, E., Navarro, N., Cabellero, T., Masseroli, M., Garcia del Moral, R. (1995). Gingival overgrowth induced by nifedipine and cyclosporine A. Clinical and morphometric study with image analysis. J Clin Periodontol. 22: 591-597.
58. Margiotta, V., Pizzo, I., Pizzo, G., Barbaro, A. (1996). Cyclosporin- and nifedipine-induced gingival overgrowth in renal transplant patients: correlations with periodontal and pharmacological parameters, and HLA-antigens. J Oral Pathol Med. 25: 128-134.
59. Armitage, G. C. (1999). Development of a classification system for periodontal diseases and conditions. Ann Periodontol. 4: 1-6.
60. Addy, V., McElney, J. C., Eyre, D. G., Campbell, D. & D’Arcy, P. F. (1983). Risk factors in phenytoin-induced gingival hyperplasia. J Periodontol. 54: 373-377.
61. Nishikawa, S., Tada, H., Hamasaki, A., Kasahara, S., Kido, J., Nagata, T., Ishida, H., Wakano, Y. (1991). Nifedipine-induced gingival hyperplasia: a clinical and in vitro study. J Periodontol. 62: 30-35.
62. Stone, C., Eshenaur, A. & Hassell, T. (1989). Gingival enlargement in cyclosporine-treated multiple sclerosis patients. J Dent Res. 68 (Abstr.): 285.
63. Varga, E., Lennon, M. A. & Mair, L. H. (1998). Pre-transplant gingival hyperplasia predicts severe cyclosporine-induced gingival overgrowth in renal transplant patients. J Clin Periodontol. 25: 225-230.
64. Kim, J. Y., Park, S. H., Cho, K. S. (2008). Mechanism of azithromycin treatment on gingival overgrowth. J Dent Res. 87 (11): 1075-1079.
65. Brown, R. S., Beaver, W. T., Bottomley, W. K. (1991). On the mechanism of drug-induced gingival hyperplasia. J Oral Pathol Med. 20: 201-209.


66. Hassell, T. M. & Stanek, E. J. (1983). Evidence that healthy human gingiva contains functionally heterogeneous fibroblast subpopulations. Archs Oral Biol. 28: 617-625.
67. Mariotti, A., Hassell, T., Jacobs, D., Manning, C. J. & Hefti, A. F. (1998). Cyclosporin A and hydroxycyclosporine (M-17) affect the secretory phenotype of human gingival fibroblasts. Journal of Oral Pathology and Medicine. 27: 260-261.
68. Hassell, T. M., Buchanan, J., Cuchens, M., Douglas, R. (1988). Fluorescence-activated vital cell sorting of human fibroblast subpopulations that bind cyclosporine A. J Dent Res. 67: 273.
69. Jacobs, D., Buchanan, J., Cuchens, M., Hassell, T. M. (1990). The effect of cyclosporine metabolite OL-17 on gingival fibroblast subpopulations. J Dent Res. 69: 221.
70. Bartold, P. (1989). Regulation of human gingival fibroblast growth and synthetic activity by cyclosporine-A in vitro. J Periodont Res. 24: 314-321.
71. Barber, M. T., Savage, N. W., Seymour, G. J. (1992). The effect of cyclosporine and lipopolysaccharide on fibroblasts: Implications for cyclosporine-induced gingival overgrowth. J Periodontol. 63: 397-404.
72. Mariotti, A. (2005). Clinical Periodontology & Implant Dentistry. Lang, N., Thorkild, K. 5th Ed. Blackwell Munksgaard. 405-420.
73. Pernu, E. H., Knuuttila, M. L. E., Huttenen, K. R. H., Tiilikainen, A. S. K. (1994). Drug-induced gingival overgrowth and class II major histocompatibility antigens. Transplantation. 57: 1811-1813.
74. Hallmon, W. W., Rossmann, J. A. (1999). The role of drugs in the pathogenesis of gingival overgrowth. A collective review of current concepts. Periodontol 2000. 21: 176-196.
75. Schincaglia, G. P., Fornit, F., Cavallini, R., Piva, R., Calura, G., Del Senno, L. (1992). Cyclosporine A increases type 1 pro-collagen production and mRNA level in human gingival fibroblasts in vitro. J Oral Pathol Med. 21: 181-185.
76. Modeer, T., Anduren, I., Lerner, U. H. (1992a). Enhanced prostaglandin biosynthesis in human gingival fibroblasts isolated from patients treated with phenytoin. J Oral Pathol Med. 21: 251-255.
77. Wondimu, B., Modeer, T. (1997). Cyclosporine A upregulates prostaglandin E2 production in human gingival fibroblasts challenged with tumour necrosis factor alpha in vitro. J Oral Pathol Med. 26: 11-16.
78. Trackman, P. C., Kantarci, A. (2004). Connective tissue metabolism and gingival overgrowth. Crit Rev Oral Biol Med. 15: 165-175.
79. MacCarthy, D., Claffey, N. (1991). Fibrous hyperplasia of the gingiva in organ transplant patients. J Ir Dent Assoc. 37 (1): 3-5.
80. Claffey, N. (2003). Plaque induced Gingival Diseases. Clinical Periodontology and Implant Dentistry. Jan Lindhe. 4th Ed. Chapter 7: 203-4.
81. Greenberg, K. V., Armitage, G. C. & Shiboski, C. H. (2008). Gingival enlargement among renal transplant recipients in the era of new-generation immunosuppressants. J Periodontol. 79 (3): 453-460.
82. James, J. A., Jamal, S., Hull, P. S., Macfarlane, T. V., Campbell, B. A., Johnson, R. W. G., Short, C. G. (2001). Tacrolimus is not associated with gingival overgrowth in renal transplant patients. J Clin Periodontol. 28: 848-852.
83. Knoll, G. A. & Bell, R. C. (1999). Tacrolimus versus cyclosporine for immunosuppression in renal transplantation: meta-analysis of randomised trials. Br Med J. 318: 1104-1107.
84. Ellis, J. S., Seymour, R. A., Taylor, J. J., Thomason, J. M. (2004). Prevalence of gingival overgrowth in transplant patients immunosuppressed with tacrolimus. J Clin Periodontol. 31: 126-131.
85. De Oliveira Costa, F., Diniz Ferreira, S., de Miranda Cota, L. O., da Costa, J. E., Aguiar, M. A. (2006). Prevalence, severity, and risk variables associated with gingival overgrowth in renal transplant subjects treated under tacrolimus or cyclosporine regimens. J Periodontol. 77: 969-975.
86. Bader, G., Lejeune, S. & Messner, M. (1998). Reduction of cyclosporine-induced gingival overgrowth following a change to tacrolimus. A case history involving a liver transplant patient. J Periodontol. 69: 729-732.
87. Hernandez, G., Arriba, L., Lucas, M. & de Andres, A. (2000). Reduction of severe gingival overgrowth in a kidney transplant patient by replacing cyclosporine A with tacrolimus. J Periodontol. 71: 1630-1636.
88. Hernandez, G., Arriba, L., Frias, M. C., de la Macorra, J. C., de Vicente, J. C., Jimenez, C., de Andres, A. & Moreno, E. (2003). Conversion from cyclosporine A to tacrolimus as a non-surgical alternative to reduce gingival enlargement: a preliminary case series. J Periodontol. 74: 1816-1823.
89. Westbrook, P., Bednarczyk, E. M., Carlson, M., Sheehan, H. & Bissada, N. F. (1997). Regression of nifedipine-induced gingival hyperplasia following switch to a same class calcium channel blocker, isradipine. J Periodontol. 68: 645-650.
90. Wahlstrom, E., Zamora, J. U., Teichman, S. (1995). Improvement in cyclosporine-associated gingival hyperplasia with azithromycin therapy. N Engl J Med. 332: 753-754.
91. Puig, J. M., Lloveras, J., Bosch, J.
M., Munne, A., Mir, F., Barbosa, F. & Masramon, J. (1997). Treatment of gingival hyperplasia secondary to cyclosporine by the new macrolide azithromycin. Transplantation Proceedings. 29: 2379-2380. 92. Gomez, E., Sanchez-Nunez, M., Sanchez, J. E. (1997). Treatment of cyclosporin induced gingival hyperplasia with azithromycin. Nephrol Dial Transplant 12: 2694-2697. 93. Strachan, D., Burton, I., & Pearson, G. J. (2003). Is oral azithromycin effective for thetreatmentofcyclosporine-inducedgingivalhyperplasiaincardiactransplant recipients. Journal of Clinical Pharmacy and Therapeutics. 28: 329-338. 94. Khocht, A., Schneider, L. C. (1997). Periodontal management of gingival over growth in the heart transplant patient: A case report. J Periodontol. 68: 1140 1146. 95. AcademyReport,InformationPaperoftheAmericanAcademyofPeriodontology. Drug-Associated Gingival Enlargement. (2004). J Periodontol. 75: 1424-1431. 96. Camargo, P. M., Melnick, P. R., Pirith, F. Q. M., Lagos, R. & Takei, H. H. (2001). 194


Treatment of drug-induced gingival enlargement: aesthetic and functional considerations. Periodontology 2000. 27: 131-138. 97. Dongari, A., McDonnell, H. T., Langlais, R. P. (1993). Drug-induced gingival overgrowth. Oral Surg. Oral Med. Oral Pathol. Oral Radiol. Endod. 76: 543-548. 98. O’Neill, T. C. A., Figures, K. H. (1982). The effects of chlorhexidine and mechanical methods of plaque control on the recurrence of gingival hyperplasia in young patients taking phenytoin. Br Dent J. 152: 130-133. 99. Mavrogiannis, M., Ellis, J. S., Thomason, J. M., Seymour, R. A. (2006). The management of drug-induced gingival overgrowth. J Clin Periodontol. 33: 434 439. 100. Seymour, R. A., Smith, D. G. & Turnbull, D. N. (1985). The effects of phenytoin and sodium valproate on the periodontal health of adult epileptic patients. J Clin Periodontol. 12: 413-419. 101. Thomason, J. M. & Seymour, R. A. (1990) Phenytoin-induced gingival over growth in general medical practice. Journal of Dental Research 69: 969. 102. Mavrogiannis, M., Ellis, J. S., Seymour, R. A., Thomason, J. M. (2006). The efficacy of three different surgical techniques in the management of drug-induced gingival overgrowth. J Clin Periodontol. 33: 677-682 103. Krejci, R. F., Kalkwarf, K. L. & Krause-Hohenstein, U. (1987). Electrosurgery-a biological approach. J Clin Periodontol. 14: 557-563. 104. Darbar, U., Hopper, C., Speight, P. (1997). Combined treatment approach to gingival overgrowth due to drug therapy. J Clin Periodontol. 23: 941-944. 105. Pilloni, A., Camargo, P. M., Carere, M. & C. Jr. (1998). Surgical treatment of cyclosporine-A and nifedipine-induced gingival enlargement: Gingivectomy versus periodontal flap. J Periodontol. 69: 791-797. 106. Seymour, R. A., Thomason, J. M., Ellis, J. S. (1996). The pathogenesis of drug induced gingival overgrowth. J Clin Periodontol. 23: 165-175. 107. Guggenheimer, J., Eghtesad, M. D., & Stock, D. J. (2003). Dental Management of the (solid) organ transplant patient. 
Medical Management Update. Oral Surg Oral Med Oral Path. 95 (4): 383-389.

195


DRAMA, FILM & MUSIC PANEL

Judging Panel
Prof. Brian Singleton (Trinity College Dublin) – Chair
Dr. Harvey O’Brien (University College Dublin)
Dr. Paul Murphy (Queen’s University Belfast)
Prof. Fiona Palmer (NUI Maynooth)
Dr. Marcus Zagorski (University College Cork)
Dr. Niamh Doheny (Huston Film School)

Judges’ commentary
This is an outstanding piece of writing of publishable standard. The author demonstrates finely honed skills in summary and analysis, pacing her movement between context and close reading with almost uncanny precision. Her succinctness in compressing complex theoretical ideas in a way which does them justice but does not prevent her from moving forward is truly remarkable. This is a piece of writing that is never less than clear about what is in question and what relevance it has to the furtherance of its argument. The student’s reading of Spanish cinema is nuanced and well informed, and yet, again, she does not feel the need to become overly focused on historical context to the exclusion of a rhetorical point of view. The sense of a grounded critical orientation is extremely strong – demonstrated in the confidence with which she is able to summarise existing critical frameworks and then smoothly segue into her own, clearly delineated, point of view on the subject without being drawn into mechanical contradiction of other authors. The close reading of the films themselves is equally strong, and you feel it comes after sufficient groundwork has been done to allow her freedom to explore the films as she does. There is a tendency to relegate key descriptive passages to footnotes, and this may be something to do with word lengths, but the material in these (long) footnotes is also very good. If it were to be published, I think an ‘upgrade’ of all of these would be required. There is no question that this work is of a very high standard, both in terms of quality film studies work and purely as writing. The confidence, clarity, and sense of pace are excellent, and the work demonstrates both scholarship and execution equivalent to good Masters-level writing and even professionally published articles.


Drama, Film & Music

Bigas Luna’s Retratos Ibericos & the gendered performance of Self
Ciara Barrett

In 1976, one year after the death of Franco and a year into Spain’s transicion period into secular, socialist democracy,1 Spanish film auteur-to-be Bigas Luna directed his first feature film, Tatuaje. Between this and the first film of his acclaimed Retratos ibericos, or Iberian trilogy, Jamon jamon in 1992 (followed by Huevos de oro in 1993 and La teta y la luna in 1994), Bigas Luna directed six films: Bilbao (1978), Caniche (1979), Renacer (1981), Lola (1985), Angustia (1987) and Las edades de Lulu (1990). Luna carried his work into the nineties with a distinct interest in themes of desire, sexuality, perversion, and generational and/or familial conflict.2 His films may be seen as in keeping with a trend towards deliberate provocativeness from within Spanish cinema after 1975, when Francoist film censorship collapsed. Repressed for so long under the former regime, the visible presence of sex onscreen served as a vehicle for historical self-analysis,3 a symbol for, and indicative of, socio-political and artistic freedom. Bigas Luna, many of whose films earned the Spanish equivalent of an “X” or “NC-17” rating, may be seen to have actively participated in this culture of revolutionary politics sublimated into artistic production. The most offensive of Luna’s films fall into the popular genre of the destape film, newly-born to post-Franco Spanish cinema, devoted to representing all that is crude and offensive.4 Like the work of Vicente Aranda (including Amantes from 1991, which breaks the taboo of explicitly showing an erect penis onscreen) and the accumulating repertoire of Pedro Almodovar (whose Pepi, Luci, Bom, released 1980, is prototypical of the kind of bad taste and good humour characteristic of destape films thereafter), Bigas Luna’s work of the transicion period may be seen as an oppositional cinema after the fact of oppression, in discourse with its repressed past by virtue of its very rejection of repressive influence.

However, it has been argued by Peter Evans that by the 1990s Bigas Luna significantly opened himself to a more critical and ironic stance towards historical, political discourses surrounding Spanish society’s relationship to that same repressed past.5 It could be seen as a stance-as-distance: in his Retratos ibericos, he treats symbols of, and relative to, the traditionalist Spanish past with a tendency towards parody and/or irony. Arguably these are methods of discursive engagement allowing for a certain degree of self-detachment and critical perspective on the part of the director, a potential site for internalised socio-political conflict subsequently channelled into artistic proliferation.

1 Kowalsky, Daniel. “Rated S: Softcore pornography and the Spanish transition to democracy, 1977-1982.” Spanish popular cinema. Antonio Lazaro Reboll and Andrew Willis, Eds. Manchester: Manchester University Press, 2004. (188)
2 Evans, Peter William. Jamon jamon, Bigas Luna. Barcelona: Ediciones Paidos, 2004. (13)
3 Ibid. (21)
Alternatively, we might see Bigas Luna’s role as director, especially in the Retratos ibericos, as less a site in/on/from which to glean evidence of social change and see resulting conflicts played out than, as Ann Marie Stock has argued, that of a “collector”—and with that, organizer and regenerator—of social observations, “identifying, analyzing and revealing the operative mechanisms of cinema.”6 Through self-consciously cinematic techniques and modes of representation, Stock argues, Bigas Luna “strives not to meet expectations but [rather] to underline them as such.”7 I would argue, in this vein, that by the time of Jamon jamon, incontestably a highly sexual film, Luna has taken to using the omnipresence of sex-for-the-sake-of-sex in contemporary Spanish cinema as a self-conscious jumping-off point for maturely critiquing gender politics and their modes of filmic representation, in relation to filmic discourses of Spanish nationhood. As Marvin D’Lugo says, sexuality is the “narrative ‘bait’” Bigas Luna uses in order to entice his audience into reading (into) the text of his films.8 In that it has a reputation for meaninglessness, apart from its present standing-in for what was previously necessarily absent from censored Spanish cinema, sex becomes the most potentially significant symbol, or space/site open to symbolic inscription, in the highly—even to the point of overly—symbolic texts of the Retratos ibericos.

Indeed, the three films of the Retratos, La teta y la luna, Jamon jamon, and Huevos de oro,9 are completely shot through with sexual images and symbols, almost all of which are unambiguously phallic. In this light, the Retratos ibericos might be seen as evidence of Bigas Luna’s regression to an alignment with more fully traditional and phallocentric discourses of filmic representation, apparently insistent on the symbolic ordering of images with predetermined and clichéd meanings. However, it will be my argument here that Bigas Luna so insists on the overrepresentation of symbols of patriarchy (alluding to Spain’s oppressively ordered fascist past), so insists on inscribing—in fact, and as I will be showing, quite literally writing—meaning into these texts that he intends to reveal the abusive nature of subjecting things—objectifying things, especially people, and even more especially women, as the frequent objects/victims of patriarchal discourse and practice—to the symbolic construction of meaning. Bigas Luna aims to abuse his own power as filmmaker, as constructor of meaning, in order ultimately to expose that power of symbolic ordering as always already illegitimate, as inherently unstable, built on reproduction after reproduction of gendered images and symbols so overdetermined, by this point, in terms of “meaning” that they have lost all true symbolic and narrative value in and of themselves. We may then see the films of this trilogy, upon close analysis, as working ever-so subtly and subversively to play against type and underline the extensive symbolic ordering of their narratives; they subvert the hold of traditional, patriarchal modes of representation and methods of narrativization from the inside out.

4 Ibid. (21)
5 Ibid. (21) He writes, “Bigas Luna es el historiador de un mundo arcaico que se enfrenta a los retos del futuro.” [“Bigas Luna is the historian of an archaic world facing the challenges of the future.”]
6 Stock, Ann Marie. “Eyeing Our Collections: Selecting Images, Juxtaposing Fragments, and Exposing Conventions in the Films of Bigas Luna.” Modes of Representation in Spanish Cinema. Jenaro Talens and Santos Zunzunegui, Eds. Minnesota: University of Minnesota Press, 1998. (171-172)
7 Ibid. (184)
8 D’Lugo, Marvin. “La teta i la lluna: The Form of Transnational Cinema in Spain”. Refiguring Spain: Cinema/Media/Representation. Marsha Kinder, Ed. Durham: Duke University Press, 1997. (207)

For one thing, the characters of Bigas Luna’s Iberian Trilogy are all strictly typed, or significantly functional, in that they are obviously coded for a to-be-looked-for (and assumedly accessible) meaning.
In all three films, the end credits identify the actors not by the names of the characters they play, but by the types that their characters, before they are even given voice, are always already performing. Cases in point: in Teta, only the young protagonist of the narrative, Tete, is linked directly by his (fictional) name to his real self, the actor Biel Duran. All others are identified by part, as opposed to person: thus the character of Estrellita (Mathilda May) is reduced to her part as “La Gabacha”, her husband Maurice (Gerard Darmon) to “El Gabacho”, her teenage lover Miguel (Miguel Poveda) “El Charnego”, and Tete’s mother (Laura Mana) simply “La madre”. In Jamon jamon, Silvia (Penelope Cruz) is “la hija de puta”, her mother Carmen (Anna Galiena) “la puta madre”, Silvia’s boyfriend Jose Luis (Jordi Molla) “el ninato”, his mother Conchita (Stefania Sandrelli) “la madre puta”, her husband Manuel (Juan Diego) “el padre”, and macho-man Raul (Javier Bardem) comically “el chorizo”. Finally, in Huevos, as in Teta, the male main character is allowed to retain his name (Benito, also played by Javier Bardem), but his first love Rita (Lisa Tovati) becomes, appropriately, “El primer amor, 47 kilos”, his wife Marta (Maria de Medeiros) “La mujer, 45 kilos”, his first mistress Claudia (Maribel Verdu) “La mujer, 52 kilos”, and his last mistress Ana (Raquel Bianca) “La comehombres, ? kilos”. The actors and the roles they play are therefore recognized not as personalities or even personifications in and of themselves, but as signifiers of a second level of signified meanings. Consequently, both the characters and their respective actors are robbed of a certain degree of subjectivity; they become narrative objects, whilst the narrative itself is bared as an artificial means towards the end of signification, or the “making of meaning” from purposely-structured story elements and events.

9 Though out of chronology, this ordering of the films reputedly follows Bigas Luna’s own suggestion that they be watched in order of “the temporal progression of the trilogy from the simple world in which love, rather than eroticism or an aggressive sexuality, dominates, to progressively more brutal stages of sexual desire and its manipulation” (D’Lugo, Marvin. Op. Cit. p. 203)

Before going any further, I would make the distinction here between the terms “part” and “person” as they will be used in this discussion. A “part” is a role, inanimate, before it is characterized, and thereby made animate. It is given voice and action, and ultimately personality, by the person who plays the role or part, who individually embodies this manufactured person and gives it (virtual) life. There is, of course, the question as to whether a role might ever be fully embodied in such a way (whereby the unreal—or as yet unrealised—character effectively appropriates the actor’s real body; s/he takes on—or is taken over by—a wholly new personality, which is no longer mere character but person-made-flesh). It may be that an actor can only go so far as to personify a character, taking on certain personality traits subsequently subsumed into his/her own always already apparent body and identity; the effect of this characterization is not only transitory—that is, impermanent—but transient, slipping always between the actor’s “real” personality and “unreal” character as function of part, and repeatedly taking the audience into and out again from the performance as a whole. Thus there is always a slippage or instability between the three elements of performance—part, person, and character/personality—inherent to characterization.
It is just one example of the deconstructive elements and potential of cinema with which Bigas Luna plays. By making such use of typecasting obvious and explicit in the Retratos ibericos, Luna makes us see and re-read each performance therein as an instance of type masquerading as character. He discovers, unearths and exhibits the different layers of meaning out from under his actors’ performances, and accordingly out from under all their characters’ performances.10 They are forced (if they are really a “they”—after all, “they” the characters are not “real” people) to bear the burden of type from within these narratives—types verbalised, made manifest in the end credits, and looked at for what they are: constructive acts of signification that collapse, upon closer inspection, under the weight of their doubled and overdetermined meanings. Each character is revealed to be not an unreal identity but rather a hyperreal entity, a performance of a performance without original derivation from a “real”, in postmodern terms.

10 Bigas Luna is credited not only for the direction but also for writing the Retratos ibericos with frequent collaborator Cuca Canals; therefore it is safe to say Luna is responsible, at least in part, for the scripting of his characters and how they are referred to in his films.

In such a way, Bigas Luna exposes the act of character impersonation, as seen and as literally written into the texts of his Retratos ibericos, as a vehicle for the construction of narrative meaning, and further as a narrative myth. Characters are presented, are called out directly in these texts as specific types; their personalities, and with them their actions, are, being functions of their labels, always already foregone conclusions; beyond predetermined, they are overdetermined symbols, copies of other copies of persons. They are always already paradoxically overloaded and emptied of significant meaning.

However, if this thesis is flipped around, so that we say not that the characters of the Retratos ibericos are masks for types, but that these types, imposed at the end credits after the fact of their being narratively played out, are in fact masks for successfully individuated characters, we realise these films to be even more densely (over)layered with meaning than previously thought. Especially as, in cinema studies, discourses on “masquerade” performance tend to serve the purposes of feminist criticisms of gender representation,11 I would like here to examine the ways in which Bigas Luna allows certain subtle and individual characterizations of gender types throughout the Retratos ibericos to subvert the very metanarrative of character-typing—stereotyping, archetyping, etc.—with which he is often seen to work in collusion.

Jamon jamon has in particular drawn critical attention for its presentation of generic familial melodrama populated by heavily stereotypical characters,12 and it is, perhaps, that film out of the trilogy that most obviously engages with gender performance as relative to, and as a function of, character-type. Jamon jamon, on one level, may be seen as overly insistent on the traditional stereotyping of its female characters; it seems bent on cornering its actresses into over-played, overly familiar parts and performances, whose narrative significance is always already apparent.
The three main female parts are, as identified in the end credits, “la puta madre” Carmen, the good mother who prostitutes herself for the sake of her children; “la madre puta” Conchita, the castrating mother whose sexual voraciousness is the undoing of her own son and lover; and “la hija de puta” Silvia, the pregnant daughter of Carmen who is wronged by two lovers over the course of her own foray into sexual liberation. All three parts are maternal, versions of the mother figure cliché, and are thus evidence of the film’s being in dialogue with specifically Spanish modes of representation and socio-political and historically situated constructs of meaning. As Gamez Fuentes points out in his article on “Women in Spanish Cinema”, the figure of the mother in modern Spanish films must everywhere be seen, directly or indirectly, as in the context of—and as an intertext with—the clichéd symbol who is both significant of patriarchal (Francoist) power, and by virtue of her gendered difference from the male, threatening to his authority.13 Her problematic/problematizing presence “as a fictional space which has articulated historical-political conflict,”14 as Fuentes writes, explains why in contemporary Spanish cinema, maternal figures are often elided from texts, overburdened as they are by historical, socio-political, and uniquely Spanish meaning. But nevertheless they are emphatically present in Jamon jamon, which might lead us to believe that the very idiosyncrasy of Bigas Luna’s including not one, but three mother figures in this very modern text must be significant of his defiance of Francoism’s lingering influence over modern modes of representation and codes of signification—in itself a meaningful act of rebellion against traditional nationalist practices, both past and of the movie’s present.

11 Hayward, Susan. Cinema Studies, The Key Concepts. Third Edition. London and New York: Routledge, 2006. (132-134)
12 Evans, Peter William. Op. Cit. (14)
13 “Women, as the necessarily submissive element of the patriarchal power balance, pose a constant threat—not just to male superiority and power position—but to male identity itself. The very essence of traditional formulations of male identity hinges on men’s ability to reproduce themselves in the light of those role models and reproduce the patriarchal order.” (Jordan, Barry and Rikki Morgan-Tamosunas. Contemporary Spanish Cinema. Manchester and New York: Manchester University Press, 1998. p. 144)
14 Fuentes, Gamez. “Women in Spanish Cinema: ‘Raiders of the Missing Mother’?” Cineaste. 29 no 1, Winter 2003. (38-43)

I have offered, however, that Luna’s own take on the mother as a gendered symbol of patriarchal/sexist discourses of power is rather more deconstructive of that same archetype’s signifying potential than it is respectful of an actual “significance” or “meaning.” Instead of leaving his three main female characters to be reread into their prescribed, typed roles, Luna literally pre-scripts them as types long clichéd in Spanish cinema, thereby allowing them to deviate in subtle ways from their prototypical models (with type predetermined, they bear with them already a weight of meaning that has only the potential to be cast off, as opposed to established/developed narratively). For the most part, these female types perform their traditionally gendered roles in traditionally “feminine” ways, but self-consciously and attentively so that they are in full control of their own female masquerade, their gender performativity.

The most self-aware performance in Jamon jamon is that of Carmen, the puta madre, whose job as a prostitute requires her to repeatedly play up her difference to/for men, to reaffirm her own subjugation to their desires while being in complete control of her own abilities to do so—and therefore in complete control of masculine desire as well. This is most clearly evinced in the scene where Silvia’s boyfriend Jose Luis comes to Carmen begging for sex with her: assenting somewhat reluctantly, Carmen then performs a partial striptease and seduction for Jose Luis, the enjoyment of which, on her part, is somewhat ambiguous. It appears that her own sexual satisfaction derives from having the power to grant her sexual favours and/or to take them away, but not from the act of sex itself (Jose Luis reaches for her breasts, but she will not let him touch them; she touches him intimately, then moves away before he can actually go any further).15 Most interestingly, Carmen then proceeds to mimic the sounds of her pet parrot, whilst performing a parody of Spanish dance movements; effectively she is mimicking a mimic, the parrot, whilst compounding Jose Luis’s sexual desire in performance for him and simultaneously referencing an aspect of traditionalist Spanish culture. The result is that her idealised status as the puta madre, with all its connotations of the nurturing Virgin and of the sexually gratifying whore, is contaminated by her self-consciously drawing attention to the performativity of her own feminine actions on multiple levels—sexual, socio-political/historical, and cultural—and the fact of her type, as the puta madre, being similarly performative. Also compounding the allusion to performativity on all levels is Jordi Molla’s performance, as Jose Luis, of his character’s reaction to Carmen’s “seduction”: as Chris Perriam notes, physically he is “prone and masturbating”,16 acting out in exaggerated form the physical response of any audience—especially of the cinema—to performance, and especially that of female masquerade. Its intended audience is impotent in terms of its potential effect on the object of its gaze; hence audience response to audiovisual cues is, at least metaphorically, masturbatory, that is, procured for personal pleasure and enacted mentally and physically only on the self—the actual physical manifestation of elicited pleasure in no way affects the source of that pleasure.17 I shall return to discuss the problematic nature of subject/object relationships of desire in greater depth as regards the reversion of the male gaze back on itself as effected via the Retratos ibericos of Bigas Luna. For now, however, it is enough to say that in acting out the role of the sexually indulgent puta madre, the actress Anna Galiena is able paradoxically to individuate and to characterise her typical role away from the stereotypical; her control of the sexual situation successfully undermines and destabilises the puta madre type as sexually submissive, as opposed to dominating and (potentially) powerful.

15 The touching of breasts—and perhaps more significantly, the prevention of doing so—is a recurring trope of the trilogy. In Teta, the main character Tete’s primary goal is to claim “a breast of his own”, and he spends most of his time in the narrative trying to steal glimpses of (and a suckle at) the dancer Estrellita’s. In contrast to Carmen’s powerful and meaningful withholding of her breasts from Jose Luis’s desirous grasp in Jamon jamon, her daughter Silvia freely gives hers to both Jose Luis and Raul, who fetishize them as symbolic of tortilla and ham. For these men, her breasts are symbolic of maternal domestication and nourishment—Silvia’s breasts are thus overdetermined symbols, made to bear a weight of meaning attributable to patriarchal ideology. However, this weighting of Silvia’s breasts-as-symbols is played for laughs, judging by the gusto with which each of her lovers immediately goes for her bosom, and how emphatically they declare her to taste of this tortilla and ham; her breasts therefore parody symbolism, more than they are supposed to have meaningful connotation from within the text of Jamon jamon. Finally, in Huevos, much is made of Benito’s ever-shifting privilege to touch the breasts of his various lovers: his first love Rita appears not to care whether or not he does; Claudia, his future mistress, prohibits him from touching them—they are, until she is ultimately used up and betrayed as sex object and tool for Benito’s commercial ambition, hers alone to handle and elicit pleasure from; Marta, Benito’s wife, like Rita (to whom he also compares her closely in weight) does not find anything objectionable in being handled; and ultimately Ana, with whom Benito ends up in Miami, has fullest control over her body—she has no particular hang-ups about her breasts being touched, but she knows how to use them as offerings at key moments when she wants attention or to make a point (like Claudia’s in the Spanish business world, her breasts are a bargaining tool with Benito; however, unlike Claudia, she dares to renege on her promise that “only you will fondle my breasts”); Ana is invulnerable. As a final thought, as breasts are so highly prominent in the Retratos texts—not only are they constantly literally visible, but also, as I have noted, they carry considerable symbolic weight—it would be interesting to discover to which gendered sphere they might appropriately be considered as belonging. Both men and women in the Retratos ibericos consider the possession of breasts as meaningful, as if personal control over them reflects a degree of social influence and liberty. Indeed, more than the women in these films appear to have penis envy, or to have any interest in, or anything to do with, the many phallic symbols throughout the Retratos (ham and bulls in Jamon, material possessions and skyscrapers in Huevos, human pyramids and ewers—though, complicating matters, these ewers might also be seen as symbolic of breasts—in Teta), men are constantly fixated on using or acquiring them, similar to the way they are almost always fixated on women’s breasts. Ultimately, I think this must prove that women’s breasts in these three films are yet one more example of over-abundant phallic symbols, despite the fact that they belong to women’s bodies—or perhaps because of it, they show phallocentric structures of meaning and symbolism attempting to control the female body outside its comfort zone of manly material objects. Thus when a woman in the Retratos ibericos keeps her breasts to herself for her own pleasure, she subverts patriarchal authority over her body as symbol, and in fact, she denies the male pleasure of objectifying her body.
16 Perriam, Chris. Stars and Masculinities in Spanish Cinema, From Banderas to Bardem. Oxford: Oxford University Press, 2003. (133)

The other two films of the trilogy, Huevos de oro and La teta y la luna, similarly, though less obviously than in the instance of Jamon jamon detailed above, play with the idea of feminine performativity or masquerade as a function of greater role play; “type” is ultimately betrayed as an inadequate signifier of personality and/or female subjectivity. Huevos offers a greater number of female roles (four: Benito’s “primer amor”, “la mujer, 52 kilos”, “la mujer, 45 kilos”, and “la comehombres”) than Jamon’s three, but interestingly they are allowed less overall subjective autonomy in their individual narratives and characterization.
Whereas Jamon jamon is very much the shared story of six main characters, and as Celestino Deleyto has effectively argued, split relatively equally between its three men and three women,18 Huevos is rather more Benito Gonzalez’s personal narrative, with the four “main” female characters taking a relatively marginalized role.19 However, it is interest17 Laura Mulvey in her essay “Visual Pleasure and the Narrative Cinema” would argue, however, that ‘the gaze’, usually identified with the male, is, on the other hand, sadistically voyeuristic—that we take pleasure in subjecting the female to cinematic objectification because our very watching her for meaning implicates her from the very start in narrative processing. We inflict, according to Mulvey, narrative evens onto the passive female image. 18 Deleyto, Celestino. “Motherland: Space Femininity, and Spanishness in Jamon jamon (Bigas Luna, 1992)”. Spanish Cinema: The Auteurist Tradition. Peter William Evans, Ed. Oxford, Oxford University press, 1999. (270) 19 This discrepancy between Jamon and Huevos could, however, be down to the fact that generically they are very different films. Though both, as I am trying to show, ultimately deconstruct their own patriarchal structures of meaning by overloading on symbolism and ‘meaningful’ performances of type, this does not preclude a brief analysis of their genre codification—for it is just one more construct of meaning with which these narratives are to play. Jamon jamon, as Peter Evans has been seen to argue, “{e}s un gran melodrama”, therefore it is only fitting generically that its narrative should incorporate many (stereotypical) characters into a domestic setting, ending in the same place, not very far off in time from whence it began. 
Huevos, on the other hand, fits more the pattern of an epic tragedy, set across stretches of space and time, charting the rise and fall of its male protagonist, whose hubristic personality and ambition ultimately implode into self-pity and –destructiveness. It is, above all, the narrativization of Benito’s inner struggle against himself, so it is only natural that his story be Benito-centric. Incidentally, Robert Lang has written about the links between gender and genre constructions as pertains to the codification of meaning in symbolic discourse in his book Masculine Interests. He



ing to note that, in a way, the female characters of Huevos may all be read against one another, relative to type—or even that the final three may be read as versions of Benito’s first love, Rita. After all, it is to her, the “primer amor, 47 kilos”, that all the other women are compared—in terms of personality (mainly as a function of how “whorish” they are) and, more importantly it seems, in weight, just as if they were any gemstones ready for appraisal by Benito, (to continue in the metaphor) their “stone-cutter” of sorts. Claudia, heavier than Rita (which makes her apparently less desirable, though an authoritarian control over her breasts appears to lend her an idiosyncratic appeal to Benito), and Marta, his wife, lighter (she is otherwise boring unto herself, but being a “featherweight” in Benito’s arms appeals to his need to feel in command and control), together seem to comprise two halves of Benito’s ideal woman (who remains the cheating Rita). They are significantly visually doubled by virtue of Benito’s having drawn on them each like little more than two big blueprints. Also, in the scene leading up to and during their threesome with Benito, their mutual fascination with each other’s bodies and pleasure seems subversively to suggest the redundant presence of the man, Benito, in their womanly, sexual sphere. And finally, it is suggested that Claudia and Marta’s roles as lover and wife, respectively, are not mutually exclusive—that is, meaningful in and of themselves as parts played out for Benito’s sadistic pleasure—when Marta expresses her desire for them to switch places, so that she might have the excitement of being mistress and Claudia the comfort and security of playing wife. At no point do Claudia and Marta’s personalities or subjectivities switch, thus each of their individuated characterizations is preserved, but the social roles they play are revealed to be fluid and, to some degree, voluntary. 
Therefore, their characters take on distinctly female masquerades as dutiful wife and lover as they so choose; the parts they play as stereotypes/myths from within Spanish society do not actually determine their individual personalities. La teta y la luna is more conservative in terms of its narrative scope and characterization than the other two films of Bigas Luna’s Iberian Trilogy. As it is in many ways a children’s story, a coming-of-age narrative (perverted though it may be by sexuality and shades of morbidity), it presents more balance/stability and textual unity than do either of the other two of the Retratos. Thusly more generically clearcut, as compared to the melodramatic (with aspects of the horrific) Jamon jamon20 writes that “masculinity itself is a genre formation. Gender, like genre, is a performative accomplishment… Just as a film genre has no content until a number of genre films render it visible, gender [is].” (Lang, Robert. Masculine Interests. New York: Columbia Univeristy Press, 2002. p. 4) Thus we might see, perhaps, as the Retratos ibericos as being so variably generic as a reflection, or perhaps as a function, of the slightly shifting perspectives on gender and its performance we have seen from film to film. 20 Like both Huevos and Teta, Jamon incorporates a surrealist dream sequence, allusive of Dali and Bunuel, which, as with the other films, disrupts its particular generic, codified progression (in a previous note, I identified Jamon jamon as generically coded for melodrama). However, there are elements within it that, beyond flavouring the film with a touch of surrealism, also infect its narrative with the generic codings of horror. According to Deleyto, the ‘madre puta’ of Jamon, Conchita,



or the tragic Huevos de oro, Teta is located most firmly within phallocentric discourse. All conflicts are put to rights by its end, and the boy protagonist Tete is decidedly taken up into the patriarchal social structure and phallocentric discourse of its regionally specific Catalan setting,21 the specificity of which, in this case, denotes the tenacity of patriarchal symbolic order holding place/subjects to significant, objective meanings. Married though it has been to a precariously heavyhanded symbolism, by Teta’s end the film has still not been disrupted by an Jamonlike eruption of hypermasculine angst and anger, or the implosion of the same, such as at the end of Huevos de oro, when Benito despairingly cries out against every single symbol or marker of his past success and machismo. Nevertheless, there are indeed, as in Jamon and Huevos, instances of self-consciously performed female masquerade, played by Mathilda May playing “La Gabacha” Estrellita, that subversively draw attention to the complete artificiality—and thus the precarious fragility—of the wholly symbolic world to which Tete is inducted throughout the course of the narrative. Most obvious are those moments when Estrellita is onstage performing her act with Maurice, sending up the image of the demure ballerina, an idealised image of womanhood, in a performance that has her cheering for her partner’s farts, baring one of her breasts, and flitting inanely about the stage to an Edith Piaf song. Also in the beginning and ending credits of the film, Estrellita is seen emerging and finally disappearing back into a box like the figurine in a music box, aping the motions of an inanimate object which is itself the copy of a real female figure. 
In such a way she ironically copies her own feminine body after an inauthentic copy; she hyperrealizes herself to the point that subjugating herself to the male gaze—for which she predominantly performs onstage, judging from the breast-baring sketch—is emptied of any sexual significance because the body with which she presents herself is not significant of her own being but of some simularepresents the two aspects of the ‘monstrous feminine’ as defined by theorist Barbara Creed: the phallic woman and the archaic mother (who has the vagina dentata). (Deleyto, Celestino. Op. Cit. pp. 279-280) Meeting Creed’s criteria for monstrous femininity, Conchita both has been abandoned by her child in his attempts to insinuate himself into the patriarchal symbolic order (i.e. move out, marry Silvia, and raise a child of his own) and brings about a confrontation between this symbolic order “and that which threatens its stability”, that is, subjecting a macho male, Raul, to her sexual whims. (Creed, Barbara. Pandora’s Box: Essays in Film Theory. Melbourne: Australian Centre for the Moving Image, 2004. p. 39) Effectively, Conchita subjects both the two young main males to an encounter with the abject, that is, as defined by Kristeva, death and the corpse, “a weight of meaningless”, “ambiguity”, and an “attempt to release the hold of the maternal entity”. (Kristeva, Julia. Powers of Horror: An Essay on Abjection. New York: Columbia University press, 1982. pp. 4-13) This abject-ification, of sorts, of the Jamon text, fully disrupts and deconstructs its textual phallocentricism. 21 Teta’s setting is specifically “micro-regional” (D’Lugo, Marvin. Op.Cit. p. 198), whereas Jamon jamon, set in Los Mongros in Aragon, is more largely regional. (Deleyto, Celestino. Op. Cit. p. 271) Huevos covers the most space, set in several different places in Spain before the action relocates to Miami, Florida. 
This might be read across the Retratos ibericos as a movement from the most limited/immature framework of social discourse to the most international, which is also the most sexually mature and ultimately also the most divested of social consequence.



crum of an idealised womanly body. While D’Lugo has identified “Estrella’s role as the elusive object of collective desire that transforms the cultural fetishization of sexual identity into a constructive identity,”22 I believe it is rather more that she, as the “elusive object of collective desire” remains intangible to her diegetic and extra-diegetic audiences, in that her constant over-performance of this paradoxically parodic and sexually idealised ballerina protects her real identity from being used and construed/constructed as meaning as such.23 Overall, female masquerade in the Retratos ibericos, as it results in the constant confounding of type through subversive acts of characterisation and gender performativity, upends the phallocentric discourse of symbolism running parallel to these recurring instances of the subversion of gender-as-meaning in Bigas Luna’s Iberian Trilogy. But the films also go one further: other than destabilising the symbolic authority of the filmic image to convey meaning through the acting-out of female performance beyond the logistic realms of phallocentric discourse, Luna’s Trilogy paints male performativity of gender as equally malleable to females’ and as frequently disturbed. For example, the first-made Jamon jamon ends with (seemingly) macho through-and-through Raul kneeling, weeping, before Conchita, whose son he has just killed in an outburst of hypermasculine aggression and overcompensation for his sexual subordination to Conchita. In this case, Raul’s machismo is over-performed to the point where both body and mind can no longer support the performance, exploding outwards in violence against others, and imploding inwards on Raul’s sense of self. He can no longer express himself past a few gutteral apologies to Conchita, and after that, even verbal discourse and structures fail him. 
According to Chris Perriam, “[w]hat he seems to register in these moments is the shocked acceptance of the punishment for performing excessively and without nuance those masculine ideals that ought to have brought sexual and material success.” This is also seen in Huevos de oro,24 wherein Benito (who is also played by Javier Bardem) similarly systematically destroys every lasting symbol of his successful foray into the highly structured, heavily male-gendered business world. He collapses after cursing even “Passion”, the sexual liaison between man and woman upon which male dominance is theoretically founded. Then, as D’Lugo writes, Coming right after Huevos de oro, Teta might well be read as a manifestation of the longing for a return to simpler times moti22 D’Lugo, Marvin. Op. Cit. (212) 23 Adding one more dimension to the elusive complexity of Estrellita’s character as masked by her show persona is the fact of her character’s Portuguese heritage. She is consistently referred to as French in the film, and indeed she is married to a Frenchman, dances to Edith Piaf, and regularly wears a large ballet tutu; however, towards the beginning of the film Tete states that she is actually Portuguese—yet another aspect of her character that is elided in her performance of idealised femininity onstage. 24 Perriam, Chris. Op.Cit.(98)



vated by the Iberian male’s recognition of his ensnarement in the self-deluding myth of his own power and virility… yet the “age of innocence” evoked in Teta, similar to that of Huevos de oro, is fixed on the image of male anxiety and immobility, this time atop the human pyramid.25

Whereas “[t]he final scenes of both Jamon and Huevos could thus be interpreted as a nostalgic farewell to the macho Iberico, a figure unable to survive in a newly globalized, and at the same time de-centralized, Spain,”26 the lasting image of Tete set atop the (heavily phallic-coded) nationalistic symbol of the Catalan castella promises, at the end of the Retratos Ibericos cycle, the rebirth/resurgence of just such a macho figure. He has grown, similarly to Benito and Raul, to be fixated on women’s breasts as symbols of female difference, and thus deference, to the Spanish male authority that distinguishes them originally and dichotomously as female, the not-male. It is the ultimate sign of Tete’s initiation into patriarchal Spanish society that he finally “has balls” enough to climb to the top of the human tower, and that he has accordingly been recognized as such, as male, by the female objects of his affection, Estrellita and his mother, who give (again) their breasts to him in appreciation for his proven sexual difference.27 However, if we have learned anything from the similar narrative trajectories of Raul and Benito in Jamon and Huevos, respectively, we know that the achievement and over-performance of masculinity now—a flaunting of what he deems to be his hard-earned “balls”—can only result in later castration. Just as Raul and Benito face metaphorical castration—the diminishment of their sexual, economic, and social success—by the end of their narratives, and just as the well-known symbol of Spanish patriarchal-national identity the Osborne bull, omnipresent in Jamon jamon, is in that film literally (as much as a giant animal cut-out can be literally anything) and unceremoniously castrated, so it seems that Tete, following in the narrative footsteps of his predecessors, will, as a result of his similarly performing male masquerade, eventually be too. 
Consequently, enacted (in Jamon jamon and Huevos de oro) and promised (in La teta y la luna) throughout Bigas Luna’s Retratos Ibericos, we have the ultimate upheaval of what film theorist Laura Mulvey has famously criticised narrative cinema as imposing on the female figure, which is the bearing of the burden of narrative signification, of being made to represent, to symbolize meaning, by virtue 25 D’Lugo, Marvin. Op. Cit. (205) 26 Fouz-Hernandez, Santiago and Alfredo Martinez-Exposito. Live Flesh: The Male Body in Contemporary Spanish Cinema. London and New York: I.B. Tauris, 2007. (26-27) 27 Proof of Steve Neale’s position that “Where women are investigated, men are tested.” (Neale, Steve. “Prologue: Masculinity as Spectacle, Reflections on men and mainstream cinema”. Screening the Male: Exploring masculinities in Hollywood cinema. Steve Cohan and Ina Rae Hark, Eds. London: Routledge, 1993. p. 19)



alone of being looked at.28 Here instead, we have femininity and masculinity seen to be equally performative, equally masqueraded, so that both the male and female are signifiers of socio-cultural types (whether they be valid and reliable reflections of “real” persons or not), and so that performances of either gender may be said to connote signification (to be read, to-be-looked-at) itself. The crucial difference is that throughout Luna’s trilogy of films, women appear to be always already aware and willing to self-perform, so that each individuated character is protected under several layers of parodic performance. For the male, however, and especially for the protagonists Raul and Benito, the act of looking outside the self in order to perform masculinity as an element of one’s character necessitates the uncomfortable crossing of a boundary between the self and that which exists outside it. This verges on bringing about abject experience, as defined by Kristeva, and results in the male characters’ ultimate crying out against this self-introspection-cum-projection of the self onto externalized character and personality. For as Mulvey has said, the male cannot bear the objectification of his own male selfhood to the gaze, even if it is his own. To repeat an observation by Evans regarding characterization within Jamon jamon: “Es un gran melodrama… con se personajes que prototipos de la nuestra pais.”29 In the Retratos Ibericos as a unified cycle of films, Bigas Luna’s objective is to present a (granted, heavily parodic and self-conscious) cast of prototypical character symbols: figures who, if they do not populate his nation in fact, represent through a phallocentric/patriarchal/symbolic discourse of cinema those reflections of selfhood made familiar to the Spanish nation through the filmic medium.
Thus we have a prototypical self-effacing female of indeterminate origins set against an alternately enthusiastically macho-performative and self-doubting Spanish male. Whilst the female prototype is evidently capable of gaining some degree of ironic distance from herself as contextualized entity, the male, at least in the films of Bigas Luna, still finds it to be a destabilizing, abject experience. Thus we may see Bigas Luna as setting the Spanish male, and with him the future of Spanish cinema and modes of gender, genre, and symbolic representation, on the edge of itself, located somewhere between past historical context, present ambition, and apprehension of the promise of future self-actualization, or disappointment.

28 Mulvey explains, “According to the principles of the ruling ideology and the psychical structures that back it up, the male figure cannot bear the burden of sexual objectification. Man is reluctant to gaze at his exhibitionist like… In a world ordered by sexual imbalance, pleasure in looking has been split between active/male and passive/female… with their appearance coded for strong visual and erotic impact [women] can be said to connote to-be-looked-at-ness.” Thus the male gaze always already imposes meaning on the visual sign of the female. (Mulvey, Laura. Visual and Other Pleasures. Indianapolis: Indiana University Press, 1988. pp. 15-19)
29 Evans, Peter William. Op. Cit. (14)



EARTH SCIENCES PANEL

JUDGING PANEL
Prof. Paul D. Ryan (NUI Galway) – Chair
Dr. John Graham (Trinity College Dublin)
Dr. Ed Jarvis (University College Cork)

Judges’ Comments
This essay analyses the possible reasons why fossils of Irish Pleistocene megafauna are not associated with evidence for early man, whilst the two are closely associated elsewhere, particularly in the UK. It is a mature, balanced, well illustrated, well referenced and authoritative account that is worthy of publication in a journal promoting science. An essay of this quality can only help promote science in Ireland.


EARTH SCIENCES

Early human settlement in the British Isles & Northern Hemisphere glaciations
Caroline Martin

“Evidence of evolution, including evolution of cultural complexity, is generally found not in a time series but rather in a series of discrete ‘snapshots’ that frequently cover a broad geographic area” (Vrba 1995).

Introduction

The above statement refers to the scant fossilised remains and, in some cases, the historic folklore of erstwhile populations that shed intermittent light on the course evolution may take. Evolution and species migration, although characterised by changes in life’s genetics and behavioural patterns, are not processes solely defined by their organic components and cultural artefacts; the host environment of any given living assemblage steers evolutionary direction in space and time, and thus climatic signals in the geological record, which can be relatively unremitting, allow for an interrelationship between climate and evolutionary trajectories to be assessed. This hypothesis has been tested for the case of human evolution by Behrensmeyer (2006). The course of early human settlement in Britain correlates in time with the retreat and advance of Northern Hemisphere glaciations during the Pleistocene (~ 1800-11 thousand years before present (B.P.)) (Ashton and Lewis 2001). To date, no remains of human populations in Ireland have been found that are older than the


Holocene (~ 11,000-0 years B.P.). This study collates the relevant palaeoclimatological data – marine oxygen isotope stages – and archaeological data – mammalian faunal remains and evidence of human settlement – currently in the literature for the British Isles, so as to test the correlation in time of early human settlement with other major mammalian groups, within the climatic framework of coeval changes in environmental conditions, such as ice-sheet advance, sea-level fluctuation, temperature change, landscape alteration and habitat perturbation. By placing the timing and trends of early human settlement in Ireland within the context of that of Britain and mainland Europe, this account elucidates the contrasts between the distinct faunal and palaeoclimatological data sets characteristic of the British Isles. The role of climatic variability in the shaping of evolutionary trends is discussed with cross-referencing of case studies from Europe, the Americas and Asia. Climate change, together with the associated palaeogeographical variation of the British Isles during the Pleistocene, provides a backdrop from which to assess mammalian migratory patterns. The forces that drove early man to travel from mainland Europe to insular or peninsular Britain and Ireland, across waterway barriers or land bridges, have implications for understanding the stark contrasts within the British Isles’ prehistory.

Fact or ‘Artefact’?

Attempts to study terrestrial geology or archaeology from a pre-glacial setting are thwarted by the fact that glaciers despoil the landscape by their rock-scouring ability. This argument, however, which is based on first principles, by no means eliminates faunal studies from pre-glacial and indeed inter-glacial environments; for example, extensive research into Pleistocene Ireland was undertaken as part of The Irish Quaternary Fauna Project (Woodman et al. 1997). In their study of the British Late Mid-Pleistocene, relating climate, habitat preference and sea level, Ashton and Lewis (2001) provide a summary of the problems encountered in attempting to relate human remains and artefacts with population densities. Stuart (1995) compares remains of Pleistocene mammalian faunas, such as giant deer, hyena and humans from Europe, Britain and Ireland, and highlights the difficulties in using faunal diversity as a measure of insularity of the British Isles due to uncertainties in age correlations. For example, a species of reindeer (Rangifer tarandus), known in Pleistocene Europe to be adapted to cold conditions, persisted in Britain into the Holocene, whilst several other cold-adapted taxa, such as arctic fox, did not survive the Pleistocene/Holocene transition. However, in the case of Ireland, the obstacle is that there are, as yet, no Pleistocene human records to correlate with the other mammalian fossils that are present in abundance; perhaps such records once existed but remain unfound or have been destroyed. This fact or ‘artefact’ that humans and other mammals did not coexist in Ireland until the Holocene has implications for the current understanding of the nature of human migratory patterns throughout Europe and indeed the rest of the world.


Fig. 1. An outline of the temporal, palaeoclimatological and archaeological sequences of Quaternary divisions for Britain and Ireland based on δ18O sub-stages, glacial (G)/interglacial (I) stages as defined by pollen sub-stages, and cultural artefacts. Attention is drawn to two main points: firstly, the onset of the Midlandian pre-dates the Devensian, indicating the distinction between Britain and Ireland. Secondly, the first human settlements in Britain occur in the Cromerian, during which time Ireland was in the Pre-Gortian glacial. Data taken from Godwin (1956); Bowen and Sykes (1994); Tzedakis et al. (1997); West (2000); Waddell (2007).


For example, there is definitive evidence that Homo sapiens had migrated from Asia to the Americas by at least ~ 11,000 years B.P. via Beringia, the connective land bridge that existed in the Late Pleistocene between Alaska and Northeast Asia (Meltzer 1995; Fiedel and Haynes 2004; Gonzalez 2007). Evidence of the ‘Clovis’ method, an excellent example of precise lithic technology, designed for spearing fauna (Meltzer 1995), indicates that ‘Clovis’ man may have been the pioneer of North America who followed these fauna to Alaska out of Asia and thence to South America (Gonzalez 2007). The ‘Clovis’ theory relies on the acceptance of a clear temporal relationship between early man and their faunal quarries; however, there is mitochondrial DNA, archaeological and linguistic substantiation of an earlier group of Homo sapiens whose presence in North America is not supported by coeval migration with fauna via Asia, but who nonetheless appear to have been predecessors of the ‘Clovis’ people and may date to as far back as 40,000 years B.P. (Nichols 1990; Schurr 2004). The earliest Homininae (currently a subfamily of Hominidae) records in Europe have recently been unearthed from the site of Sima del Elefante, Atapuerca, Spain, and date to 1.1-1.2 million years ago (Carbonell et al. 2008). Britain boasts artefacts from the genus Homo since ~ 500,000 years B.P., during the Cromerian interglacial in the Palaeolithic division of the Pleistocene (Bowen and Sykes 1994; Waddell 2007) (Fig. 1). In Ireland, the earliest Homininae are Homo sapiens and the first settlements date to ~ 9,000 years B.P., in the Mesolithic division of the Holocene (Waddell 2007) (Fig. 1). This places rudimentary cultural and societal development of the genus Homo in Ireland up to ~ 500,000 years later than that of Britain, which itself is more in tune with early settlement of mainland Europe (Bowen and Sykes 1994; Waddell 2007; Niven 2007). Fig. 2 illustrates the global distribution and the timing of the first definitive settlement of early man during the Pleistocene, with an emphasis on the temporal and spatial contrasts between adjacent land masses.

Biological and Botanical Evidence of Environmental Change

Several key taxa of Pleistocene fauna and flora have been used to constrain the timing and nature of environmental change that affected northeast Europe. Evidence comes from terrestrial and marine samples that range from pollen grains and spores to foraminifera and ostracods to giant elk and mammoths. By representing migratory pulses, faunal turnovers provide insight into palaeogeography and the behavioural patterns of major and mega-fauna in response to sea-level fluctuation (e.g. Turner 1995; Schreve 2001b). These fluctuations in turn reflect the changing temperatures and physical conditions of the oceans, which are preserved in the isotope signatures of micro-fauna (e.g. Bowen et al. 1989; Ashton and Lewis 2002) and can be corroborated with terrestrial palynological evidence that elucidates seasonal and longer climatic shifts (Turner and Kerney 1971; Tzedakis et al. 1997). Fig. 3 provides a summary of the broad scale temporal correlations between fauna, flora and climate that define the European environment during the Mid-Upper Pleistocene (500,000-0 years B.P.).


Fig. 2. Global distribution of early man in the Pleistocene as indicated by stars. The dates, in thousands of years (ka) and millions of years (Ma), represent the timing of the earliest definitive settlement of early man on that land mass. Attention is drawn to three main points: firstly, human settlement in Britain is dated to ~ 500 ka, whilst Ireland, ~ 250 km away, is unpopulated. Secondly, many other isolated land masses are unpopulated, including New Zealand, Madagascar and the Polynesian Islands. The colonisation of these islands coincides with the first definitive evidence of boat excursions (Cosgrove 2007); most are situated in further isolation from nearby populated land masses than Ireland is from Britain and mainland Europe. Thirdly, migration from North to South America, which must have been through a multifarious and sometimes harsh environment, took place in a matter of centuries. This represents a distance of ~ 17,000 km. Africa, being the birthplace of human evolution, is denoted only by a single large star. Data taken from Groube et al. (1986); Bowen and Sykes (1994); Meltzer (1995); Gibbons (2001); Morwood et al. (2004); Zhu et al. (2004); Parfitt et al. (2005); Cosgrove (2007); Gonzalez (2007); Waddell (2007); Carbonell et al. (2008).
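The contrast in settlement timing summarised in Fig. 2 can be made explicit with a little arithmetic. The sketch below is a minimal illustration only, assuming the approximate dates quoted in the text (Britain ~ 500,000 years B.P., Ireland ~ 9,000 years B.P., the Americas ~ 11,000 years B.P., and Atapuerca taken at the younger bound of its 1.1-1.2 million year range). The dictionary and function names are hypothetical and do not come from any of the cited studies.

```python
# Approximate dates of earliest definitive settlement, in years B.P.,
# as quoted in the text. All names here are illustrative assumptions.
FIRST_SETTLEMENT_BP = {
    "Atapuerca (Spain)": 1_100_000,  # younger bound of the 1.1-1.2 Ma range
    "Britain": 500_000,
    "Americas": 11_000,
    "Ireland": 9_000,
}


def settlement_lag(earlier: str, later: str) -> int:
    """Return the years between first settlement of `earlier` and of `later`."""
    return FIRST_SETTLEMENT_BP[earlier] - FIRST_SETTLEMENT_BP[later]


lag = settlement_lag("Britain", "Ireland")
print(f"Ireland was settled about {lag:,} years after Britain")
```

On these rounded figures the lag between Britain and Ireland is 491,000 years, which is consistent with the "up to ~ 500,000 years" stated in the text; the result is only as precise as the approximate dates fed into it.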

Ice Sheet Behaviour & Habitat Preference

The Last Glacial Maximum (LGM) in Britain occurred during an ice age known as the Devensian, which persisted from ~ 110,000-10,000 years B.P. (Stuart 1982) and reached its maximum southerly extent ~ 22,000 years B.P. (Mix et al. 2001; Bowen et al. 2002). The Devensian broadly correlates in time with the Midlandian ice sheets, which periodically advanced across Ireland in what are known as Heinrich events during the last ice age, from ~ 120,000-10,000 years B.P., and like the Devensian reached their maximum extent ~ 22,000 years B.P. (Bowen et


al. 2002). In fact, some authors refer to both the Devensian and Midlandian ice sheets collectively as the British and Irish Ice Sheet (BIIS) (e.g. Bowen et al. 2002), which for studies of prehistoric human settlements, such as presented by White and Schreve (2000), is an oversimplification. It is shown in Fig. 1 that they are not entirely coincident. These glacial advances shoaled global sea levels to that of ~ 120 m below the present mean value (Siddall et al. 2003; Gonzalez 2007). For the British Isles, the magnitude of the rise and fall of sea level relative to present levels was of the order of 30-50 m (Keen 1995; Lambeck and Purcell 2001). These sea level fluctuations were sufficient to cause sub-aerial exposure of hitherto submerged land masses, including connective bridges between Ireland and Britain and between the British Isles and continental Europe and littoral and sub-littoral zones (Devoy 1985; Lambeck 1996; Lambeck and Purcell 2001; White and Schreve 2000). Isostatic rebound of the land occurred during the interglacial retreats of the ice sheets, causing local relative changes to sea level as differential uplift and subsidence occurred (Lambeck and Purcell 2001). It was during the warmer interglacial periods of the Pleistocene known as interstadials that a rich Mammalian fauna diversified in the British Isles (Woodman et al. 1997). During colder times, abandonment and migration took place, often southwards, when available ecospace in northern Europe was limited by the extent and distribution of the ice (White and Schreve 2000). The early interstadials also were times of great mammalian migrations from continental Europe to the British Isles, providing animals like woolly mammoth with essential exposed connective land bridges by which they could make the transition (White and Schreve 2000). 
Perhaps the necessary interplay of marine regression and land exposure was optimised during the early and late phases of interstadials, when ice sheets were less of a barrier whilst sea levels were still low or beginning to shoal. Conditions during the LGM, for example, may have imposed a climatic threshold, which when crossed would have rendered Ireland, and perhaps most of Britain, wholly inaccessible.

The mammalian populations of Pleistocene Britain included settlements of Homo heidelbergensis, an ancestor of Homo neanderthalensis and Homo sapiens, as far back as 500,000 years B.P., where remains have been found in temporal, geographical and ecological association with several faunal taxa, such as tools carved from deer bones, butchery marks on large mammal bones and later even cave art depicting wild fauna (Pike et al. 2005; Waddell 2007; Ugan and Byers 2007a; Ugan and Byers 2007b). These early inhabitants exploited flint from within the southern chalk cliffs for use as weapons and tools (Roberts and Parfitt 1999). It has been proposed that early man, by exploiting fauna for food, clothing and materials, would have followed herds across land masses as glaciers advanced and retreated and that they would have settled accordingly (e.g. Haynes 1980). Such quasi-synchronous migrations could have occurred via land connections between continental Europe and Britain during the late Pleistocene (White and Schreve 2000).


Fig. 3. Temporal correlation between the marine oxygen isotope stages (OIS) 1 – 13, European palynological stages, and major faunal sites in Britain and Ireland. The oxygen isotope curve represents global ice volume fluctuations as derived from the composition of benthic foraminifera. The odd-numbered OIS stages (interglacials) are tuned to orbital frequencies. The palynological curve illustrates the percentage of pollen grains, which, when high, signifies forest growth. A high percentage during interglacials shows that forests are favoured during these intervals; however, the terrestrial pollen data record more climatic variability than the marine record. The dashed and solid lines are based on data from two species of pine. Oxygen isotope and palynological graphs modified after Tzedakis et al. (1997). Data compiled from Martinson et al. (1987); Toth and Schick (1993); Mithen (1994); Woodman (1997); Aldhouse-Green (1998); Currant (1999); White and Schreve (2000); Schreve (2001b); Currant and Jacobi (2001); Stuart and Lister (2001); Waddell (2007).


[Figure 4 plot: "Major fauna vs human occurrence". Vertical axis: major faunal assemblage dates (thousands of years B.P.), 0 to 500. Horizontal axis: human artefact dates (thousands of years B.P.), 0 to 500. Linear fit with R² = 0.9971.]

Fig. 4. Strong spatial and temporal correlation of British major faunal and human artefact assemblages since 500 ka. The gap in the data represents ~150 ka in Britain’s prehistory for which there is no evidence of human occupation. Data taken from Dawkins (1910); Stringer and Gamble (1993); White and Schreve (2000); Currant and Jacobi (2001); Schreve (2001a-b); Gowlett (2006); Gilmour et al. (2007); Waddell (2007).

By collating relevant data, it is illustrated here that there is a strong correlation in time between British sites of major fauna and those of early man since the Mid-Late Pleistocene (500,000-0 years B.P.) (Fig. 4). In contrast, the mammalian assemblage of Late Pleistocene Ireland, whilst rich in major fauna, is devoid of remains of early man; however, a strong correlation in time exists between the major faunal groups of Britain and Ireland since the Late Pleistocene (50,000-0 years B.P.) (Fig. 5). The results show that, unless the remains of Irish early man have been obliterated entirely or are as yet unfound, synchronous migration of major fauna and early man into Ireland did not occur. Several authors provide palynological, palaeoclimatological and archaeological evidence of faunal evolutionary turnovers and of cycles of human colonisation and re-colonisation in Britain, which occurred in response to the fluctuating climatic conditions during the Pleistocene (Stuart 1982; Stuart and Van Wijngaarden-Bakker 1985; Currant 1989; Tzedakis et al. 1997; Currant and Jacobi 2001) (Figs. 4-5). The same data types provide evidence for similar patterns of climatology and biological behaviour existing in Pleistocene Ireland, with the exception of human activity (Fig. 3). It is possible that the British Isles, and Ireland in particular, being the extremity of a northwestward faunal migration pattern observed across the whole of Western Europe (Hewitt 2000), may have been characterised by less of a demand for fauna by early hunters. Woodman et al. (1997) suggested that the evidence for human activity in Late Pleistocene Britain was not sufficiently resolved to provide time constraints on the duration of discrete settlements, and that they may have been intermittent and relatively short-lived. Following this line of investigation, Woodman et al. (1997) suggested that the absence of Pleistocene humans in Ireland could be attributed to rising sea levels, which would have isolated the island during habitable interstadials from an already sparsely populated Britain, and would also explain why faunal diversity was somewhat lower. However, of the taxa that are identified to have been abundant in Ireland during the Pleistocene, several are known to have been principal species in Britain that were exploited there by early man as a staple resource (Waddell 2007) (Figs. 4-5).
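The strength of a correlation of the kind reported in Fig. 4 is usually summarised by the coefficient of determination of a straight-line fit. The following is a minimal sketch of that calculation only; the dates in it are placeholders invented for illustration, not the collated data behind Fig. 4, and the function name `r_squared` is an assumption of this sketch.

```python
def r_squared(x, y):
    """Coefficient of determination for a least-squares straight-line fit of y on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    return sxy ** 2 / (sxx * syy)

# Placeholder dates in thousands of years B.P. (illustrative only, NOT the Fig. 4 data)
human_artefact_dates = [500, 400, 300, 60, 30, 12]
faunal_assemblage_dates = [495, 410, 305, 55, 33, 10]

print(round(r_squared(human_artefact_dates, faunal_assemblage_dates), 4))
```

A value close to 1 indicates that the two sets of dates rise and fall together; it does not by itself show that humans and fauna migrated together.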

Patterns of Climate, Palaeogeography & Mammalian Migration

White and Schreve (2000) labour the point that the late glacial/early interstadial temporal distributions of human settlement in the British Isles represent pulses of abandonment and re-colonisation episodes. These would have been characterised by a time lag in deglaciation with respect to rising temperatures and sea-levels and it is suggested that efficient re-growth of flora would have promoted herbivore, then carnivore and/or human re-colonisation. In their tripartite model, the late glacial/early interstadial period, when reestablishment of erstwhile or new communities occurred, is the second of three phases, which combine palaeogeographical, palaeoclimatological and archaeological evidence in support of a working framework from which to reconstruct human settlement in Britain. The first phase is during maximum ice extent, when abandonment to the southeast occurs, and the third phase is during the regressive extremes of the interstadial, characterised by occupation of Britain coupled with isolation from mainland Europe.

Skeletal remains of Pleistocene assemblages containing woolly mammoth, brown bear, spotted hyena, giant deer, reindeer, red deer, wolf, wild horse, arctic fox, hare and lemming are described from several locations across Britain and Ireland, during both glacial and interstadial times, where they survived the destructive weathering of the ensuing glaciers (Woodman et al. 1997; Currant and Jacobi 2001; Stuart and Lister 2001). If abandonment was the status quo of Pleistocene Britain, the latter point suggests that, during glacial advances, perhaps humans were the first to leave (Fig. 3). Some communities of fauna, such as those listed by Woodman et al. (1997), may have remained in un-glaciated pockets during total insularity.
However, endemic evolution, such as there is evidence for in Late Pleistocene Jersey (Lister 1995), has not been reported from mainland British fauna sites of any age. In addition, evidence of pulsed migration of major fauna into Ireland during the Pleistocene, including re-colonisation by some species, has been presented by Woodman et al. (1997). Ireland is not differentiated in the speculative investigation of mammalian migratory patterns presented by White and Schreve (2000) and the questions surrounding the absence of Pleistocene man persist.

[Figure 5 plot: "Major Fauna (Britain and Ireland) vs time". Vertical axis: major faunal assemblage dates (thousands of years B.P.). Horizontal axis: time (thousands of years B.P.). Series: Ireland, Britain.]

Fig. 5. Strong temporal correlation of major faunal assemblages from Upper Pleistocene Britain and Ireland. Data taken from Currant (1989); Stuart (1995); Woodman et al. (1997); Coard and Chamberlain (1999); Currant and Jacobi (2001); Schreve (2001b); Gilmour et al. (2007); Waddell (2007).

Using the tripartite model of White and Schreve (2000), the time-lag idea could have implications for elucidating migratory patterns in the British Isles. If man and fauna abandoned and re-entered Britain on a cyclical basis during discrete windows of opportunity, as suggested by the model and supported by archaeological evidence (Mithen 1994; Toth and Schick 1993; Aldhouse-Green 1998) (Fig. 3), then perhaps Ireland lay beyond reach, at least by land, for early man, who for genetic, social or demographic reasons may not have travelled as far or as fast as other mammals. There is evidence that ‘Clovis’ man spread from Alaska to Patagonia, a distance of over 17,000 km, in fewer than 2,000 years (Gonzalez 2007), but this is contested on the grounds that it allows too little time for a plausible evolutionary trajectory (Meltzer 1995). The most conservative evidence for the evolution and geographical dispersal of early man in the Americas, based on mitochondrial DNA evidence, linguistic patterns and dispersal rates, puts their initial arrival at ~ 40,000-50,000 years B.P. (Meltzer 1995); therefore, at its most cautious estimate, migration from the north of North America to the south of South America took just under ten times less time than the interval between the initial occupation of Britain at ~ 500,000 years B.P. and that of Ireland at ~ 9,000 years B.P. A more conservative estimate of the time lag between human colonisation of Britain and Ireland is obtained if human and major faunal migrations are coupled; this would place plausible human settlement in Ireland at ~ 120,000 years B.P. (Waddell 2007) and would reduce the time deficit to just over twice as long as human migration from North to South America.
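The comparisons above are simple ratios of time intervals, and can be checked with a short sketch using only the dates and distance quoted in the text. The helper name `lag_ratio` is hypothetical, and the reading of the second comparison (major fauna reaching Ireland at ~ 120,000 years B.P., humans at ~ 9,000 years B.P.) is an assumption made here because it reproduces the "just over twice" figure.

```python
def lag_ratio(start_bp, end_bp, reference_span):
    """Interval between two dates (years B.P.) expressed as a multiple of a reference span (years)."""
    return (start_bp - end_bp) / reference_span

# Clovis dispersal: over 17,000 km in fewer than 2,000 years (Gonzalez 2007),
# so this is a lower bound on the implied rate in km per year.
clovis_rate_km_per_year = 17_000 / 2_000

# Most cautious span for the peopling of the Americas (upper end of 40,000-50,000 years).
americas_span = 50_000

# Britain first occupied at ~500,000 years B.P., Ireland at ~9,000 years B.P.
britain_to_ireland = lag_ratio(500_000, 9_000, americas_span)

# Assumed reading: fauna reach Ireland at ~120,000 years B.P., humans at ~9,000 years B.P.
fauna_to_humans_ireland = lag_ratio(120_000, 9_000, americas_span)

print(clovis_rate_km_per_year)            # 8.5
print(round(britain_to_ireland, 2))       # 9.82
print(round(fauna_to_humans_ireland, 2))  # 2.22
```

On these figures the Britain-to-Ireland interval is a little under ten times the American span, and the coupled estimate a little over twice it, which matches the wording of the paragraph.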

Discussion

The points presented in this study raise two possibilities: one, that some combination of physical or biological restrictions impeded human migration into Pleistocene Ireland – for example, an impenetrable landscape such as thick vegetation, an insurmountable physical barrier in the form of water or ice, or populations of insufficient density to support re-establishment – or two, that humans did exist in Ireland in the Pleistocene, but that their remains have been obliterated or are as yet unfound. The two ideas in support of no humans in Pleistocene Ireland will be further discussed here.

Roe (1996) described two lithic cultures that influenced cultural development in Britain. The first of these are known as the Clactonian people and are largely defined by the absence of hand axes in their society. The second are Acheulean, who unlike the Clactonians, were well equipped with hand axes. If thick vegetation were to hinder human settlement in Ireland, it may be supposed that the Clactonians would have been particularly inept at felling niches for themselves. However, hand axes comprise some of the earliest artefacts of human settlement in Britain (Meltzer 1995), described from the Boxgrove site in Sussex and dated to ~ 500,000 B.P. (Roberts and Parfitt 1999), and so manpower was not the limiting factor. However, if Ireland was only accessible by tenuous land connections, as has been suggested by Devoy (1985), then perhaps migration was physically impeded during times when Britain was occupied by the more capable human settlers. The latter point is to some extent refuted by the abundant Irish Pleistocene fauna that would have made their way via land connections likely suitable, too, for human passage. Excursion by boat would seem to provide the obvious answer to the quandary of accessing island Ireland.
However, at this time in prehistory, there is yet only unsubstantiated evidence that trans-oceanic migrations were taking place, and no evidence that Northern Europeans had developed the technology at this point (Meltzer 1995). Climatic conditions and environmental instability impose a constraint on population growth and migration speeds are thus retarded (Whitley and Dorn 1993). This puts pressure on founding groups to be large enough in size to cope with dispersal; according to Woodman et al. (1997) these were discrete and isolated settlements in Pleistocene Britain. Thus, there may have been population density limitations due to environmental duress, and physical barriers that prevented Ireland from being populated by humans until amelioration of the climate occurred in the Holocene and agriculture took over from hunting as the main form of human sustenance, which offered incentives to devise technologies for the purpose of clearing land of coarse vegetation, and to demographically expand (Kalis et al. 2003).

Many other parts of the world remained unpopulated by humans until well into the Holocene, including New Zealand, Madagascar and the Polynesian Islands (Green and Sussman 1990). New Zealand, whose first human inhabitants arrived ~ 1,000 years B.P. (Diamond 2000), is over 2,000 km from mainland Australia, and as such is considerably more isolated than Ireland is from Britain (~ 240 km max.). Madagascar’s and the Polynesian Islands’ first humans did not arrive until the Late Holocene (Diamond 2000), but perhaps their location was not in the immediate paths of human migratory trajectories and in most cases, deep and vast expanses of water lay between them and the populated mainland. The delayed human migration from Britain to Ireland is analogous and somewhat temporally related to that of the inhabitants of Pleistocene Indonesia, who failed to cross the 100 km water divide to the ancient landmass of Sahul, north of Australia, despite their close proximity from 800,000-12,000 years B.P. (Morwood et al. 2004; Cosgrove 2007). Cosgrove (2007) heralds the arrival of man on Sahul at ~ 12,000 years B.P. as definitive evidence for the first trans-water way excursions by boat. To further the study of ‘late’ human migrations, attempts to correlate climatic and evolutionary patterns in the British Isles with those of Asia and Sahul, where major sea-level fluctuations are also recorded, should be undertaken.

Conclusion

The fluctuating climate and geography that characterised the environment of Pleistocene Europe provides a framework with which to assess the trajectory of mammalian evolution across the British Isles. The oxygen isotope and terrestrial pollen records of the Pleistocene can be compared in order to temporally correlate the retreat and advance of Northern Hemisphere glaciations with shifting habitats on land. The contrasting faunal assemblages and human remains, in terms of temporal and spatial distribution and diversity, found thus far in Pleistocene Britain and Ireland, offer insight into the physical and biological constraints on mammalian migratory patterns. This allows comparisons to be made with the course of Asian and American mammalian migrations and evolution during the Pleistocene.

Despite the destructive capacity of glaciers, several well-preserved and diverse mammalian faunal and human artefact sites in Britain, dating to ~ 500,000 years B.P., have been comprehensively studied, with ample mammal catalogues presented for each of the investigation sites. Ireland hosts several, and in some cases equally diverse, faunal sites dating to ~ 120,000 years B.P. The principal difference across the British Isles’ prehistory is the absence of human remains, in archaeological sites thus far unearthed, in Pleistocene Ireland. This fact, or ‘artefact,’ is particularly puzzling in light of there being such a rich and diverse fauna in Late Pleistocene Ireland analogous to the major fauna of Britain with which human remains are found in intimacy, as results of collated data in this account support. The fact that nearby Britain lay populated since just under half of a million years earlier suggests that, at least during ~ 500,000 – 120,000 years B.P., Ireland was not suitable for mammalian habitation.

There is evidence that the patterns of Pleistocene Britain and Ireland’s mammalian migrations indicate cyclical colonisation/re-colonisation patterns in response to changing environmental conditions dictated by the Northern Hemisphere Heinrich events, which were characterised by both temporally and spatially alternated pulses of ice-sheet advance and retreat. In studying the palaeogeographical and palaeoclimatological complexities that such a varying landscape introduces to mammalian environments, collated data in this account show that it is essential to avoid describing the British Isles in terms of a single, British-Irish Ice Sheet when assessing mammalian migration. Such a generalization neglects the interplay of varying ice thicknesses, sea-levels, land exposure and uplift that differentially affected the particular land masses within the British Isles. Given that the environmental conditions in Ireland between ~ 120,000 – 9,000 years B.P. were suitable for a wide range of mammalian fauna, it is plausible to suggest that human occupation was hindered by inherent biological constraints. In particular, there being evidence that human settlements in Pleistocene Britain were discrete and intermittent, it may have been that population densities were insufficient to promote dispersal into Ireland.



ECONOMICS PANEL

•

Judging panel Prof. Niamh Brennan (University College Dublin) – Chair Prof. Frances Ruane (ESRI) Mr. Constantin Gurdgiev (Business&Finance Magazine) Mr. Donal de Buitleir (AIB) Mr. Austin Hughes (IIB Bank) Prof. Colm Harmon (University College Dublin) Judges’ comments While fixed income analysts use the yield curve to understand conditions in financial markets, economists use it to comprehend economic conditions. This term spread helps to forecast recessions and changes in real economic activity. Unlike most applications, the direction of the change in the spread is not indicative of the duration or the strength of the following recession. The best forecast of future real activity is provided by the level of the term spread, not the change in the spread, nor even the source of the change in the spread. This paper is concerned with in-sample prediction. Reliable quarterly data from the ECB with no gaps were used with residual analysis to justify the model. It was shown that including the spread of the previous year is sufficient. The results fit in with the stylised observation that the term spread has lost its predictive ability as a predictor of the growth in real GDP since the great Moderation and since the EMU was established. This paper shows that it is important to divide the sample into periods corresponding to different monetary regimes. The sale of human organs, whether from a cadaver or a living donor, is proscribed in virtually every country in the world. There is a persistent shortfall between the number of organs available and the demand for them. This is an intriguing economic research question because it is a significant health-care problem for which a market solution already exists. This report explores whether allowing a market for kidneys is economically efficient. The economics of kidney transplants is explored through a brief analysis of the current procurement approach and a critical appraisal of suggested market solutions. 
Transplant professionals are increasingly evaluating patients for transplantation who are more likely to die than receive a donor organ, while potential recipients cling to the hope that they may beat the odds and receive an organ while they are still healthy enough to undergo surgery. Economic solutions such as allowing a market for kidneys must be considered; the argument for blanket prohibition is flawed and should be rejected.


ECONOMICS

An econometric investigation into whether the term spread helps to explain the dynamics of GDP growth in the euro area.

Michael Curran

I. Introduction

A good predictor is a variable that helps to forecast another variable. While fixed income analysts use the yield curve (the difference between long and short yields) to understand conditions in financial markets, economists use it to comprehend economic conditions. This term spread helps to forecast recessions and changes in real economic activity since it incorporates expected changes in rates; it is therefore an indicator of future changes in real activity. Unlike most applications, the direction of the change in the spread is not indicative of the duration or the strength of the following recession, however: ‘The best forecast of future real activity is provided by the level of the term spread, not the change in the spread, nor even the source of the change in the spread.’ (Estrella, 2005: 6, my emphasis)

The literature suggests that the yield curve Granger causes output growth;1 however, this breaks down after the mid 1980s in the US, corresponding to Greenspan’s term as chairman of the Fed, i.e. during the past twenty years, in what has been referred to as the ‘Great Moderation’, or the ‘maestro’ years. The spread does not Granger cause real GDP growth post-EMU, while it does pre-EMU, which may be explained by an update of information in the information sets used to form private expectations, i.e. the information contained in the yield curve.

1 There are few empirical applications that test Granger Causality with more than two variables.

I.I A Brief History of the Literature

Prelude from Estrella (2005)

While research as it is presently recognised on the difference between long- and short-term interest rates originated in the late 1980s, the history of the subject dates back at least to 1913 with Mitchell, according to Estrella (2005). Kessel (1965) was the first to refer to the behaviour of the term spread, albeit in prose rather than in quantitative terms. Using data dating back to 1858, he showed that the term structure turned negative at peaks in the business cycle. This difference or term spread, which is the slope of the yield curve, was seen to be inversely proportional to real economic activity, as investigated by Laurent (1988), who used the term spread to predict GNP growth. Furlong (1989) was the first to look at the term spread between 10-year and 3-month interest rates. Estrella & Hardouvelis (1989, 1990, 1991) regressed real GNP growth on the spread between 10-year and 3-month interest rates. Reviewing data from 1870-2003 on Germany, Baltzer & Kling (2005) conclude that predictability changes intertemporally and is related to monetary policy. The robustness of this relation was considered by Stock & Watson (2003), who emphasised the necessity to search for changes in parameter values arising from changes in monetary policy regimes. Recent studies suggest that the relationship may have undergone some structural changes, especially with regard to parameter values.
There are problems, however, in using the Chow test since the assumptions on which it is based are violated for AR regressions.2 If one does not want to specify the point at which the structural break in the underlying relationship may have occurred, there are alternative methods such as the recursive residual test. With time series data the recursive estimation is appealing due to the property of time series, viz. that it gives a unique order to data (Johnston & DiNardo, 1997: 118). Models with qualitative dependent variables appear to be more robust to shifts in policy or other economic conditions (Estrella, Rodrigues & Schick, 2003). Binary variable models have the same accuracy as continuous variable models; however, the results are dependent on the sample period, especially for continuous variable models; binary models are more robust with respect to recessions. In many circumstances, the yield curve is the best indicator; ‘at predictive horizons beyond one quarter, there is no match for the term structure as a predictor of recessions.’ (Estrella, 2005: 8)

2 It assumes that the two error terms (one for each equation) $u_{1t}$ and $u_{2t}$ are independently distributed and that error variances in the regression must be the same.

With a tightening of monetary policy, a rise in the short term interest rates helps reduce inflationary pressures; the market expects a subsequent easing of monetary policy. The fact that expected future short-term rates are important determinants of current long-term rates induces an increase in long-term interest rates. Though monetary policy is not the only determinant, albeit an important one, of the predictive power of the yield curve for future real GDP growth, and while the Expectations Hypothesis is rejected in econometric tests, interest expectations play an important role in their relation with future real demand for credit and future inflation.3 Far from being a ‘passing phenomenon’, the predictive power of the yield curve may not have disappeared, but rather the values of the parameters may have changed. According to theory, there is a persistent predictive relationship between term spreads and future growth in real GDP, while the exact parameter values might change over time. As Estrella concludes, ‘although yield curve inversions may not be followed by recessions as a matter of universal mathematical principle, they should definitely raise warning flags about future output growth.’ (2005: 10)4
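To make the notion of a yield curve inversion concrete, the following is a minimal sketch that computes the term spread as the long yield minus the short yield and flags the quarters in which it is negative. The yields are hypothetical numbers chosen for illustration, and the function names are assumptions of this sketch rather than anything used in the paper.

```python
def term_spread(long_yield, short_yield):
    """Term spread: long-term yield minus short-term yield (both in percent)."""
    return long_yield - short_yield

def inverted_quarters(long_yields, short_yields):
    """Indices of the quarters in which the yield curve is inverted (spread below zero)."""
    return [i for i, (lng, sht) in enumerate(zip(long_yields, short_yields))
            if term_spread(lng, sht) < 0]

# Hypothetical quarterly yields in percent (illustrative only)
ten_year = [5.0, 4.8, 4.5, 4.2]
three_month = [3.0, 4.0, 4.6, 4.9]

print(inverted_quarters(ten_year, three_month))  # [2, 3]
```

In this toy series monetary tightening lifts the short rate above the long rate in the last two quarters, which is the kind of signal Estrella describes as a warning flag for future output growth.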

Bordo & Haubrich (2004)5

Like Baltzer and Kling (2007), Bordo & Haubrich find that a broader historical perspective is helpful in discovering the reasons behind the ability of the yield curve to predict future output. Perceptively, they make the point that recent work treats ‘the spread as one variable in a regression designed to predict future output’, citing the equation:

$$DY_{t+4} = \alpha + \beta\,\mathrm{spread}_t + \gamma(L)\,DY_t \qquad [1.1]$$

where $DY_t$ is the annual growth rate of real GNP (at a quarterly frequency) and $\gamma(L)$ is a lag polynomial of length four (current and three lags). They find that predictability changes both intertemporally and between different monetary regimes: low-credibility, high-inflation-persistence regimes have better predictability.6 Similar to the pre-EMU period, Bordo & Haubrich find that the sub-period before the Federal Reserve was established (1913) had the highest predictive ability; the term spread (before 1985) did better than the lags of GDP alone, particularly in the post-Bretton Woods period between 1971 and 1984. The predictive content of the term spread may be investigated by conducting a Granger causality test; the problem is how to know when the difference between the mean square errors is not due to chance alone; also, the methods adopted do not indicate how the coefficients change over time or regimes.

3 Other important determinants include the risk of liquidity premia.
4 The feature of negative term spreads anticipating recessions is still reliable.
5 Intriguingly, Bordo & Haubrich include a passage reminiscent of Hume’s automatic price-specie flow mechanism, only in modern monetary form, in order to illustrate the effect of inflation on prices and interest rates.
6 Bordo & Schwartz define a monetary regime: ‘as a set of monetary arrangements and institutions accompanied by a set of expectations … expectations by the public with respect to policymaker’s actions and expectations by policymakers about the public’s reaction.’ (1999: 152)

With a credible commitment to price stability, long-run expectations of inflation are approximately zero. Shocks can be real or inflationary;7 real shocks are taken as temporary, and like that of temporary inflation, have different effects on short-term and long-term interest rates (the expected inflation effect is assumed to dominate the liquidity effect with inflationary shocks). While the liquidity effect tends to lead to a steeper yield curve, only short-term interest rates rise due to the inflationary shock and so eventually the yield curve will become flatter; likewise, a real shock flattens the yield curve and may even invert it. As an inversion could be due to either real or inflationary shocks, the yield curve becomes a noisy signal under credible regimes. In a fiat money standard without credibility, inflation can be seen to follow a random walk, which is very persistent, so expectations are formed towards a persistently higher rate of inflation; short and long rates increase, leaving the term spread unchanged. This contrasts with a temporary real shock, which tends to flatten the yield structure towards an inversion. The yield curve is a better predictor in less credible regimes because nominal shocks will raise short and long interest rates, leaving the term spread unaffected, whereas under more credible regimes only short rates change and so the signal from the yield curve becomes noisy, since both real and inflationary shocks affect the yield curve.
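The argument about credible and non-credible regimes can be summarised in a small stylised sketch. It assumes, purely for illustration, that every shock raises the short rate by one unit and that the long rate moves one-for-one only when an inflationary shock is expected to persist (the non-credible case); the function names and unit shock sizes are hypothetical simplifications, not part of Bordo & Haubrich's model.

```python
def rate_response(regime, shock, size=1.0):
    """Stylised (change in short rate, change in long rate) after a shock."""
    if regime not in ("credible", "non-credible"):
        raise ValueError("unknown regime")
    if shock not in ("real", "inflationary"):
        raise ValueError("unknown shock")
    d_short = size
    # The long rate moves only when an inflationary shock is expected to persist.
    persistent = regime == "non-credible" and shock == "inflationary"
    d_long = size if persistent else 0.0
    return d_short, d_long

def spread_change(regime, shock, size=1.0):
    """Change in the term spread (long minus short) implied by the stylised response."""
    d_short, d_long = rate_response(regime, shock, size)
    return d_long - d_short

for regime in ("credible", "non-credible"):
    for shock in ("real", "inflationary"):
        print(regime, shock, spread_change(regime, shock))
```

Under the non-credible regime only the real shock moves the spread, so a flattening identifies a real shock; under the credible regime both shocks flatten the curve by the same amount, which is why the signal is described as noisy.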
In either regime (credible or non-credible), temporary inflation does not affect long-term interest rates, a result deriving from the expectations hypothesis combined with the Fisher equation:

$$i_t = r_t + p^e_t \qquad [1.2]$$

where $i$ is the nominal interest rate, $r$ is the real rate and $p^e$ is the expected inflation rate. After the demise of the Bretton Woods international monetary system, the fiat money regime succeeded, with a decade of high inflation (the OPEC oil price crisis had a part to play);8 my present paper covers the Euro-area since January 1970. Increasing persistency of inflation is taken as evidence of a regime shift after Bretton Woods by Bordo & Haubrich. The persistence of inflation is inversely proportional to credibility.9 This subject is perfectly suited to Bayesian econometrics, which Cogley & Sargent (2001) applied to a VAR, finding that persistence varies over time, being high in the 1970s and 1980s and lower since then. Bordo & Haubrich thus argue that persistence of inflation ought to be a key determinant of the predictive content of the term spread. Like great inflation regimes, the pre-EMU period witnessed very persistent inflation and good predictive power. My approach is justified by Bordo & Haubrich: “A somewhat different approach to uncovering the effect on predictability would be to correlate the two at a quarterly frequency. At least initially this gives a more continuous variable than shifts in regime and does not require assuming particular dates for regime changes” (2004: 27)

7 An inflationary shock may occur as a monetary authority’s response to unemployment, for example.
8 Bordo & Schwartz (1997) poured vitriol over policymakers’ ‘use of inappropriate policy models and indicators’, evocative of statements made by centuries of previous economists about the monetary policy of John Law and his Mississippi system.
9 I take exception to treating low credibility and high persistence as synonymous since it may attribute causation to where there is only correlation: ‘persistence can provide a quantitative way to distinguish policy regimes and assess how the predictive content of the yield differs across them.’ (Bordo & Haubrich, 2004: 24) A diligent worker may not reach his productive target, nor a hardworking student achieve high marks, due to bad luck, in the same way that a central bank may fail to meet its target even if the public believes in what the central bank says it wants to achieve ex-ante, i.e. even if it has credibility.

Fellner’s (1976, 1979) introduction of credibility to the lingua of economists was incorporated into the stance that the real costs of disinflation may be lower than expected. However, the New Classical Macroeconomic contention that there can be ‘gain without pain’ may be turned on its head by referring to the UK, which witnessed a fall in its inflation from 13.5% (1979) to 4.6% (1983) at the cost of a rise of unemployment from 5% to 12.4% in response to the massive monetary contraction – was there a credibility gap? The ECB is the most independent central bank in the world (Salvatore, 2002). Post-EMU, there has been a continued pursuit of low inflation and an emphasis on transparency. This ‘echoes the convertibility system of the old gold standard.’ (Bordo & Haubrich, 2004: 23) Bordo & Haubrich conclude that the yield curve has predictive power for future growth over the past 125 years and is robust (across various specifications); low credibility (high persistence) monetary regimes tend to have better predictability. Their conclusion supports the belief that the monetary regime is vital in interpreting the yield curve. It is ‘misleading to draw general conclusions from data generated in one inflation regime.’ (2004: 28) They articulately and astutely sell their theoretical and empirical results to policymakers with the witty phrase, viz. ‘credibility is not always an unmixed blessing.’ By this, they mean that following Friedman (1968), monetary policy can be made less a source of instability through credibility, but that credibility itself may create problems for policymaking since the information content from the yield curve source diminishes with credibility. Importantly, they view an historical perspective as beneficial in illuminating the inflationary and real effects of different monetary regimes.



Baltzer & Kling (2007)
Using Bordo and Haubrich’s model with their own data, Baltzer & Kling (2007) inversely link the credibility of monetary regimes with the predictive power of the yield curve. My paper compares the impact of different monetary regimes on the predictability of economic growth over a sufficiently long sample, although it uses a standard multiple regression instead of a structural VAR model. Like my paper, they regress economic growth rates ($\Delta Y_t$) on lags of spreads, $\beta(L)\,\mathrm{spread}_t$ (where $L$ is the lag operator); unlike it, they apply maximum likelihood to their equation:

$$\Delta Y_{t+4} = \alpha + \beta(L)\,\mathrm{spread}_t + \gamma(L)\,\Delta Y_t + \varepsilon_t$$ [1.3]¹⁰

Peel & Ioannidis (2003) make a comparison similar to that of Baltzer & Kling, except that they focus on a monetary policy reaction function, which shows that when central banks focus solely on inflation (forecast) targeting, the relationship between economic growth and spreads breaks down. Under non-credible regimes (pre-EMU), which are linked with high inflation persistence, an inflationary shock raises both short-term and long-term rates, so that there is no effect on spreads; Baltzer and Kling treat real shocks as temporary, so only the short-term interest rate is affected and therefore the term spread changes; thus, under a non-credible regime, only real shocks affect term spreads. Under a credible regime (post-EMU), inflation is perceived to be temporary, and so both inflation and real shocks affect short-term interest rates only, and hence spreads; observing spreads becomes a ‘noisy signal’, since changes in spreads may be driven by either real economic shocks or inflationary shocks; as a result, under a credible monetary regime, spreads have low predictive power. As in the post-EMU period, spreads had low predictive power during the highly credible gold standard period, as opposed to the interwar period (with an $R^2$ of 0.62), which resembled the free float period pre-EMU. Baltzer & Kling conclude that spreads are useful in improving predictability after 1971.

II. Data Description
In contrast to former studies, this paper looks at real GDP growth as opposed to that of GNP in the Euro-area, since GDP is available and consistent during the time frame under investigation (1970 – 2006); other papers focus mainly on real GNP growth, e.g. Laurent (1988, 1989). Real GDP growth versus the 10-year minus 3-month interest rate is a frequently used measure.¹¹ 148 quarterly observations were taken from the ECB on real GDP, the ten-year interest rate and the three-month interest rate, spanning from the first quarter of 1970 to the final quarter of 2006.¹² However, as the investigation concerned the annual growth in real GDP,¹³ the first observation for the term spread had to be omitted to make both the regressors and the regressand dimensionally compatible; also, since the focus is on lags of the growth rate, another observation is lost – a total of two observations are lost for any sample period; as a result, 146 observations of the growth rate in real GDP and the term spread were used for the full sample.¹⁴ The sample statistics for the data are reported in table 2.1. Figure 2.1 plots real GDP, which is clearly nonstationary as its mean rises over time; the growth rate of real GDP appears to be stationary from figure 2.4 (although the variance seems to be smaller after 1997 with the advent of the EMU) – logarithmic transformations reduce the variance, while the differences remove any deterministic trends;¹⁵ figure 2.2 plots the ten-year versus the three-month interest rate; and figure 2.3 plots the spread between the two (i.e. the difference).

10 This is merely the statistical version of the mathematical equation [1.1].
11 As opposed to forecasting inflation, where forecasting horizons are matched, interest rates with maturities that are far apart should be used (and are used) in forecasting real activity; the 10-year interest rate is chosen, as on a consistent basis, it is the longest maturity available in most countries over a considerable sample period.
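As a rough illustration of the data preparation described above, the following Python sketch computes the annualised quarterly log growth rate (the formula given in footnote 13) and the term spread, and then aligns the two series. The function names and the sample figures are hypothetical; they are not the ECB series used in the paper.

```python
import math

def annualised_growth(gdp):
    """Annualised quarterly log growth in percent.

    Each entry is [log(Y_{t+1}) - log(Y_t)] * 100 * 4, so the result has
    one observation fewer than the input series.
    """
    return [(math.log(nxt) - math.log(cur)) * 100 * 4
            for cur, nxt in zip(gdp, gdp[1:])]

def term_spread(long_rate, short_rate):
    """Term spread: ten-year rate minus three-month rate, in percentage points."""
    return [l - s for l, s in zip(long_rate, short_rate)]

# Hypothetical quarterly figures, for illustration only.
gdp = [1000.0, 1005.0, 1012.0, 1010.0]
ten_year = [5.0, 5.2, 5.1, 4.9]
three_month = [4.0, 4.5, 4.8, 5.0]

growth = annualised_growth(gdp)
# Drop the first spread observation so both series cover the same quarters.
spread = term_spread(ten_year, three_month)[1:]
```

Differencing the logs costs one observation, which is why the first spread observation is dropped; taking a first lag of the growth rate in the regression then costs one more.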

Variable          Real GDP (€)      10-yr (%)   3-m (%)   Term Spread (%)   Real GDP growth (%)
Maximum           1,763,745.4164    15.2246     16.1147    3.0669            7.6278
Minimum             739,110.1636     3.2624      2.0627   -2.3538           -6.424
Mean              1,216,702.1976     8.5295      7.6878    0.8478            2.3947
Std. Deviation      290,403.8425     2.9306      3.4858    1.1104            2.2716

Table 2.1. Summary Statistics for the data used for Full Sample Period

III. Empirical Analysis
This paper uses a formal statistical model in terms of a linear regression with continuous variables, instead of non-linear statistical equations, or those such as the logistic Generalised Linear Model for binary data. An advantage of this formal approach is that it quantifies guidelines regarding the responsiveness of real GDP growth to changes in the term structure and provides specific lead-lag relationships. Lagged effects are included in order to distinguish between changes in market expectations of economic fundamentals (in which case the change in spread may be persistent) and temporary disequilibria in markets for interest-bearing assets (driving high frequency changes in spreads); interest rates may react contemporaneously to news, whereas GDP does not react as fast due to friction; the ECB raises interest rates when inflation is high, but the effect on the economy will be delayed, with a lag of 9 months on average. One statistical issue is overlapping observations. Using quarterly data to estimate real GDP growth implies that two consecutive observations of real GDP growth tally with time periods having three quarters in common. In such circumstances, measures of statistical significance are rendered inconsistent, so must be replaced, for example, by generalised method of moments (Hansen, 1982). The above discussion suggests the following regression:

$$y_t = c + \sum_{i=1}^{p} \rho_i y_{t-i} + \sum_{i=1}^{q} \gamma_i x_{t-i} + \varepsilon_t$$ [3.1]

where $y_t$ is the growth rate of real GDP; $x_t$ denotes the term spread; $p$ and $q$ are the number of lags of $y_t$ and $x_t$, respectively; and $\varepsilon_t$ is assumed to be a Normally distributed error term with constant variance and zero mean.

12 If the interest rate chosen is one controlled by the central bank, it will not fully reflect the expectations of participants in financial markets; in the US, the usefulness of the short-term Federal funds rate as a provider of information on market expectations is diminished by the Federal Reserve’s control over it.
13 Calculated by multiplying the quarterly growth by 4: $[\log(Y_{t+1}) - \log(Y_t)] \times 100 \times 4$, where $Y_t$ is the real GDP at time $t$ ($t+1$ is the next quarter).
14 The RHS observations run from 1970Q3 until 2006Q4 – in order to take account of first order lags of the growth rate of real GDP.
15 The real GDP time series is not a purely deterministic function, but rather it is a linear combination of functions; it comprises two waves: a high-frequency, short-period wave and a low-frequency, long-period wave; differencing eliminates the long-run component.
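To make the structure of regression [3.1] concrete, here is a minimal ordinary least squares sketch in plain Python. It is not the software used for the paper's estimates: the helper names `solve` and `fit_lagged_ols` are hypothetical, and the data are synthetic and noise-free, generated so that the true coefficients are known.

```python
import math

def solve(a, b):
    """Solve the linear system a z = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        # Partial pivoting: bring the largest entry in this column to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [v - f * w for v, w in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def fit_lagged_ols(y, x, p=1, q=1):
    """OLS estimates [c, rho_1..rho_p, gamma_1..gamma_q] for equation [3.1]."""
    k = max(p, q)  # observations lost to lagging
    rows = [[1.0]
            + [y[t - i] for i in range(1, p + 1)]
            + [x[t - i] for i in range(1, q + 1)]
            for t in range(k, len(y))]
    target = y[k:]
    ncol = 1 + p + q
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(ncol)]
           for i in range(ncol)]
    xty = [sum(r[i] * v for r, v in zip(rows, target)) for i in range(ncol)]
    return solve(xtx, xty)

# Synthetic series built from y_t = 1 + 0.5 y_{t-1} + 2 x_{t-1} (no error term).
x = [math.sin(i) for i in range(40)]
y = [0.0]
for t in range(1, 40):
    y.append(1.0 + 0.5 * y[t - 1] + 2.0 * x[t - 1])

c, rho_1, gamma_1 = fit_lagged_ols(y, x, p=1, q=1)
```

With p = 1 and q = 1 the design matrix has three columns (constant, lagged growth, lagged spread), and one observation is lost to the lag.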

Whole-sample analysis
The optimal number of lags determined from the Schwarz Bayesian criterion (BIC) was one for both the growth in real GDP and the term spread; the BIC corresponding to the regression for p = 1, q = 1 is 1.5023, which is the minimum value of the BIC for any number of lags. So, a regression was carried out on the equation:

$$y_t = c + \rho_1 y_{t-1} + \gamma_1 x_{t-1} + \varepsilon_t$$ [3.1a]

Figures 3.1.1 and 3.1.2 are histograms of actual (raw) residuals and Studentised residuals, respectively, for the full sample. Importantly, while the bin size is large, it can be seen that the residuals are approximately normally distributed (the histogram of Studentised residuals is bell-shaped with about 95% of the area contained within ±2).¹⁶ Figures 3.1.3 and 3.1.4 plot actual and Studentised residuals, respectively. The variance is smaller for the Studentised residuals and for the period after 1997, so it would be better to look at the two samples individually, as I have done in section 3.2; but since there are only seven outliers (outside ±2) from 146 (<5%), it seems that the assumption of normally distributed error terms of constant variance and zero mean is justified. The ith standardised residual is defined as:

$$e_i = \frac{r_i}{\hat{\sigma}\sqrt{1 - h_{ii}}}$$ [3.2]

where $r_i$ is the ith raw residual, $h_{ii}$ is the ith diagonal element of the projection matrix $H = X(X^T X)^{-1}X^T$, and the estimate of $\sigma^2$ is:

$$\hat{\sigma}^2 = \frac{RSS}{n - p}$$ [3.3]

where n is the number of observations and p is the number of parameters.

16 Histograms will underestimate and overestimate the area to the left and right of the centre (zero), respectively.
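Equations [3.2] and [3.3] can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the helper names are hypothetical, the data are made up (a constant and one regressor), and the leverage $h_{ii}$ is obtained by solving $(X^T X)z = x_i$ rather than by forming the full projection matrix.

```python
import math

def solve(a, b):
    """Solve the linear system a z = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [v - f * w for v, w in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def studentised_residuals(X, y):
    """Return (e, h): residuals scaled as in [3.2] and the leverages h_ii."""
    n, k = len(X), len(X[0])
    xtx = [[sum(r[i] * r[j] for r in X) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * v for r, v in zip(X, y)) for i in range(k)]
    beta = solve(xtx, xty)
    raw = [v - sum(b * c for b, c in zip(beta, r)) for r, v in zip(X, y)]
    sigma2 = sum(r * r for r in raw) / (n - k)  # equation [3.3]
    # h_ii = x_i' (X'X)^{-1} x_i, computed by solving (X'X) z = x_i.
    h = [sum(c * z for c, z in zip(r, solve(xtx, r))) for r in X]
    e = [r / math.sqrt(sigma2 * (1 - hi)) for r, hi in zip(raw, h)]
    return e, h

# Hypothetical data: a constant and one regressor.
X = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, 3.0]]
y = [0.0, 1.1, 1.9, 3.2]
e, h = studentised_residuals(X, y)
outliers = [i for i, v in enumerate(e) if abs(v) > 2]  # the ±2 rule used in the text
```

The leverages sum to the number of parameters, which gives a quick sanity check on the calculation.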

Regressor       Coefficient   Standard Error   T-Ratio   [Prob]
CONSTANT (c)    1.3365        0.2540           5.2614    [.0000]
$y_{t-1}$       0.1943        0.0810           2.3981    [.0178]
$x_{t-1}$       0.6752        0.1650           4.0933    [.0000]

Statistic
R-Squared               0.1996
R-Bar-Squared           0.1884
BIC                     1.5023
Durbin’s h Statistic    0.5507    [.5818]
F-Statistic F(1,143)    15.000    [.0002]

Table 3.1. Results of Regression on [3.1a] for the full sample period¹⁷

The positive intercept (1.3365) may be interpreted as the extra positive growth in real GDP after the first lags of $y_t$ and $x_t$ (i.e. $y_{t-1}$ and $x_{t-1}$) have been taken into consideration – an extra 1.3365% to be added to the growth of real GDP. The coefficient on $y_{t-1}$ implies that 19.43% of the annual growth of real GDP in the previous quarter (t-1) is added to the annual growth of real GDP (t); that on $x_{t-1}$ implies that 67.52% of the term spread in the previous period is added to the annual growth of real GDP (t). The p-values for $c$ and $\gamma_1$ are far less than 0.01 and so these coefficients are statistically significantly different from zero. At a 5% significance level, all the estimated coefficients are statistically significant; however, at the 1% significance level (α = 0.01), if the rigid Frequentist interpretation of the p-value in the style of Neyman-Pearson is adopted, then $\rho_1$ is not statistically significant. Fisherians, however, may interpret the p-value of $\rho_1$ (0.0178) as being close to 1%, and so it would be less straightforward to decide on the basis of this value whether the null hypothesis should be rejected – a more Fisherian approach may suggest repeating the experiment – further investigation may be beneficial. I take α = 5% and reject the null hypothesis; the conclusion is that the intercept is significantly positive, and that both the first lag of the growth rate of real GDP and the first lag of the term spread are helpful in explaining the current growth in real GDP – all coefficients are significantly positive. 95% confidence intervals for the estimates are more informative. The 95% confidence interval for a coefficient $b$ is:

$$\left(\hat{b} - t_{0.025,143}\,SE(\hat{b}),\ \hat{b} + t_{0.025,143}\,SE(\hat{b})\right)$$ ¹⁸

95% confidence interval for $c$: (0.8285, 1.8445)
95% confidence interval for $\rho_1$: (0.0323, 0.3563)
95% confidence interval for $\gamma_1$: (0.3452, 1.0052)

17 Although not presented due to space constraints, the Durbin-Watson (DW) d statistic is approximately 2 for the whole sample (and for the individual samples in section 3.2) – ‘there is built-in bias against discovering (first-order) serial correlation.’ (Gujarati, 2003: 690) However, since this model is a modified autoregressive model, the d statistic is not appropriate to test for serial correlation in the data; instead, Durbin’s h statistic is used to detect autocorrelation between the errors. (Durbin, 1970)
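The three intervals can be reproduced from the coefficients and standard errors in Table 3.1, using the approximation of the critical t value as 2 that the paper adopts in its footnote. The short Python sketch below does this; the function and variable names are hypothetical.

```python
def conf_interval(estimate, std_error, t_crit=2.0):
    """Two-sided interval: estimate ± t_crit * std_error, rounded to 4 places."""
    half_width = t_crit * std_error
    return (round(estimate - half_width, 4), round(estimate + half_width, 4))

# (coefficient, standard error) pairs as reported in Table 3.1.
table_3_1 = {
    "c": (1.3365, 0.2540),
    "rho_1": (0.1943, 0.0810),
    "gamma_1": (0.6752, 0.1650),
}

intervals = {name: conf_interval(b, se) for name, (b, se) in table_3_1.items()}
all_exclude_zero = all(lo > 0 or hi < 0 for lo, hi in intervals.values())
```

An interval that excludes zero corresponds to rejecting the null hypothesis of a zero coefficient at the 5% level.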

None of these intervals contains zero and so all estimates are significantly different from zero (in fact significantly positive, since they all lie to the right of zero on the number line). Derived in the appendix, the h statistic (0.5507, which is within ±3) asymptotically has a standard normal distribution under the null hypothesis, and so the probability of obtaining this value is not very small. The conclusion is that there is no evidence of positive or negative autocorrelation. The multiple coefficient of determination, the $R^2$ value (0.1996), suggests that the predictors – first lag of $y_t$ and first lag of the term spread – explain 19.96% of the variation in the growth of real GDP; this compares with 0.1058 for a model without the term spread. The higher $R^2$ value should not be surprising as there are more variables in the unrestricted model. However, the $R^2$ is not a good statistic; it is a non-decreasing function of the number of explanatory variables in the model and has no known distribution. The adjusted $R^2$ ($\bar{R}^2$) of 0.1884 takes into account the number of predictors in the model, being lower than the $R^2$ as expected.¹⁹ To compare the above model with a simpler one without the lag of the term spread (in order to justify adding the term spread to the model), the restricted²⁰ F-test was carried out, since there is a common intercept:

$$F = \frac{(R^2_{UR} - R^2_{R})/(\text{difference in number of regressors})}{(1 - R^2_{UR})/df_{UR}} = \frac{(.1996 - .1058)/1}{(1 - .1058)/(146 - 3)}$$ [3.4]

where UR refers to the unrestricted model (i.e. equation [3.1a]) and R refers to the restricted model ([3.1a] without the lag of the term spread). The F value (15.0004) is highly significant since it is greater than the critical value $F_{0.05,1,143}$ (3.91); this is confirmed by the p-value (0.0002), which is much less than 1%. The F-test assesses the improvement of model [3.1a] over and above a model with only a constant and a lag of the growth of real GDP: it shows that adding the first lag of the term spread to an equation with only the constant term and the first lag of the growth in real GDP improves the results.

18 $t_{0.025,143} \approx 2$.
19 ‘… it is good practice to use $\bar{R}^2$ rather than $R^2$ because $R^2$ tends to give an overly optimistic picture of the fit of the regression, particularly when the number of explanatory variables is not very small compared with the number of observations.’ (Theil, 1978: 135)
20 The restricted model is $y_t = c + \rho_1 y_{t-1}$ – i.e. without the lag of the term spread.
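The arithmetic behind [3.4] can be checked with a short Python sketch; the function name is hypothetical. One point is worth flagging with some caution: the numerical substitution as printed places 1 − .1058 (the restricted $R^2$) in the denominator, which reproduces the reported 15.0004, whereas the formula as written uses 1 − $R^2_{UR}$ = 1 − .1996, which gives roughly 16.76 and is close to the square of the t-ratio on $x_{t-1}$ in Table 3.1 (4.0933² ≈ 16.76). Both values are well above the critical value of 3.91, so the conclusion is the same on either reading.

```python
def restricted_f(r2_unrestricted, r2_restricted, n_restrictions, df_unrestricted):
    """F statistic for comparing nested regressions from their R-squared values."""
    numerator = (r2_unrestricted - r2_restricted) / n_restrictions
    denominator = (1 - r2_unrestricted) / df_unrestricted
    return numerator / denominator

# Values reported in the text: R2 = .1996 (with spread), .1058 (without), n = 146, 3 parameters.
f_formula = restricted_f(0.1996, 0.1058, 1, 146 - 3)

# The substitution exactly as printed in [3.4], with 1 - .1058 in the denominator.
f_printed = ((0.1996 - 0.1058) / 1) / ((1 - 0.1058) / (146 - 3))

CRITICAL_VALUE = 3.91  # critical value quoted in the text
both_significant = f_formula > CRITICAL_VALUE and f_printed > CRITICAL_VALUE
```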

Sub-sample analysis
The results of splitting the previous estimation into two sub-samples, (i) pre-European Monetary Union (EMU) and (ii) post-EMU – i.e. from 1970Q3 until 1997Q4 and from 1998Q3 until 2006Q4, respectively – are reported in tables 3.2.1 and 3.2.2; the same equation [3.1a] is used. The optimal lag length is p = 1 and q = 1 for both, determined from the minimum value of the BIC.

Pre-EMU
Figures 3.2.1 and 3.2.2 plot the actual residuals and the Studentised residuals for this period. Less than 5% of Studentised residuals lie outside ±2 (4 from 110); this appears to suggest that the assumptions underlying the model for this data hold. The conclusion from the p-value accompanying the h statistic (0.1768) is that there is no evidence of positive or negative autocorrelation. The constant and the slope coefficient on the first lag of the term spread are statistically significant, although the multiple regression coefficient on the first lag of the growth rate $y_t$ is not. The $\bar{R}^2$ (0.2241) is higher than that of the regression for the whole sample conducted above in section 3.1, suggesting that this model has a better fit over the pre-EMU period than over the whole period. However, it is important to note that in comparing two models on the basis of the multiple coefficient of determination (or the adjusted $R^2$), the sample size (as well as the dependent variable) must be the same. The p-value accompanying the F-statistic shows that pre-EMU, model [3.1a] is an improvement over the same model without the lag of the term spread.

Regressor       Coefficient   Standard Error   T-Ratio   [Prob]
CONSTANT (c)    1.5966        0.2973           5.3700    [.0000]
$y_{t-1}$       0.0957        0.0944           1.0136    [.3131]
$x_{t-1}$       0.9057        0.1963           4.6140    [.0000]

Statistic
R-Squared               0.2384
R-Bar-Squared           0.2241
BIC                     1.6682
Durbin’s h Statistic    1.3508    [.1768]
F-Statistic F(1,107)    21.285    [.0000]

Table 3.2.1. Results of Regression on [3.1a] for pre-EMU sample with one lag (p = 1, q = 1)

Post-EMU
Figures 3.2.3 and 3.2.4 plot the actual residuals and the Studentised residuals for the post-EMU period. Just over 5% of Studentised residuals lie outside ±2 (2 from 34); this appears to suggest that the assumptions underlying the model for this data may not hold, although the closeness to 5% and the smaller sample size render this conclusion open to further investigation, especially from a Fisherian-Frequentist interpretation. Only the multiple regression coefficient on the first lag of the growth rate $y_t$ is statistically significant; that of the first lag of the term spread is non-significant. Durbin’s h statistic once again suggests that there is no serial correlation between the disturbances (its p-value is .7604). The adjusted $R^2$ (0.3693) is higher than that of the regression for the pre-EMU sample, suggesting that this model has a better fit over the post-EMU period than over the pre-EMU period, and therefore than over the whole period. However, this is contradicted by the p-value accompanying the F-statistic, which shows that post-EMU, model [3.1a] is not a significant improvement over the same model without the lag of the term spread; adding the first lag of the term spread does not significantly improve the model without the term spread.

Regressor       Coefficient   Standard Error   T-Ratio   [Prob]
CONSTANT (c)    1.0722        0.5378           1.9938    [.0550]
$y_{t-1}$       0.6316        0.1374           4.5983    [.0000]
$x_{t-1}$       -.2041        0.3171           -.6437    [.5245]

Statistic
R-Squared               0.4075
R-Bar-Squared           0.3693
BIC                     0.3738
Durbin’s h Statistic    -.3050    [.7604]
F-Statistic F(1,31)     0.4133    [.5250]

Table 3.2.2. Results of Regression on [3.1a] for post-EMU sample with one lag (p = 1, q = 1)
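Because Durbin’s h statistic is asymptotically standard normal under the null hypothesis, the bracketed probabilities reported alongside it for the three samples can be checked as two-sided normal p-values. The following Python sketch does so; the function name is hypothetical, and it assumes the bracketed values are two-sided p-values, which the figures appear to bear out.

```python
import math

def two_sided_p(z):
    """Two-sided p-value for a statistic that is standard normal under the null."""
    # 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2))

# Durbin's h statistics as reported for the full, pre-EMU and post-EMU samples.
h_statistics = {"full": 0.5507, "pre_emu": 1.3508, "post_emu": -0.3050}
p_values = {name: round(two_sided_p(h), 4) for name, h in h_statistics.items()}

# None of the three is below 5%, so the null of no serial correlation is not rejected.
no_autocorrelation = all(p > 0.05 for p in p_values.values())
```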

IV. Conclusion
There is a two-way causation between economic activity (growth of real GDP) and the term spread. My model might be misleading since in it the spread affects the growth rate of real GDP, but not vice-versa. For further research, like Baltzer & Kling, I would recommend a structural VAR model to take into account feedback mechanisms, imposing the ‘short run restriction that innovation in economic growth rates do not have an immediate impact on spreads.’ (Baltzer & Kling, 2007: 4) My paper is concerned with in-sample prediction.²¹ Reliable quarterly data from the ECB with no gaps were used, with residual analysis to justify the model. It was shown that including the spread of the previous quarter is sufficient. The results fit in with the stylised observation that the term spread has lost its ability to predict the growth in real GDP since the Great Moderation (post 1985 for the US) and since the EMU was established (for Europe). The cause in the US was the departure of Volcker and the presence of Greenspan, under whom volatility was reduced. The predictive ability of the term spread is better in times of more turbulence. The Euro-area has witnessed more stability since the EMU came into force in 1998 and, correspondingly, the ability of the term spread to forecast the growth in real GDP has diminished.²² Lucas’ surprise inflation, with the implied policy proposal that central banks should not be transparent, applied to the European economies pre-EMU, following the breakdown of Bretton Woods and James Callaghan’s speech to the UK Labour Party declaring obsolete the belief that a country could spend itself out of recession.

Even though the BOE may seem to have more credibility today than the ECB, the importance of credibility, proposed as early as 1976 by academics such as Fellner, took time to channel through into central bank policy, much like the evolution of the concept of conjectural equilibria from General Equilibrium theory (developed in the 1960s and 70s), which did not pass into game theory until 1993 in the form of self-confirming equilibria, and not into macroeconomics until recently, with the effort of Thomas Sargent and others. In conclusion, my paper shows that it is important to divide the sample into periods corresponding to different monetary regimes. It shows that the term spread improves the model’s ability to predict output growth. And it shows that credibility diminishes this ability. Credibility is a mixed blessing.

21 Bordo & Haubrich (2004) maintain that the out-of-sample predictive ability of spreads is lower.
22 The sample period ended in the third quarter of 2006, which marked the beginning of the current global recession. With volatility in the financial markets ebbing into the real economy, future studies could assess the possibility of a return of the yield curve’s predictability.





Economics

Should we allow a market for kidneys? An Economist’s consideration
M. Lorraine Chadwick

Abstract
The sale of human organs, whether from a cadaver or a living donor, is proscribed in virtually every country in the world, and has historically been viewed as unethical by the world’s medical associations. In almost every country around the world also, there is a persistent shortfall between the number of organs available and the demand for them, resulting in loss of life for some and diminished quality of health for others. This is an intriguing economic research question, because it is a significant health-care problem for which a market solution already exists. The debate as to whether or not we should allow a market for kidneys is a compelling one which necessitates further investigation. This report explores the term ‘should’ from an economic perspective - allowing a market for kidneys is economically the efficient thing to do. The term ‘should’ is also considered from an institutional perspective which takes the broader social context of allowing a market for kidneys into account. The economics of kidney transplants is explored through a brief analysis of the current procurement approach and a critical appraisal of suggested market solutions put forward by economists. Retorts to these solutions are also considered. The broader social and ethical considerations which surround this debate have also been explored, leading to my conclusion that this health care question has reached a critical juncture. Transplant professionals are increasingly evaluating patients for transplantation who are more likely to die than receive a donor organ, while potential recipients cling to the hope that they may beat the odds and receive an organ while they are still healthy enough to undergo surgery. Economic solutions such as allowing a market for kidneys must be considered; the argument for blanket prohibition is flawed and should be rejected.

I. Introduction
Shortages of commodities generally occur either through natural disasters or when governments impose a below-market price on a commodity (McMillan, 2002). The shortfall between the demand for organs and the number available for transplantation exists primarily because of a scarce supply of transplantable kidneys; a situation which has arisen, according to Kaserman (2005), because both the purchase and sale of kidneys are considered criminal acts in most countries around the world. It has been argued that creating an open or a regulated market for kidneys would help alleviate this shortage; Barnett (2004) suggests that creating a regulated market might lessen the activities of current illegal ones. However, ethical considerations make the decision to ‘commodify’ kidneys more challenging; a market for kidneys may offend the personal or religious beliefs of some or may deter the more altruistic in society from donation (Titmuss, 1972). This paper is an exploration of the economics of kidney transplants. It considers the current statistics relating to kidney transplants to elucidate the shortfall between supply and demand, the current procurement approach, suggested market solutions, retorts to these solutions and the broader social and ethical considerations which surround this debate. The crux of the discussion lies with the implications of the term ‘should’; section two considers the term from an economic perspective, with an emphasis on efficiency and incentives, while the third section considers the term from the ethical approach, with an emphasis on equality and fairness for potential donors and recipients.

II. The Economics of Kidney Transplants
Supply and Demand – The Statistics
In terms of kidney transplant lists, the statistics are startling; the graph¹ below illustrates the widening gap in the UK between those waiting for transplantation and the actual number of transplants which have taken place over a ten-year period from 1997 to 2006.

In the Republic of Ireland, statistics are similar; 1,482 people needed a kidney transplant as of 31 December 2006;² however, only 142 transplants were carried out in the same year. Not every kidney which becomes available for transplant can be used; some kidneys may be deemed unsuitable by transplant personnel due to disease or other medical factors. In Ireland, the gap between those waiting for transplantation and those actually receiving one is increasing, although the deceased donor rate per million population in the Republic of Ireland is 19.8, which compares favourably with the figure of 13.2 in the UK (Collett, 2007). The widening gap can be explained by the fact that kidney donor levels have remained static over the past 10 years but the numbers of people being diagnosed with kidney problems which require dialysis and transplantation are increasing.³

1 www.uktransplant.org.uk/ukt/statistics/calendar_year_statistics/kidney/kidney.jsp
2 http://www.ireland.com/newspaper/breaking/2007/0326/breaking47.htm

The Current Procurement Approach
Most donated organs⁴ in Ireland and the U.K. come from people who die while on life support, following a severe brain injury. Brain death, unlike a coma, is the complete and irreversible cessation of all brain function. Brain death usually occurs when a person receives a severe head injury, suffers a stroke or a brain haemorrhage, or any other event which deprives the brain of oxygen. In some countries organs are also taken from non-heart-beating donors (NHBDs).⁵ Organ donation is seen as a gift between the deceased donor and their family and the organ recipient. Kidneys can also be donated by living donors; in Ireland very few of these types of transplants have taken place due to constraints within the health service in terms of personnel and surgery facilities. With numbers on kidney transplant lists increasing and donor numbers remaining static, the reasons why the current approach of altruism is failing must be considered. According to Kaserman (2002), the issue of altruism and vested interests colliding within the current procurement system is one system failure which must be investigated. He contends that the current system of kidney procurement which has been adopted and maintained by most countries remains unaltered primarily due to the system’s impact on profits to physicians and hospitals. He asserts that: “The economic truth is that reliance on altruism at one stage of production can serve the purpose of greed at another. A legal restriction on the purchase and sale of transplantable organs is economically equivalent to the formation and maintenance of a cartel in the provision of transplant services. Therefore, the current policy and the shortage it creates enhance the overall profitability of transplant providers. Such profitability, in turn, ensures continuing political support for that policy” (Kaserman, 2002:91).

Expansion of Donor Criteria as a Supply-Side Solution
One medical solution currently employed to address the shortage of donor kidneys is the increasing acceptance of ‘marginal’ kidneys by transplant surgeons desperate to see their patients receive an organ. Donor criteria have gradually been expanded and kidneys are now being accepted from deceased donors who are geriatric, hypertensive and even proteinuric (Friedman, 2006). This situation is clearly not sustainable; transplanting ‘marginal’ kidneys will only lead to further medical problems for recipients, given that the kidney received was less than perfect at the point of transplantation. Many economists (Friedman, 2006; Barnett and Saliba, 2004; Becker and Elias, 2003) and medical professionals (Hippen, 2005) have concluded that in order to alleviate increasing levels of demand, supply-side solutions such as allowing a ‘market’ for kidneys must be considered.

3 www.uktransplant.org.uk
4 Irish Kidney Association (2007)
5 NHBDs are patients who have died from cardiac death, i.e. irreversible loss of heart and lung function.

The Market Approach
In terms of a market for kidneys, the law in most countries around the world currently bans the sale of human kidneys (Friedman, 2006). This effectively establishes a maximum legal price for kidneys of €0, which is known as a price ceiling. With the demand for kidneys increasing and supply remaining static,⁶ an obvious shortfall between supply and demand is created. Some economists (Becker and Elias, 2003; Kaserman, 2002) argue that this is a market failure which can be resolved by allowing a market for kidneys, which would increase the supply of transplantable kidneys, thereby ensuring that those on transplant waiting lists receive the kidney they need. The mechanisms of the varying market solutions put forward by these economists will now be explored to determine if a market for kidneys should be considered.

Incentives
The issue of monetary incentives for organ donation is one which has been explored by Becker and Elias (2003). They contend that the introduction of monetary incentives into a market in the U.S. for live and cadaveric organ donations might increase the supply of organs for transplant sufficiently to eliminate current transplant waiting lists, without increasing the total cost of surgery by more than 12%. Becker and Elias (2003) build their argument through ‘value of life’ literature together with econometric models and sensitivity analysis to approximate the equilibrium cost of live transplants for kidneys. They conclude that: “the supply curve in this market would start just slightly above the cost of surgery since some cadaver organs would be made available cheaply. A rise in price would induce more cadaver organs to be offered, and perhaps even a few from live donors. Eventually, the available organs from cadavers would run out, and the supply price would rise sharply to reach the main market for live donors. At that point the supply elasticity rises sharply because the potential live donor market is huge relative to demand” (Becker and Elias, 2003:24).

6 Transplant Newsletter (2007)



Computing the Equilibrium Price for a Kidney
Becker and Elias (2003:11) also “estimate the value or price of an organ from living donors by computing how much additional income or market consumption an individual will require in order to be indifferent between selling an organ or not. Following the value of life literature, the additional income required by an individual in order to be willing to sell his organ will be given by the change in the value of life induced by changes in health, or quality of life, mortality risk and full income.” Other factors are taken into account by both economists to compute what they determine is the total expected cost of donation per kidney. They have computed this cost to be $15,200 and the factors they considered in their computation include:
Monetary compensation for the risk of death
Monetary compensation for time lost during recovery
Monetary compensation for risk of reducing quality of life
(Becker and Elias, 2003:12).

Moreover, regardless of what the price of a kidney might end up being in a hypothetical market, it is the potential cost-savings achieved from a successful transplant compared to the long-term cost of dialysis that should form the crux of any economic debate in considering whether a market for kidneys should be allowed. Matas and Schnitzler (2004) have shown that in the U.S., a doubling of the number of organs procured from a market system would permit Medicare to pay $50,000 per kidney and still break even. This is an economic consideration which cannot be ignored in any economic debate surrounding a market for kidneys; the cost of a kidney transplant operation is usually a non-recurring cost while the cost of kidney dialysis per patient is a large recurring expense for health services to bear.

Cost of Queuing
A further economic cost which should be considered is the cost of queuing. Becker and Elias (2003) explore this cost and contend that allowing a market for kidneys can also be contextualized in terms of value to society, through the increase in the number of people who would be transplanted without prolonged waiting times, as waiting time increases the risk factor for those waiting on kidney transplants. Shorter waits have a clear societal or welfare benefit, since “perhaps the most egregious effect of organ shortage on those people who wait, is the suffering and the deterioration in their quality of life while waiting for an organ” (Becker and Elias, 2003:19); they also substantially reduce the recurring economic cost of dialysis per patient.


The Truncated Market
In exploring the financial and market processes that would evolve if a free market in kidneys were to be permitted, the types of institutions which would develop, how transactions would be facilitated, what the short-run and the long-run effect would be on the price for kidneys and how such a market would function, Barnett and Saliba’s (2004) paper was particularly insightful. They assert that “the demand for kidneys unlike the demand for most goods is truncated. That is, it is satiated at a positive price; the quantity demanded does not increase if the price falls below the satiation price” (Barnett and Saliba, 2004:39). They further elucidate this point by asserting that “as long as the government remains as the payer of last resort, the demand curve is actually truncated at a price greater than zero because the maximum quantity of kidneys demanded is equal to the number of people on the waiting list, and no more” (Barnett and Saliba, 2004: 39). This point buttresses the argument to consider a market for kidneys; future demand which would drive the cost of organs will always be substantially offset by the far greater supply of potential organ vendors.

Short-term Nature of the Market
In a similar vein to Becker and Elias (2003), the matter of price is also considered by Barnett and Saliba (2004). Both are cognizant of the fact that if a free market for kidneys were permitted, the price of a kidney would be relatively high, but they estimate this to be only for an initial period. An example of this high price was experienced when an offer of a kidney for sale was posted on eBay. Prince (1999) noted that the price of the kidney rose from an initial offer of $25,000 to nearly $6 million during the week it was listed on the site. However, Barnett and Saliba (2004) stress that because the demand for kidneys is typically non-recurring, in the longer-term the price would decrease as demand declined.
Auctioning a kidney on the internet is not the methodology economists who support a market for kidneys would advocate, but other compensation mechanisms they have put forward are explored briefly in the following paragraphs.

Donor Compensation
The issue of donor compensation is an area explored by Kaserman (2002). He compares compensation with an organ market and maintains that they differ in two ways: “with compensation, the form and amount of payment to organ suppliers are largely arbitrary, whereas market prices change with supply and demand so as to eliminate shortages. Second, markets provide greater incentives to individuals involved in the procurement process both to acquire more organs and to acquire them in a cost-efficient manner” (Kaserman 2002:100). Kaserman (2002) also asserts that, if a market for kidneys were to be permitted, more organs would become available and consequently higher standards would be set for transplantable organs. He concludes that this would lead to the average quality of transplanted organs being higher through organ markets. The factors outlined above are further reasons why a market for kidneys rather than a system of donor compensation should be considered. Barnett and Saliba (2004), in a similar vein to Becker and Elias (2003), are supportive of compensation for cadaveric kidneys but they assert that in a free market, the demand for kidneys to be harvested upon an individual’s death is likely to be very small, as relatively few people experience brain death while their bodily organs are still ‘viable’ – this is an essential prerequisite for cadaveric transplantation. Barnett and Saliba (2004) contend that the major source of kidney supply in a free market is more likely to be from living persons; as only one functioning kidney is necessary to lead a ‘healthy’ life, the potential supply from living individuals is enormous relative to the demand.

Financing the Market
Another question which would have to be addressed if a market for kidneys were to be allowed is how the financing for such a market would be transacted. In an effort to address this question, Barnett and Saliba (2004) compare four possibilities (options, futures, forward and spot markets) and conclude that a spot market is the one most likely to emerge [7]. “Those living individuals who desire to sell one of their kidneys, assuming the price and other relevant factors are satisfactory, would make this known through a listing with one or more brokers” (Barnett and Saliba, 2004: 44). Brokers would gather initial medical information on the potential seller and, if viability were established, the brokers would then incur the expense of the required antigen testing conducted to ensure a blood and tissue match. Barnett and Saliba (2004) contend that because of the truncated demand curve, a gap would exist between the demand and supply prices at the actual quantity of kidneys exchanged, a difference which the economists note would equal the amount of rent per kidney exchanged that would depend on bargaining. Mechanisms would have to be put in place to ensure that these rents would not form part of a collusive arrangement between brokers and hospitals; this mechanism might take the form of regulation.

Government Regulation and Policy
Given the widespread government regulation of the health care and medical industries, it is likely that governments would establish an independent regulatory authority, which would heavily regulate brokers to minimize as far as possible collusion in the marketplace. This regulation could be justified if it prevented an occurrence of kidney theft through the maintenance of exacting data on the source and destination of each kidney brokered through the institution.
Barnett and Saliba (2004) stress that these licensed brokers would be the only bodies from which a transplant patient could legally acquire an organ apart from cases of altruistic gifts between donors and recipients. In conclusion, this section has appraised some suggested market solutions put forward by economists in terms of allowing a market for kidneys and focused on the implication of the term ‘should’ from an economic perspective with an emphasis on efficiency and incentives. The next section deals with retorts to these suggestions.

[7] Further details on Options, Futures and Forward Organ Markets can be examined in Barnett, W. and Saliba, M. (2004) ‘A Free Market for Kidneys: Options, Futures, Forward, and Spot.’ Managerial Finance, Vol. 30 (No.2): Pages 41-44

III. The Context of the Economics of Kidney Transplants

Direct Retorts to the Market for Kidney Transplants
Many people still view organ donation as a purely altruistic gift and their concerns and difficulties with allowing a market for kidneys will be explored in this section of the report; the term ‘should’ will be considered from the ethical approach with an emphasis on equality and fairness for both potential donors and recipients. My reflections on their concerns will be dealt with at the end of this section. The concept of purchasing kidneys from compensated donors has traditionally been decried by medical associations and ethicists globally; they deem such transactions morally irresponsible (Roth, 2007) or ethically unacceptable (Scheper-Hughes, 2002a, b). Others contend that a market in kidneys would not be an equitable one for the donor. For example, Kolnsberg (2003:1056) maintains that “no matter what the market structure scenario, this initial application of economic theory leads to a conclusion that the donor-seller will not gain very profitably by selling his/her organs. In each scenario, the price for organs will drop in response to increasing supply. It seems that donor-sellers will not be able to profit in the long run from selling their organs. This stands true for both cadaver and living-donor organs.” Kolnsberg (2003) maintains that a naturally limited demand for transplants will keep prices low as increasing supply outstrips demand and third parties will limit the price they are willing to pay donor-sellers in order to maintain their own profitability. Kolnsberg (2003:1060) also asserts that a market for kidneys should not be allowed as “only the poorest would sell their organs at a risk to their wellbeing and for little financial reward in the long run.
Although the selling of organs would conveniently increase supply, which benefits recipients, it would not really benefit donors.” While Kolnsberg views a market for kidneys as being potentially inequitable for donors, others view the concept of a possible market for kidneys as repellent to society as a whole and offer other suggestions in terms of kidney procurement.

Broader Social Context - Moral & Ethical Considerations
Roth (2005) contends that creating a market for kidneys is politically unfeasible and socially repugnant. He maintains that it would be difficult politically to change the current system of procurement and he also asserts that many people find the idea of donor compensation repugnant. Furthermore, Roth (2005) maintains that economists subscribe to a particular point of view which avows that if two people engage in a voluntary transaction, it must be because they both want to and because it makes both parties better off. However, Roth (2007) determines that transactions are morally repugnant when they are transactions that some people (certain members of society) do not want other people (certain other members of society) to engage in. He maintains that economists must appreciate and engage with the phenomenon of repugnant transactions and he asserts that there are legitimate concerns about the monetisation of transactions which fall into three categories, concerns which he stresses apply to any proposed market for kidneys. The first of these concerns is what Roth (2007:44) calls objectification: “the fear that putting a price on certain things and buying or selling them might move them into a class of impersonal objects to which they should not belong.” A second concern of Roth’s is that offering substantial monetary payments for a kidney “might be coercive, in the sense that it might leave some people, particularly the poor, open to exploitation from which they deserve protection” (Roth, 2007:44). Roth’s third concern, which he asserts may not always be articulated clearly, is that “monetizing certain transactions that might not themselves be objectionable may cause society to slide down a slippery slope to genuinely repugnant transactions” (Roth, 2007: 45). Interestingly, Roth (2007) also notes that while the repugnance associated with a market for kidneys shares characteristics with repugnance for the monetisation of other types of transactions, the market for kidneys also has unique features. One of these features is the dilemma faced by transplant surgeons; taking a kidney from a healthy donor is contrary to the Hippocratic tradition of ‘first, do no harm’.
As Roth (2007: 48) puts it, “a surgeon who is already overcoming some distaste for performing a nephrectomy (kidney removal) on a healthy person may find the distaste more difficult to overcome if he views himself as facilitating a commercial transaction.” Intriguingly, no tangible evidence of this type of dilemma is provided by Roth to substantiate this claim in his 2007 paper.

Kidney Exchange
While Roth (2004) does not deny that an organ market would have some merits, he contends that a more practical and politically feasible approach to solving the current kidney shortage is that of an ‘organ exchange’. Roth (2004:2) puts forward the idea of a kidney exchange on a national scale in the U.S. which he explains involves “two donor-patient pairs such that each (living) donor cannot give a kidney to the intended recipient because of blood type or immunological incompatibility, but each patient can receive a kidney from the other donor.” Founded in New England in 2005, the New England Program for Kidney Exchange offers life-saving options to those seeking a kidney transplant, but whose potential living donor is not a good biological match due to either blood type incompatibility or cross-match incompatibility. The exchange uses a computer program to find cases where the donor in an incompatible pair can be matched to a recipient in another pair. Through exchanging donors, a compatible match for both recipients may be found.
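
The matching step described above can be illustrated with a short sketch. This is not the program used by the New England Program for Kidney Exchange; it is a minimal greedy search for two-way swaps, assuming compatibility depends only on a simplified ABO blood-type rule (real programmes also test tissue cross-match and use more sophisticated optimisation). The names `CAN_DONATE_TO`, `compatible` and `find_two_way_exchanges`, and the sample pairs, are hypothetical.

```python
from itertools import combinations

# Simplified ABO rule (an assumption for illustration): the recipient
# blood types to which each donor blood type can give a kidney.
CAN_DONATE_TO = {
    "O": {"O", "A", "B", "AB"},
    "A": {"A", "AB"},
    "B": {"B", "AB"},
    "AB": {"AB"},
}


def compatible(donor_type, patient_type):
    """Return True if a donor of donor_type can give to patient_type."""
    return patient_type in CAN_DONATE_TO[donor_type]


def find_two_way_exchanges(pairs):
    """Greedily find two-way swaps among incompatible donor-patient pairs.

    pairs: list of (pair_id, donor_type, patient_type).
    Returns a list of (pair_id_a, pair_id_b); each pair is used at most once.
    A greedy pass is simple but does not guarantee the maximum number of swaps.
    """
    matched = set()
    swaps = []
    for (id_a, donor_a, patient_a), (id_b, donor_b, patient_b) in combinations(pairs, 2):
        if id_a in matched or id_b in matched:
            continue
        # A swap works only if each donor can give to the other pair's patient.
        if compatible(donor_a, patient_b) and compatible(donor_b, patient_a):
            swaps.append((id_a, id_b))
            matched.update((id_a, id_b))
    return swaps


# Hypothetical incompatible pairs: (pair id, donor type, patient type).
pairs = [
    ("pair1", "A", "B"),
    ("pair2", "B", "A"),
    ("pair3", "A", "O"),
    ("pair4", "AB", "O"),
]
print(find_two_way_exchanges(pairs))
```

In this example only the first two pairs can exchange donors; the two pairs whose patients have blood type O remain unmatched because neither donor can give to a type O recipient.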

Conclusion
In concluding this section of the report, I will argue that the current system of kidney procurement which emphasises donor altruism is positioned to fail and that arguments against organ markets also fail when they are considered in any great detail. Legalising a market for kidneys would have a positive outcome for society; the number of donors would very likely increase, countless lives would be saved and donors could be acknowledged for their donation through monetary compensation. This conclusion has been reached through the analysis of the work of those against a market for kidneys; the following paragraphs outline my retorts to those who are against considering a market for kidneys. In terms of Kolnsberg’s (2003) argument regarding donor compensation, it is my view that her line of reasoning is inconsistent. Kaserman (2005:894) explains my contention pithily when he asserts that if the impact of a proposed policy on social welfare (taken to be the sum of consumer and producer surplus) is examined, and if that policy is observed to increase such welfare, then “absent other non-economic objections, it should be implemented.” He further elucidates: “Instead of this traditional criterion and the huge amount of literature that exists to support it, Ms. K comes up with her own novel standard – whether the supplier of living donor organs will be able to earn significant positive economic profits in the long run. If the answer is no (as she attempts to demonstrate) then she argues that such sales should continue to be banned; despite the fact (which she conveniently ignores) that that ban is currently causing the deaths of over 6,000 patients each year” (Kaserman, 2005:894). Other suggested non-market solutions to this question are also inconsistent.
Roth’s (2004) solution of a ‘kidney exchange’ will do nothing to alleviate current waiting lists in the United States because kidney exchanges are rare; only five exchanges were completed in centres in New England between the programme’s set-up in September 2005 and 31 December 2005. While Roth (2005) maintains this low exchange rate is because databases are just being assembled, I would assert that even if levels of exchange were increased ten-fold it would still not have any serious impact on transplant waiting lists in the United States. Roth’s (2007) opinions on repugnant transactions also lack consistency in that he does not actually define what he means by the word repugnant; because the term is subjective it should have been clarified by him when building his argument. It is my understanding that Roth has classified transactions into two categories: acceptable and repugnant. While Roth does not (to the best of my knowledge) actually define the word repugnant, my understanding is that he deems transactions to be repugnant when they are ‘offensive’ to some members of society. Many transactions take place on a daily basis which, although legal, might be viewed by some members of society as repugnant. It is not unusual to see advertisements in newspapers seeking reproductive cells from healthy, intelligent students [8, 9], so why is it so ethically wrong to buy and sell a kidney? Both acts are basically the same thing: a transaction where one person willingly chooses to give away a part of their body in exchange for monetary compensation. It could be argued that selling sperm or ova is more repugnant than selling a kidney, as these cells have the potential to create new human life, while a kidney does not. Roth (2005, 2007) maintains that repugnance is viewed by economists as a side issue rather than a real phenomenon which he asserts must be considered when investigating this compelling question. It is difficult to take repugnance seriously when one is not sure what Roth actually means by the term. If ‘transaction repugnance’ does exist, perhaps one solution to maintaining society’s confidence in a hypothetical market for kidneys would be to ensure that all facets of such a market would be regulated by an independent medical and patient body on a national and international scale.
In support of this view, Hippen (2005:612) maintains that a defensible market in kidneys should, at the minimum, have the following four characteristics: “the priority of safety of the vendor and recipient, transparency regarding risks to the vendor and recipient and regarding institutional outcomes and follow-up care, institutional integrity… and operation under a rule of law.” Hippen (2005:613) also asserts that through fashioning policy at an institutional level, health professionals, vendors, donors and recipients “with compatible moral commitments can cooperate with each other, and, unlike the current system, the forbearance rights of each can be respected in full.” This is the type of market for kidneys which could be considered by policymakers in the coming years as the plight of those on dialysis continues to deteriorate. In addressing Roth’s (2007) concern about the objectifying of kidneys, economists such as Kaserman (2005), Becker and Elias (2003) and Barnett (2001) support the ‘commodification’ of kidneys; the main thrust of their argument is that the current prohibition of the purchase and sale of kidneys has resulted in unnecessary suffering for some people and premature death for others. It is clear from the research conducted for this literature review that the idea of establishing a market for kidneys, while not new, is now attracting unprecedented support from economists, ethicists and some members of the transplant community.

[8] ‘Women Shopping for Super Sperm’, The Vancouver Sun, 10 December 2005
[9] ‘Sperm Bank Seeks Donors’, Journal of Business, 17 June 2004

IV. Conclusion
In life there are questions that never really go away. The idea of establishing a market for kidneys is one of those questions. Today, it is a topic at the centre of much controversy. Rothman et al. (2006:1524) note that “proponents emphasize the concept of autonomy; opponents invoke fairness and justice.” No matter the approach taken, it is clear that a shortage of kidneys exists and every solution put forward must be considered to address this problem. Removing the ban on the sale of kidneys or allowing some type of compensation for donors has become the solution most discussed by those involved in the debate. A healthy human is born with two kidneys, but only one is necessary to survive. This raises a very important question: should people have the right to sell a kidney they do not actually need? Property right issues surrounding body parts have become increasingly complex. Howley (2006:2) contends that “every corpse has a legal value of zero, but transplantable organs and tissues grow more valuable every day. Body parts aren’t legal property to the people born with them, but can be distributed by doctors, universities, biotech companies, and procurement agencies for profit or otherwise” (my emphasis). Is it right that the donor or the donor’s family receives no compensation? In effect, the donor is the only component of the current ‘market’ procurement system who does not receive payment. The following example illustrates this point succinctly. On Feb 16, 1990 Susan Sutton from Moore, Oklahoma, shot herself in the temple in an attempt to take her own life. She was declared dead in a local hospital some hours later and her parents gave permission for her heart, liver and corneas to be harvested for transplantation. “The hospital and medical teams that removed the organs received thousands of dollars for their services from patients who needed transplants, from the patients’ insurance companies or from Medicare or Medicaid.
The nonprofit agency that coordinated the transplants also received thousands of dollars, as did the surgeons and hospitals where the transplants took place. But Ms. Sutton’s parents received nothing, and unable to afford the cost of a headstone, buried their 28 year old daughter in an unmarked grave” (Young, 1994:1)

Courts and legislatures habitually link their resistance to corporeal property rights with the corrupting power of markets. The United Network for Organ Sharing [10] notes that “the laws and regulations surrounding a deceased organ donation, allocation and transplantation have purposefully established a legal infrastructure that excludes property law concepts … instead, organs are donated for transplantation voluntarily (not sold or appropriated) and are regulated as a scarce national resource.” It is my view that economists are well positioned to engage in this debate; kidneys are a scarce resource and Economics is a study of how best to allocate scarce resources. As we move toward considering alternative solutions to the current shortage of kidneys, in time perhaps kidneys will be viewed without repugnance as a scarce economic resource. Remarkably, the fact that it is proscribed to either sell or buy a kidney has not actually stopped the sale of organs from occurring. The problems which have stemmed from the burgeoning global black market in kidneys are monumental and have led to a kind of transplant tourism or ‘neo-cannibalism’ as Scheper-Hughes (2004) describes it. Patients travel from wealthier countries without a black market organ trade to parts of the world where both an organ can be procured and an operation can be carried out, for a price. However, Hippen (2005:602) counters Scheper-Hughes’s argument when he asserts that “a regulated market in human organs is better suited to reduce organ trafficking by offering vendors and recipients alike a safe alternative, while significantly reducing the demand for organs that perpetuates organ trafficking.” On the same issue, Finkel quotes the Israeli nephrologist, Michael Friedlaender, at a conference in Denmark in 1999 as stating: “What’s happening now is absurd … airplanes are leaving every week. In the last few years, I’ve seen 300 of my patients go abroad and come back with new kidneys… it’s a free-for-all. Instead of turning our backs on this, instead of leaving our patients exposed to unscrupulous treatment by uncontrolled free enterprise, we as physicians must see how this can be legalized and regulated. … Examining those 300 patients brought me down from my high horse of ethics. Now I’m more practical. My patients don’t want my opinion on whether or not buying a kidney is moral – they want to know if it’s safe. And I have to say that it is. The current system of organ donation without remuneration is a failure” (Finkel, 2001:5).

[10] www.unos.org (cited 17.11.07)

In conclusion, one cannot deny the enormous challenges that come with introducing compensation into a deep-rooted scheme built on the premise that altruism is the only legitimate motive for giving. Yet, as death and suffering escalate, constructing a market-based compensation programme which will increase the supply of transplantable kidneys has undoubtedly become imperative. Concerns about donor safety must certainly be given serious consideration, but repugnance and vigilance are not in themselves arguments in opposition to change; rather, they are motivations for caution, concern and care.



EDUCATION PANEL

Judging panel
Mr. Michael Cotter (Dublin City University) – Chair
Mr. Padraig O’Murchu (Intel)
Dr. Kevin Marshall (Microsoft)
Mr. Paul Rowe (Educate Together)



Education

In Ireland, recent legislation & policy in health, education & social services have changed the nature & practice of early childhood education care & services
Dairine Taaffe

Introduction
Children represent nearly one third of the population of Ireland (Hayes, 2002), and as such represent the future of Ireland and have a central place in our ever changing society as present and future citizens. Numerous significant policy documents and decisions have been developed in recent years in relation to the practice of early childhood education, marking what might be considered a change in attitude towards children in our society. This wealth of documents has endeavoured to provide Ireland with the highest quality of services and care for our children. The UN Convention on the Rights of the Child (UNCRC) is an international human rights treaty which grants a comprehensive set of rights to all children without discrimination, and this has had an important role in stimulating progress in this area within Ireland. In this essay I will argue that the adoption of the principles of the UNCRC has had a positive impact on this country in relation to the nature and practice of early childhood care and services. I intend to choose an area from social services, education and health (in that order) and discuss these in relation to developments in Irish policy and legislation and the effects of these policies on early childhood care and services. With regard to children’s rights, historically Ireland has had an ethos of expecting children to be seen and not heard. Many believe this attitude is changing due to the increased awareness and acknowledgement of children’s rights. The UNCRC (1989) includes the participation rights of children (Article 12). This involves children having freedom of expression and the right to have a say in matters that affect their lives. Conversely, authors such as Hayes (2002) and Martin (2000) believe children in Ireland are not given sufficient opportunity to speak for themselves and are voiceless in society. I will address this issue and examine the ways and means by which children are empowered and given a voice in society with relation to the relevant legislation and policies. Under the development rights of children, Article 28 of the UNCRC (1989) states that every child has the right to education. Since 1990, and particularly since the millennium, the Irish government has endeavoured to improve early childhood services, especially in education; these key milestones will be examined, along with how they have affected and changed practice within Ireland. A health survey carried out by the World Health Organisation in 2008 saw children in Ireland score very highly (Irish Examiner, 2008). An increased awareness of health inequalities has led to policies being put in place in relation to children’s health. I am keen to explore these policies, focusing on Traveller children and breastfeeding, and their effects on services. According to the UNCRC (1989), every child has the right to avail of health and medical services and to enjoy the highest quality of health that is attainable (Article 24).

Social Services: Children’s Rights
In recent years policy makers have come to recognise the importance of giving children their own voice. They are including children’s opinions in research and on topics relevant to them. The benefit of including children in policy making was clearly seen in the National Children’s Strategy (2000). Children and young people were consulted in the formation of this government policy, the publication of which is a huge step towards the implementation of the UNCRC recommendations. This document provides a clear policy statement which reflects the hopes and concerns of children themselves and all parties involved in working with children (DOHC, 2000). In this strategy three national goals were identified: children will have a voice, children’s lives will be better understood and children will receive quality supports and services (Richardson, 2005). The first national goal concerns giving children a voice in matters that affect their lives. Their opinions are to be given due weight in accordance with their age. One way in which the strategy hoped to achieve this was by putting in place new procedures in the public sector to boost participation by children in matters that affect them, and to promote and support the development of a similar system in the private and voluntary sector. As a result of the strategy many significant developments were made, such as the establishment of a National Children’s Advisory Council, together with the National Children’s Office in 2002 (Richardson, 2005). Dáil na nÓg, the national children’s parliament, was set up, giving children the opportunity to raise and debate issues of concern. The first meeting of Dáil na nÓg took place in 2001, and 200 children aged between eight and seventeen years attended, representing every county and socio-economic group (Office for the Minister for Children and Youth Affairs, 2002). A further measure of the strategy was that a children’s ombudsman be established by legislation as an independent office. The Ombudsman for Children Act (2002) provides that the role will be an independent office (DOHC, 2002). Again children were consulted on their views as to what should be the priorities of the Irish Children’s Ombudsman during the first 12 to 18 months of office. This is a prime example of children being listened to and their opinions considered and valued (Children’s Rights Alliance, 2003). Another recommendation of the strategy was that children’s views should be represented at a national and local level in relation to relevant services. A 2002 progress report indicated that many city and county development boards had established mechanisms to give children a voice, thereby improving children’s participation at a local level.

Education
The 1990s was a period of intense debate, examination and policy development in Irish education (Clancy, 2005). According to Clancy (2005), this educational reform initially focused on primary, secondary and third level. By comparison, pre-school education received little attention until the late 1990s. The Report on the National Forum for Early Childhood Education was published in 1998. It explores some of the main areas involved in Early Childhood Education and Care, and proposes some areas in which intervention is needed. This was the beginning of a rapid growth of policies and legislation focusing on early years’ care and education. Following the Forum, the Department of Education and Science produced the White Paper on Early Childhood Education entitled ‘Ready to Learn’ (1999). This paper is concerned with children from birth to six years. It sets out the core objective of early childhood education as “supporting the development and educational achievement of children through high quality early education, with particular focus on the target groups of the disadvantaged and those with special needs” (Department of Education and Science, 1999, p. 14). The Centre for Early Childhood Development and Education (CECDE) was established in 2002 on the recommendation of the White Paper (Citizens Information, 2009). The purpose of the CECDE was to begin implementing some of the key recommendations of the White Paper (Duignan, 2004). It published ‘Síolta: the National Framework for Quality in Early Childhood Education’ (2006), which has influenced the practice in childcare settings across the country (Citizens Information, 2009). Unfortunately, due to cutbacks, the CECDE was forced to close in November 2008 (Carr, 2008). During its existence, the CECDE made a substantial contribution to the development of policy and practice in early childhood education and care (St Patricks College, date unknown). Despite evidence to support the fiscal and social rationale for investment in early childhood education, there is still a great need for improvements in this area. In 2002 the Department of Education and Science invited the OECD Directorate for Education to conduct a review of Early Childhood Education policies and services in Ireland. The OECD team met with many government departments, agencies and other stakeholders dealing with early childhood issues and made site visits covering a range of services for young people from 4 months to 6 years of age. In 2004 the OECD published its report on early childhood care and education in Ireland. It made a number of recommendations across the key areas of access, quality and co-ordination, including:
The integration of all early education and care policy and funding under one department or under a designated funding and policy organisation.
The implementation of the White Paper, ‘Ready to Learn’ (1999).
The urgent formation of a national plan for early childhood services development.
The formation of a National Goal and Quality Framework.
(Schonfeld, 2004)

The report also found that funding for early childhood services in Ireland has been low and recommended a significant increase in this funding (OECD, 2004). It described provision for children under three years of age as very weak and coverage for three to six year olds as among the lowest in the EU (Womens Health Council, 2009). Unfortunately many of the recommendations of this report have not been implemented. For example, there is still no single department dealing with early education and care policy. The National Economic and Social Forum (NESF) report (2005) demanded that early childhood education and care be made a priority and showed that investment in this area will yield significant future dividends. The NESF report showed that for every euro invested in early years’ education and care, between €4.60 and €7.10 would be returned (NESF, 2005). Similar to the OECD report (2004), the NESF report showed that, among the OECD countries, Ireland has a low rate of investment in early years’ education and care. In 2005 Ireland invested 0.2% of GDP compared to an average of 0.4% in other OECD countries (NESF, 2005). The NESF strongly recommended that state funded quality care and education should be provided for all children in the year before they go to primary school (Ring, 2005).

Health


This section will address two areas in relation to health in childhood, namely breastfeeding and the health of traveller children. These were chosen as I have recognised them as two important areas in which there has been significant development. The UNCRC states that children have a right to a good standard of health and health care, and this right applies to all children without discrimination. Here in Ireland there is an ethnic minority group which experiences a level of health that is far short of the level experienced by the general population. The Children’s Rights Alliance (2004) states that children from the travelling community are particularly at risk of experiencing poverty and in turn have a poorer level of health than the general population of Ireland. In 1987 the infant mortality rate amongst travellers was 18.1 per 1,000 live births compared to a national figure of 7.4. In 1999 the rate of Sudden Infant Death Syndrome in traveller families was twelve times the national figure (Department of Health and Children, 2002). In 2002, the Department of Health and Children published ‘Traveller Health: A National Strategy’. The document contained 122 recommendations in relation to the improvement of travellers’ health. Many of the recommendations refer to the need for health care workers working with travellers to have additional training (Murphy, 2002). There have been significant changes since this document was published, resulting in better services being provided for travellers and their children. Traveller health units around the country have drawn up regional plans for traveller health. Primary Health Care projects have also been developed for travellers around the country (Pavee Point, 2003).

During the late 1980s and early 1990s the incidence of breastfeeding in Ireland was 32%, a very low figure in comparison to other countries. Babies who are breastfed receive better protection against disease in both the long and short term (Holden et al., 2000).
In 1992 a committee with representatives from various professional and voluntary groups was set up to develop a national breastfeeding policy. This policy was published in 1994 and outlined a series of recommendations and targets aimed at improving the breastfeeding rate in Ireland (HSE, 2007). This strategy greatly changed attitudes and increased awareness in Ireland around breastfeeding. As a result of the policy Ireland put in place structures to implement the Baby Friendly Hospital Initiative (DOHC, 2005). This initiative encourages best practice in the maternity service, which is crucial to the success of programmes to promote breastfeeding (Baby Friendly Hospital Initiative, 2008). Under this policy a national breastfeeding coordinator was also appointed in 2001. In addition, a national committee on breastfeeding was set up in 2002. This committee was to review the 1994 breastfeeding policy and to produce a new five year strategic plan for breastfeeding in Ireland, which was published in 2005 (DOHC, 2005). This expert working group was to continue the work begun with the 1994 policy by drawing up evidence based goals, targets and objectives to increase the uptake and duration of breastfeeding in Ireland (HSE, 2007).

Conclusion


Since 1990 there has been a huge growth in attention to, discussion of, and debate about young children in Ireland. This stemmed mainly from our ratification of the UN Convention on the Rights of the Child in 1992 and also our publication of ‘Our Children - Their Lives: The National Children’s Strategy’ (2000). This essay addresses the articles from the UN Convention which I felt were significant and which covered areas of social services, education and health. Firstly the issue of children’s participation rights and their right to have a voice and be heard in society was addressed. The first goal of the National Children’s Strategy (2000) was to give children a voice in matters that affected their own lives. The fact that this was the first goal shows the weight and importance that this policy rightly gives to this issue, because it, in turn, impacts on our information on all other areas of children’s lives in relation to their education, health and general well-being. It had significant outcomes which helped empower children across the country, namely the appointment of the Ombudsman for Children, which is a landmark step in the promotion of children’s rights and welfare. The developments arising from the National Children’s Strategy have been very positive and I believe have drawn policy makers’ attention to the necessity of consulting and including children at national and local level. In the late 1990s there was a huge move to focus on early years’ education, with numerous policies and legislation being introduced. The White Paper on Early Childhood Education led to the establishment of the CECDE, which promoted and worked to implement the objectives of the White Paper. Unfortunately the CECDE closed in 2008 as a result of lack of funding.
This indicates a lacklustre approach and commitment on the part of the government to early childhood education, care and services, and demonstrates that in times of less economic prosperity, children’s issues could once again slip down the agenda after good progress had been made. The OECD report strongly criticised Ireland’s early childhood education, especially in terms of access. Again only a few of the recommendations of this report have ever been implemented. The NESF report (2005) showed that investment in early years’ education results in large dividends being received in later years. Both the OECD (2004) and the NESF report (2005) suggested that more money be invested in early years’ services, but in these recessionary times it is clear that early childhood services are among the first to be cut. It is important to change the attitudes and opinion of the government and assure them that the necessary investment in this sector will reap benefits in later years. Certainly as a result of ‘Traveller Health: A National Strategy’, there has been an improvement in the access to and availability of services for the travelling community. It is no longer possible to ignore the health needs of this vulnerable group in society. Although some of the recommendations of the strategy have been implemented, there is still a long way to go to get traveller health on par with that of the general population. The WHO (1985) prerequisites for health are adequate food, safe water and sanitation, decent housing, basic education and employment; these are areas that often fall short in travellers’ lives (Hainsworth, 1998). It is the responsibility of our government to improve each one of these services for the travelling community and perhaps then the health status of this ethnic minority group may improve. The 1994 breastfeeding policy initiated many changes and improvements in this area. This policy led to numerous developments in breastfeeding protection, support and promotion within Ireland. Breastfeeding is now more common: between 1981 and 1991 the rate of breastfeeding in Ireland remained static at 32%, and in 2008 the rate was 47%. This is a huge success for both breastfeeding policies and they have clearly changed breastfeeding attitudes and practices within Ireland. Due to the scope of the essay it was impossible to discuss all the landmark documents in relation to health, education and social services since 1990. There has been a huge amount of literature published due to increased awareness and pressure coming from national and international sources for the government of Ireland to look more closely at early years’ education, care and services. However, I chose documents which I felt had been influential in the development of early childhood care, education and services, and have made the greatest impact within these areas. Some progress has been made, but there is still a need for significant improvement and for the impetus in this area to continue to be strong. If the Irish government takes its eye off this area to the detriment of children, it will be as culpable as previous administrations who have been found wanting in relation to early childhood services and care.



ENGINEERING PANEL

Judging Panel
Prof. Nick Quirke (University College Dublin) – Chair
Prof. Clive Williams (Trinity College Dublin)
Prof. Patrick FitzPatrick (University College Cork)

Judges’ commentary
This is an original contribution to predicting risk levels due to subsurface construction. The authors propose a new, integrated methodology that uses a three-stage process as follows: (1) traditional empirical relationships for predicting ground movement from tunneling, (2) recent advances in condition assessments, and (3) architectural designators related to usage and preservation listings. The methodology is applied to a study area of nearly 260 buildings in the city centre of Dublin, Ireland, which is being considered for a metro. Although the test case is for tunneling, the approach can be readily adapted to many other subsurface construction activities including excavation and piling. The committee felt that the paper was academically excellent and well-written.


ENGINEERING

An integrated condition assessment & empirical approach to predict risk levels due to subsurface construction
Julie Clarke & Laura Hannigan

In 1939 Ralph Peck criticized the Civil Engineering community for taking a bifurcated approach to the prediction of building damage due to subsurface geotechnical works. Specifically, he observed that Geotechnical Engineers concerned themselves only with subsurface issues, believing that their results were largely independent of the presence and condition of structures above. He continued that Structural Engineers charged with the risk assessment of buildings approached their work in near isolation from the activities below ground. Seventy years later, although the state of the art has enabled a better understanding of soil-structure interaction, the reality remains that risk assessment for subsurface construction is not yet an integrated activity when done on a large-scale basis. To help move beyond this, the following project proposes a new, integrated methodology built on a three-stage process that involves the following: (1) traditional empirical relationships for predicting ground movement from tunneling, (2) recent advances in condition assessments, and (3) architectural designators related to usage and preservation listings. The methodology is applied to a study area of nearly 260 buildings in the city center of Dublin, Ireland, in an area slated for an upcoming metro. The results are compared to the Environmental Impact Statement issued by the consultants for the official project. Although the test case is for tunneling, the approach can be readily adapted to many other subsurface construction activities including excavation and piling.

Introduction

The potential risk to structures as a result of subsurface construction is commonly determined through empirical methods and the use of a three-stage damage assessment process. Such an approach was adopted by Burland (1995) for assessing the risk to buildings resulting from the London Underground Jubilee Line Extension, and it was also utilized with the construction of the underground Crossrail project in London (Torp-Petersen and Black, 2001). While surface settlements may inflict the greatest degree of damage, numerous factors relating to the building itself, not just the ground below it, have a large part to play. This study developed a new methodology to establish which buildings were susceptible to significant damage. While based on the traditional empirical methods and settlement formulae as utilized by the above projects, the current condition of the building, the land usage and the architectural significance were incorporated into predictions through the development of new damage scales.

The methodology was applied to the proposed construction of Metro North. This lightweight rail system has been proposed for Dublin City by Ireland’s Railway Procurement Agency (RPA). The rail will provide a vital transport link within the city and will serve to unify existing infrastructural systems. The first 5.5km of the proposed route will consist of twin tunnels situated below ground level to avoid encroaching on Dublin City’s limited space. The location of the stops in the initial portion of the proposed route means that tunneling under Grafton Street, the commercial centre of the city and a designated Architectural Conservation Area, is unavoidable. Since ground loss, surface settlements, and horizontal movements can all result from tunneling through soils, the assessment of the impact this tunneling will have on structures in this area is of the utmost importance.
Background

The findings of Skempton and MacDonald (1956) and Burland and Wroth (1975) provide the basis for the damage assessment of buildings subjected to tunneling-induced subsidence and have changed only marginally over the past decades. As early as 1956, Skempton and MacDonald established damage limits from a survey of 98 buildings. The limits were defined in terms of angular distortion: for load-bearing masonry structures an angular distortion of 1/300 was suggested as a limit above which cracking was likely to occur; for framed buildings an angular distortion of 1/500 was suggested. The relationship between the initial visible cracking and the tensile capacity of the material, known as the critical tensile strain, was investigated by Burland and Wroth (1975) for buildings subjected to excavation-induced settlement.


Their study was based on the concept of critical tensile strain, originally introduced by Polshin and Tokar (1957), who showed that the onset of visible cracking was associated with a critical tensile strain (0.05% for brick walls). This finding was reinforced by Burhouse (1969), who found that the tensile strain varies from 0.038% to approximately 0.06% at the onset of cracking. Based on these studies, Burland and Wroth (1975) conducted an analysis by representing the structure under investigation as a uniform, weightless, elastic beam of length L, height H, and a unit thickness. The value of the deflection ratio, ∆/L, which caused visible cracking for a material with a known value of critical tensile strain, varied from 0.05 to 0.1% for brickwork and blockwork set in cement mortar, depending on the mode of deformation, the relative shear to tensile stiffness, and the geometry of the structure. This agreed with the various existing criteria for brickwork. The concept of the critical tensile strain was further developed by Boscardin and Cording (1989). To establish a risk level for buildings, they proposed a relationship between the tensile strain and potential damage categories (Table 1). Another vital parameter in assessing the potential impact of tunneling is the prediction of the resulting ground settlements. Peck (1969) proposed that the shape of a settlement trough above a tunnel may be approximated by a normal Gaussian distribution curve of the form:

$$S_v = S_{max} \exp\left(-\frac{x_0^2}{2i^2}\right)$$ [Eq. 1]

where Sv is the surface settlement at a distance x0 from the tunnel centerline, Smax is the maximum settlement, and i is the distance from the centre line to the point of inflection of the settlement trough. O’Reilly and New (1982) further investigated the extent of settlement by examining a variety of parameters, such as ground conditions, details of tunnel construction and percentage ground loss. For soft fissured clay, they found the trough width parameter, K, to have a value of 0.4 - 0.5, and the volume loss at the surface, Vl (%), to be between ½ and 3. Formulae for the maximum settlement (Smax) and the slope of each building were proposed by Rankin (1988) (Eq. 2 and Eq. 3), where r is the tunnel radius:

$$S_{max} = \frac{0.0125\, V_l\, r^2}{i}$$ [Eq. 2]

$$Slope_{max} = \frac{S_{max}}{i}$$ [Eq. 3]
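As a rough illustration of the Gaussian trough in Eq. 1, the short Python sketch below evaluates the settlement at a few offsets from the centerline. The function name and the sample values of Smax and i are assumptions chosen only for illustration; they are not taken from the study.

```python
import math


def settlement(x0, s_max, i):
    """Eq. 1: surface settlement at horizontal distance x0 from the
    tunnel centerline, for maximum settlement s_max and inflection
    distance i (x0 and i in the same length unit)."""
    return s_max * math.exp(-x0**2 / (2.0 * i**2))


if __name__ == "__main__":
    # Hypothetical inputs: 17 mm maximum settlement, 7.5 m inflection distance.
    s_max_mm = 17.0
    i_m = 7.5
    for x0 in (0.0, i_m, 2 * i_m, 3 * i_m):
        print(f"x0 = {x0:5.1f} m -> Sv = {settlement(x0, s_max_mm, i_m):6.2f} mm")
```

At x0 = i the settlement falls to exp(-1/2), roughly 61%, of Smax, which is what makes i a convenient measure of trough width.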


In addition, O’Reilly and New (1991) proposed an equation for ground settlements for the more complex scenario of twin tunnels:

$$S_{v,combined} = \frac{V_l}{Kz\sqrt{2\pi}} \left[ \exp\left(-\frac{x_0^2}{2(Kz)^2}\right) + \exp\left(-\frac{(x_0 - D)^2}{2(Kz)^2}\right) \right]$$ [Eq. 4]

where Vl is the volume loss and z is the tunnel depth. Traditional ground settlement troughs, based on greenfield conditions, were proven to be overly conservative when Potts and Addenbrooke (1997) showed that the inclusion of the presence of the structure reduced trough depth due to the building’s bending and axial stiffness. The study was conducted using the finite element (FE) program ICFEP (Imperial College Finite Element Program) and, as in Burland and Wroth’s (1975) study, the existing structure was represented as a beam located at ground level. Modification factors were developed as a result, which could be applied to greenfield values of deflection ratio and horizontal strain to give a more realistic prediction of the likely damage to be experienced by the existing structure. This provided an important development in creating more realistic settlement predictions. A study by Burd et al. (1999) agreed with the findings of Potts and Addenbrooke (1997) by revealing an approximate 30% decrease (compared to greenfield settlements) in ground settlements. Their analyses showed that the building appeared to act as a stiff beam resulting in a decrease in ground settlements but an increase in settlement trough width.
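The twin-tunnel expression in Eq. 4 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the volume loss is passed as a volume per metre of tunnel (m³/m) so that the result is a length, D is treated as the horizontal distance between the two tunnel centerlines, and the function name and sample values are hypothetical.

```python
import math


def twin_tunnel_settlement(x0, v_loss, k, z, d):
    """Eq. 4: combined settlement of two identical tunnels.

    x0     : horizontal distance from the first tunnel centerline (m)
    v_loss : volume loss per metre of tunnel (m^3/m), an assumed unit
    k, z   : trough width parameter and tunnel depth (m), so i = k * z
    d      : assumed centerline-to-centerline spacing of the tunnels (m)
    """
    i = k * z
    peak = v_loss / (i * math.sqrt(2.0 * math.pi))
    first = math.exp(-x0**2 / (2.0 * i**2))
    second = math.exp(-(x0 - d)**2 / (2.0 * i**2))
    return peak * (first + second)


if __name__ == "__main__":
    # Hypothetical inputs: 1% of the face area of a 3.2 m radius tunnel.
    v_loss = 0.01 * math.pi * 3.2**2
    for x0 in (-10.0, 0.0, 3.5, 7.0, 17.0):
        s = twin_tunnel_settlement(x0, v_loss, k=0.5, z=15.0, d=7.0)
        print(f"x0 = {x0:6.1f} m -> Sv = {1000 * s:5.1f} mm")
```

Because the two troughs are simply superposed, the combined profile is symmetric about the midpoint between the tunnels.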

Scope

The chosen study area focuses on the first 5.5km of the proposed route, beginning at St. Stephen’s Green and running north as far as the southern edge of the River Liffey (Figure 1). For this portion of the Metro line, twin tunnels are proposed, to be located at a depth of roughly 15-20m below ground level, with an excavated diameter of approximately 6.4m, situated 7m apart. This area of approximately 0.005km² includes roughly 259 buildings. The majority of the buildings are Victorian or Edwardian, date from the late 19th century to the early 20th century and primarily consist of narrow, four-storey terraced masonry structures. Many of the buildings have historical importance and this is reflected in the high number of protected structures present in the area (approximately 42).

Data Collection

Scaled drawings were obtained for the buildings located in the study area, providing relevant dimensions. Fieldwork was also conducted and involved carrying out a visual assessment of each building. This provided information regarding glass frontage, fan windows, window sills, lintels, ornamentation, part shear walls, full length shear walls, party walls, building material, parapets and penthouses. In addition, a photograph was taken of each individual building. This thorough inventory of photographs enabled the classification of the current condition in conjunction with damage scales described later.

Three-Stage Assessment Process

To predict potential tunnel-induced damage for the buildings within the study area, a new methodology was developed, based on the three-stage assessment process, as outlined in Fig. 2. The assessment process selected is similar to those applied to previous tunneling projects, such as the London Underground Jubilee Line Extension (Burland, 1995) and the underground Crossrail project in London (Torp-Petersen and Black, 2001).

Fig. 2. Methodology

This paper advances the three-stage assessment process by incorporating the current condition of the building into the process. In the past, a condition assessment of buildings has not routinely been part of the tunnel-induced damage prediction process. The assessment process requires time, effort and experience and, thus, is a whole field unto itself. However, the role it plays is important in the overall damage prediction process, and so it has been strongly incorporated into the proposed new methodology.

Stage 1

Stage 1 of the analysis considered traditional empirical limits relating to the maximum settlement (Smax) and the slope of each building within an anticipated soil trough. Ground settlements were considered under greenfield conditions, where the presence of the building is ignored. The settlement troughs obtained, based on greenfield predictions, may be considered conservative. However, the detailed assessment in later stages will account for this initial conservatism. The maximum settlement (Smax) and the slope of each building were calculated using Equations 2, 3 and 4. Various data were needed for these formulae: x0, the distance from the tunnel centerline to the building edge, was obtained by measuring the distance off scaled maps using AutoCAD; the percentage volume loss at ground level (Vl) was assumed to be 1%; the distance of the tunnel axis to ground level (z) was taken as 15m; the distance from the centre line to the point of inflection of the curve (i) was calculated using an equation proposed by Rankin (1988) (Eq. 5), where K was assumed to be 0.5:

$$i = Kz$$ [Eq. 5]

Buildings with a slope of less than 1/500 and whose settlement was less than 10mm were considered to be of negligible risk (Rankin, 1988), and were eliminated from further study.
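The Stage 1 screen can be illustrated with a short Python sketch using the parameter values stated above (K = 0.5, z = 15 m, Vl = 1%). Several things here are assumptions rather than details from the study: the tunnel radius is taken as half the 6.4 m excavated diameter, Eq. 2 is read with Vl in percent and lengths in metres, and the example buildings with their settlements and slopes are invented purely to exercise the screening rule.

```python
def inflection_distance(k, z):
    """Eq. 5: distance from the centerline to the point of inflection."""
    return k * z


def max_settlement_mm(v_l_percent, r, i):
    """Eq. 2, converted to millimetres (assumes Vl in %, r and i in metres)."""
    return 1000.0 * 0.0125 * v_l_percent * r**2 / i


def is_negligible_risk(settlement_mm, slope):
    """Stage 1 rule: settlement below 10 mm AND slope below 1/500."""
    return settlement_mm < 10.0 and slope < 1.0 / 500.0


if __name__ == "__main__":
    i = inflection_distance(k=0.5, z=15.0)
    s_max = max_settlement_mm(v_l_percent=1.0, r=3.2, i=i)  # r is an assumption
    print(f"i = {i:.1f} m, Smax = {s_max:.1f} mm")

    # Hypothetical buildings: (label, predicted settlement in mm, slope).
    buildings = [("A", 4.0, 1 / 900), ("B", 12.0, 1 / 700), ("C", 8.0, 1 / 400)]
    for label, s_mm, slope in buildings:
        verdict = "eliminated" if is_negligible_risk(s_mm, slope) else "to Stage 2"
        print(f"Building {label}: {verdict}")
```

Both conditions must hold for a building to be dropped, so a building with small settlement but a steep slope still proceeds to Stage 2.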

Stage 2

In Stage 2, a damage rating relating to the severity of potential damage was assigned to each building. This overall rating was determined using a combination of a condition assessment and limiting tensile strain calculations. Firstly, a category of damage was assigned to each building based on the structure’s current condition. The inventory of photographs compiled was utilized to assess the current condition of each building in conjunction with damage scales proposed by Burland et al. (1977) (Table 2) and Laefer et al. (2008) (Tables 3-6). The damage scale proposed by Burland et al. (1977) relates visible external cracking of the building to a damage level. The various damage scales developed by Laefer et al. (2008) considered external parameters such as protruding or loose brickwork, replaced or repaired brickwork, damage due to exposure, and plant growth. The scores achieved on each scale were weighted accordingly and totaled. The combined score was then used to designate an overall damage category relating to the current condition. A separate analysis was then conducted, where a damage category was assigned to each building based on the limiting strain concept proposed by Burland (1977) and the risk chart subsequently developed by Boscardin and Cording (1989) as outlined in Table 1. The two damage category ratings obtained for each building were then combined, to achieve a damage estimation based on both the current condition and the potential settlement. Buildings which received a damage level of moderate or greater were further considered in Stage 3 of the process.
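The strain-based half of Stage 2 amounts to a lookup against the Table 1 bands. The Python sketch below shows one way to express it; it assumes the negligible band runs from 0 to 0.05% (so that it meets the next band), treats each upper bound as inclusive, and uses a hypothetical function name. How the strain rating is combined with the condition rating is not specified in enough detail to reproduce here.

```python
# Upper bound of each band in percent strain, with risk category and degree
# of damage, following Table 1 (after Boscardin and Cording, 1989).
TABLE_1_BANDS = [
    (0.05, "0", "Negligible"),
    (0.075, "1", "Very Slight"),
    (0.15, "2", "Slight"),
    (0.3, "3", "Moderate"),
]


def damage_category(strain_percent):
    """Return (risk category, degree of damage) for a limiting tensile strain.

    Assumption: a strain exactly on a boundary falls in the lower band.
    """
    if strain_percent < 0:
        raise ValueError("tensile strain cannot be negative")
    for upper, category, degree in TABLE_1_BANDS:
        if strain_percent <= upper:
            return category, degree
    return "4 to 5", "Severe to very severe"


if __name__ == "__main__":
    for strain in (0.02, 0.06, 0.1, 0.2, 0.5):  # hypothetical strains in %
        print(strain, damage_category(strain))
```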

Stage 3

Stage 3 of the assessment process involved the classification of the remaining buildings according to two architectural designators: land usage and architectural significance. Scales were developed (Tables 7 and 8) which weighted these parameters, with higher modifiers given to those deemed to be of greater significance in influencing potential damage. The architectural significance of the building was deemed to be of greater importance in assessing potential damage and was thus weighted twice that of land usage when the scores from Tables 7 and 8 were combined. The severity of potential damage was then predicted for each building based on the resulting combined score.

Results

In Stage 1 of the assessment process, 98 of the buildings in the study area (38%) were subjected to a predicted surface settlement of less than 10mm and a slope of less than 1/500, under greenfield conditions. They were therefore considered to be of negligible risk and were eliminated from further study. The remaining 161 buildings (62%) were further considered in Stage 2 of the assessment process (Fig. 1 (a)). In Stage 2, a damage rating relating to the severity of potential damage was assigned to each building. This overall rating was determined using a combination of a condition assessment and limiting tensile strain calculations. The condition assessment rating was obtained using a combined score from Tables 2-6. All of the buildings fell into a damage category of moderate or below: 33% of the 161 buildings in Stage 2 obtained a damage classification of negligible, 32% very slight, 26% slight, and 9% moderate. None were classified as possessing a severe or very severe risk of damage, based on their current condition. Limiting tensile strain calculations also assigned a rating based on Boscardin and Cording’s damage scale (Table 1): 21% of the buildings fell into the slight category, 37% the moderate category, and 42% the severe to very severe category.
These two ratings were then combined to give the overall damage category based on both the current condition and limiting tensile strain of the building. 100 of the buildings considered at Stage 2 were classified as being of very slight to slight risk of damage, and were eliminated from further study. 61 buildings were further identified for Stage 3 consideration (Fig. 1 (b)). At this point it was noted that clusters of buildings susceptible to damage began to emerge, most notably at either end of Grafton Street. Of particular concern


Risk Category | Degree of Damage | Limiting Tensile Strain (%)
0 | Negligible | 0 – 0.05
1 | Very Slight | 0.05 – 0.075
2 | Slight | 0.075 – 0.15
3 | Moderate | 0.15 – 0.3
4 to 5 | Severe to very severe | > 0.3

Table 1. Relationship between category of damage and limiting tensile strain (after Boscardin and Cording, 1989)

Risk Category | Degree of Damage | Description of Existing Damage | Approximate Crack Width (mm)
0 | Negligible | Hairline cracks |
1 | Very Slight | Fine cracks easily treated during normal decoration | 0.1-1
2 | Slight | Cracks easily filled. Several slight fractures inside building. Exterior cracks visible | 1-5
3 | Moderate | Cracks may require cutting out and patching. Doors and windows sticking. | 5-15 or a number of cracks greater than 3
4 | Severe | Extensive repair involving removal and replacement of walls, especially over doors and windows. Windows and door frames distort. Floor slopes noticeably. | 15-25 but also depends on number of cracks
5 | Very Severe | Major repair required involving partial or complete reconstruction. Danger of instability | Greater than 25 but depends on number of cracks

Table 2. Building Damage Classification (after Burland et al., 1977)


Risk Category | Degree of Damage | Description of Existing Damage
0 | Negligible | All bricks in the same plane
1 | Very Slight | A few bricks (1 – 3) are noticeably out of plane / Mortar appears to be loose/weak/missing around 1 – 3 bricks
2 | Slight | Overall, more than 5 bricks appear to be slightly out of plane / Gaps in mortar are more noticeable / Just perceptible difference in line of brick
3 | Moderate | Overall up to 10% of bricks are noticeably out of plane / Noticeable slope in masonry / Windows, lintels, doorframes etc. are noticeably tilted
4 | Severe | Overall, up to 15% of bricks are missing entirely / Noticeable outward bulge in the wall / Window lintels and doorframes are at an angle greater than 15 degrees
5 | Very Severe | More than 15% of bricks are missing entirely / Sections of the wall are on the verge of collapse / Repair work would require majority of wall to be rebuilt

Table 3. Protruding or Loose Brickwork (after Laefer et al., 2008)

Risk Category | Degree of Damage | Description of Existing Damage
0 | Negligible | None
1 | Very Slight | Brickwork was replaced as a result of filling a doorway or window.
2 | Slight | Replacement occurred in rarely occurring small clusters (i.e. 2-6) of bricks.
3 | Moderate | Replacement occurred in larger clusters (greater than 6)
4 | Severe | More than 10% of the wall is comprised of replaced brickwork
5 | Very Severe | More than 25% of the wall is comprised of replaced brickwork

Table 4. Replaced or Repaired Brickwork (after Laefer et al., 2008)


Risk Category | Degree of Damage | Description of Existing Damage
0 | Negligible | None
1 | Very Slight | Isolated, rarely occurring chipping (i.e. 1-3 bricks) / Lower perceptible damage of overall wall.
2 | Slight | Perceptible overall damage (weathering) of bricks in wall.
3 | Moderate | Numerous examples of significant damage i.e. greater than 5%
4 | Severe | Noticeable damage to greater than 15% of bricks in wall
5 | Very Severe | Greater than 25% of bricks are subjected to heavy chipping / spalling. Bricks are heavily eroded due to exposure

Table 5. Damage due to Exposure (after Laefer et al., 2008)

Risk Category | Degree of Damage | Description of Existing Damage
0 | Negligible | None
1 | Very Slight | One or two examples of weeds growing in typical places (i.e. top of chimney, ledge etc.)
2 | Slight | The weeds growing are more numerous as well as being more overgrown
3 | Moderate | Whole wall ensconced with vegetation
4 | Severe | Minor bush/tree growing out of masonry
5 | Very Severe | Major (fully grown) tree growing out of masonry

Table 6. Plant Growth (after Laefer et al., 2008)

are the buildings located at St. Stephen’s Green North, as this is in close proximity to where tunnel construction begins. The initial construction of the tunnel can often lead to increased settlements due to the engineer’s unfamiliarity with the soil conditions. These buildings were further considered in Stage 3 where a damage rating was assigned based on land usage and architectural significance (Tables 7 and 8). 95% of the buildings in Stage 3 were situated in a commercial/retail/business area or an area consisting of offices. 3% of the buildings were located in an educational/institutional/community/civic area and 2% in a residential area with mixed uses. 74% of the buildings were situated in an Architectural Conservation Area or a Conservation Area but were not Protected Structures. Only 23% of the buildings


Land usage Other Spaces, Recreational Uses

Modifier/ Weight used 1

Residential Areas/Uses

2

Carparks

2

Areas under Construction

2

Residential with Mixed Uses (Commercial, Retail, Offices)

3

Offices/employment Uses

4

Commercial, Retail, Business Uses

4

Educational, Institutional, Community, Civic Uses

5

Table 7. Land Usage Weighting System Category Neither a Protected Structure nor located in an Architectural Conservation Area or a Conservation Area Located in an Architectural Conservation Area

Modifier/ Weight used 1 2

Located in a Conservation Area

2

Protected Structure

4

Protected Structure located in a Conservation Area Protected Structure located in an Architectural Conservation Area

4 5

Table 8. Architectural Significance Weighting System were Protected Structures and situated in an Architectural Conservation Area, and 3% of the buildings were Protected Structures or Protected Structures situated in a Conservation Area, but not holding concurrent designations. While two main clusters of buildings were identified in Stage 2, the more detailed analysis of the Stage 3 assessment highlighted that the buildings grouped near College Green had a more critical damage level. In addition, 1- 3 St. Stephen’s Green, St. Teresa’s Church on Clarendon Street, 96-98 Grafton Street, 71 Grafton Street and 2 College Street were classified as being at a severe risk of damage (Fig. 1 (c)). Overall, 5.4% of the total buildings in this study were determined to be at risk of severe damage and thus, will most likely require mitigation measures, such as underpinning or injection grouting, to protect their structure. In addition, 20% of 271


the buildings may be at risk of moderate damage, and may also require mitigation measures.
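The Stage 3 weightings in Tables 7 and 8 can be read as two lookup tables. The following is a minimal sketch in Python, offered as an illustration rather than as the procedure used in the study. The dictionary and function names are hypothetical, and because this section does not state how the two modifiers are combined with the Stage 2 rating, the sketch only retrieves them.

```python
from typing import Tuple

# Weights transcribed from Table 7 (land usage). Names are illustrative only.
LAND_USE_WEIGHT = {
    "Other Spaces, Recreational Uses": 1,
    "Residential Areas/Uses": 2,
    "Carparks": 2,
    "Areas under Construction": 2,
    "Residential with Mixed Uses (Commercial, Retail, Offices)": 3,
    "Offices/Employment Uses": 4,
    "Commercial, Retail, Business Uses": 4,
    "Educational, Institutional, Community, Civic Uses": 5,
}

# Weights transcribed from Table 8 (architectural significance).
ARCH_SIGNIFICANCE_WEIGHT = {
    "Neither a Protected Structure nor located in an Architectural "
    "Conservation Area or a Conservation Area": 1,
    "Located in an Architectural Conservation Area": 2,
    "Located in a Conservation Area": 2,
    "Protected Structure": 4,
    "Protected Structure located in a Conservation Area": 4,
    "Protected Structure located in an Architectural Conservation Area": 5,
}


def stage3_modifiers(land_use: str, significance: str) -> Tuple[int, int]:
    """Return the (land usage, architectural significance) modifiers.

    Assumption: the caller supplies category labels exactly as they are
    written in Tables 7 and 8; an unknown label raises KeyError. How the two
    modifiers feed into the final damage rating is not specified here.
    """
    return LAND_USE_WEIGHT[land_use], ARCH_SIGNIFICANCE_WEIGHT[significance]
```

For instance, a shop in an Architectural Conservation Area that is not itself a Protected Structure would look up the "Commercial, Retail, Business Uses" and "Located in an Architectural Conservation Area" entries.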

Environmental Impact Statement

An Environmental Impact Statement (EIS), published by Ireland’s Railway Procurement Agency (RPA), analyzed the impact that Metro North will have on its surroundings. The EIS outlined an impact assessment which investigated tunneling-induced ground movements and their effect on nearby structures. The process adopted for the investigation was similar to the three-stage damage assessment process conducted in this study and dealt with a similar study area. The main difference is that the current condition of the buildings was not taken into account. The EIS notes five buildings to be at risk of severe damage as a result of tunneling: Trinity College Gate House; Bank of Ireland, College Green; Brown Thomas Department Store; St. Teresa’s Church, Clarendon Street; and the Gaiety Theatre.

Firstly, Trinity College Gate House and Bank of Ireland, College Green could not be analyzed in this study due to the large size and complexity of the buildings. The hand calculations used were insufficient to encompass the complexity of such buildings, and a finite element analysis program may need to be utilized to provide sufficiently accurate results. Two of the three remaining buildings considered at risk of severe damage by the EIS were eliminated in Stage 2 of this assessment. It is noted that these buildings are classified as protected structures. If this had been factored into the study at an earlier stage, these buildings may have received higher categories of damage and may not have been eliminated at Stage 2. St. Teresa’s Church, Clarendon Street was deemed to be at severe risk of damage by both reports. This indicates the severity of the potential damage that may be caused to this building as a result of tunneling, and also serves to confirm the validity of the three-stage assessment process adopted in this study. 1-3 St. Stephen’s Green, 96-98 Grafton Street, 71 Grafton Street and 2 College Street were additional buildings identified in this report, but not in the EIS, as being at a severe risk of damage. This shows that conducting a damage assessment without taking the current condition of the building into account may omit buildings that could potentially suffer severe damage resulting from their sub-standard condition.

Conclusions

Traditional damage assessment processes adopt an empirical approach based on settlement calculations to predict risk levels but neglect the current condition of the building. A new methodology was developed to integrate the current condition of a building into the traditional damage assessment process, for predicting risk levels due to subsurface construction. This construction may include tunneling, excavation or piling. New damage scales were developed in Stage 2 and Stage 3 of the assessment process to categorise the level of potential damage a building may be at risk of. This categorisation was performed in order to determine which buildings may be at risk, and subsequently which may require protective measures, as a result of tunnelling works. A flow chart was developed to provide a simple way of following the new methodology proposed in this report. In comparison with the Environmental Impact Statement produced by the RPA, a shortcoming was identified, as protected structures were not considered until the third stage. However, the identification of St. Teresa’s Church in both reports as being potentially susceptible to severe damage indicates that the process adopted here has validity. Four additional buildings were noted as being at risk of potential severe damage. This highlights the fact that the inclusion of the current condition assessment of a building identifies buildings which may have been omitted otherwise. Predicting such risk levels accurately is a vital component of the subsurface construction process, so that appropriate mitigation measures can be put in place, if required.

Fig. 1. (a) Buildings susceptible to damage after Stage 1; (b) Buildings susceptible to damage after Stage 2; (c) Damage Category rating after Stage 3.


ENGLISH PANEL

Panel Members
Prof. Elmer Kennedy (University of Ulster) – Chair
Prof. Nicholas Grene (Trinity College Dublin)
Prof. Peter Denman (NUI Maynooth)
Prof. Brian Caraher (Queen’s University Belfast)
Prof. Jan Jedrzejewski (University of Ulster)
Dr. Frankie Sewell (University of Ulster)
Dr. Anne Jamison (University of Ulster)
Dr. Willa Murphy (University of Ulster)

Judges’ commentary
This was a very impressive essay, analysing the treatment of the figure of Hal in two modern film adaptations of Shakespeare’s Henriad: Orson Welles’s Chimes at Midnight and Gus Van Sant’s My Own Private Idaho. The judges were particularly impressed by the clarity with which the author brought together modes of analysis appropriate for different genres of narrative – literary and filmic – to produce a thoroughly integrated and very persuasive analysis of his central thesis. There was also praise for the way the author used existing scholarship without ever becoming dependent on it, and for the essay’s sense of logic and unity that made it a serious piece of academic work rather than merely an exercise in literary/filmic criticism. In short, this was a highly original, technically well-informed, stylish and intellectually ambitious paper which stood out quite significantly in these respects from the other submissions.


ENGLISH

Tragedy in triumph: The lost paradises of Hal the Hypocrite

Tim Mc Inerney

“What wouldst thou think of me if I should weep?” asks a recumbent Prince of Wales[1] of his companion Ned Poins[2] in one of the most powerful scenes of Orson Welles’s acclaimed screen adaptation of William Shakespeare’s “Henriad,” Chimes at Midnight (1966). As the camera shifts without warning from a group scene to a startlingly intimate close-up sequence that alternates between the two men, the film’s infamously harsh audio distortion has the remarkable effect of adroitly conveying a deafening silence. Invasive, scrutinising, and even contemptuous, Poins’s somewhat sinister face fills the screen as he slowly articulates: “I would think thee a most princely hypocrite.”[3] It is perhaps fitting within the web of contradictions that characterises Shakespeare’s triad of Henry plays that the static sound quality of Orson Welles’s film version has a peculiar capacity to further accentuate this moment of intense clarity.[4] The effects of sudden cinematic tranquillity serve to transform this most crucial of transactions into a profound deliberation on the prince’s hypocrisy. Hal, who has finally arrived at that long foreshadowed moment when he must give up his old friends and bad habits in favour of the straight and narrow path of kingship, is being judged – if not accused – by the film, and its audience, of a hypocrisy that has haunted every shadowy aspect of his character. In fact, Welles’s cinematic treatment of this dialogue serves to graphically highlight a theme that runs solidly throughout Shakespeare’s Henry plays: the disquieting detail that “Hal schools himself in hypocrisy.”[5] From the Prince’s portentous declaration of “I know you all…” in the second scene of 1 Henry IV to his denunciation of “I know you not…” in the closing scenes of 2 Henry IV,[6] the conceit of his duplicity lends a pervasive sense of anxiety to the way we receive this most baffling of Shakespearean kings. As John Masefield scathingly asserted in 1911, “Prince Henry is not a hero, he is not a thinker, he is not even a friend; he is a common man whose incapacity for feeling enables him to change his habits whenever interest bids him… He makes a mockery of the drawer who gives him his whole little pennyworth of sugar.”[7]

[1] Played by Keith Baxter.
[2] Played by Tony Beckley.
[3] In Shakespeare’s original: 2 Henry IV, II.ii.
[4] In fact some critics (notably Michael Anderegg) have argued that “The poor sound quality of Chimes At Midnight was intentionally engineered and is an indication of Welles’s art…” Howlett ed., p. 151.

Thirty years on, the Prince would find himself once again on the receiving end of some rather relentless interrogation on screen. Gus Van Sant’s treatment of the Henriad in My Own Private Idaho (1991) dwells on Hal’s hypocrisy to such an extent that his character, represented in the film’s Scott Favour,[8] is portrayed as blossoming into an all-out villain. The treatment of the “Hal character” as a villain, or more importantly, a loser, is an acutely subversive element in the fibre of the original text, bound by all its superficial trappings of monarchical glorification. Born of a father who “addresses all rebels in terms that would be impeccable if only he and they could forget that he was once a rebel himself,”[9] even Hal’s dramatic genesis is riddled with hypocrisy.
He embodies a celebrated young beacon of royalty whose God-given destiny rises above his earthly shortcomings, while at the same time exemplifying the crude Machiavellian tendency towards manipulation in which this self-same destiny is undermined, satirised, and even openly ridiculed.[10] In a world of modern media where little credence is given to the divine right of kings, it is little wonder that Hal, having been interrogated by the gaze of Welles’s camera, should eventually come to be vilified by Van Sant’s screenplay. Likewise, in the hypocritical world of Machiavellian self-service, the end that justifies the means, it seems, is not always worth its own weight.

[5] Joseph McBride, ‘Chimes at Midnight’ in Gottesman’s Focus on Orson Welles, cited in Pilkington, p. 134.
[6] 1 Henry IV, I.ii.188, and 2 Henry IV, V.iv.48, respectively.
[7] Masefield’s ‘Hal – the Unheroic Hero’ (New York, 1911), cited in Sanderson ed., p. 245. His criticism perhaps demonstrates how Prince Hal’s hypocrisy has invoked passionate condemnation for at least the last century.
[8] Played by Keanu Reeves. It is interesting to note the significance of the name ‘Scott,’ considering the ethnic background of the northern rebels in 1 Henry IV.
[9] M.M. Reese, ‘Father and Son’, from The Cease of Majesty, cited in Sanderson ed., p. 230.
[10] The possibility that the play is deliberate satire is an ever-present element in the Henry IV plays. L.C. Knights notes that there are four separate accounts given in the first part on how Henry IV has unjustly usurped the throne, and points to the possible irony in the closing lines of the play: “And since this business so fair is done, Let us not leave till all our own be won” [1 Henry IV, V.v.43-4] (Knights’s emphasis), from ‘Henry IV as Satire’ in Eric Bentley ed., The Importance of Scrutiny, cited in Sanderson ed.

This essay will explore the extent to which Hal’s hypocrisy is expounded and represented in the films of both Welles and Van Sant. We will consider how the motifs of tragedy in triumph and hypocrisy in heroism are explored and explicated through the lens of twentieth-century cinematic production, while looking at the problematic effects on the historical hero when the triumphant assumption of kingship is framed within the context of irrevocable loss. We will investigate the treatment of Hal as a hypocritical ‘loser’ by way of three innovations that both films hold in common: the interpretation of the plays’ division of their action into different spheres that interact and compete for the audience’s attention and the reflection of these worlds in the plays’ main characters; the management of masculine ideology and the portrayal of relationships between men in this remarkably homosocial world; and finally the filmic treatment of time and space as a heuristic and representational mechanism. Above all, we will see how both films deal with the plays’ most disturbing implication: an inevitable loss of happiness; an unstable and nostalgically constructed sense of utopia that will, seemingly inevitably, come tumbling down. Behind the steely façade of a newly coronated Hal or a Scott Favour recently bequeathed a vast inheritance, there lies a world of lost happiness, personified in the heartbroken Falstaff. “Almost all serious stories in the world,” said Orson Welles of Chimes, “are stories of a failure with a death in it…but there is more lost paradise in them than defeat. To me, that’s the central theme in Western Culture: the lost paradise.”[11]

There is no denying that Prince Hal is a difficult character for an audience to empathise with, much less to valorise.
While Van Sant’s film greets its audience with a gloss of the term “narcoleptic,” referring to the medical condition of the central character Mike Waters,[12] both his and Welles’s representation of the young Henry cannot help but leave one feeling that it might be more appropriate to investigate the term “sociopath.”[13] While we may stop short of diagnosing the Prince of Wales with Antisocial Personality Disorder, it cannot be avoided that Hal’s cold, remorseless and utterly destructive actions are more than a little problematic. We are invited by the opening scenes of the play to see the young prince, though wayward and unresponsive to his duties, as a paradigm of untempered virtue who prophesies his own glorious rise to greatness. It remains a categorically complicated task, however, to portray Hal’s premeditated induction into the society of thieves and petty criminals – only to publicly denounce them to make more of his assumption of “virtue” – as anything less than unnecessary and indulgent, if not wantonly cruel. Indeed, it would take little more than a slight rephrasing of speech, or perhaps a more condemnatory title, to re-introduce Hal as a maniacal power-hungry monster, whose virtuous soul itself was the “act,” put forth to make his black heart all the more horrifying when it raised itself onto the throne. As it is, however, Hal is all the more of a conceptual problem because of his active involvement in each of his contradicting social worlds.[14]

[11] Welles, quoted in Juan Cobos and Miguel Rubio ed., ‘Welles and Falstaff: An Interview’ in Sight and Sound 35 (Autumn 1966), cited in Howlett, p. 152.
[12] Played by River Phoenix.
[13] Incidentally: Sociopath: Adj; Someone with a personality disorder manifesting itself chiefly in anti-social attitudes and behaviour. Hence sociopathic a.; sociopathy. www.oed.com. Antisocial Personality Disorder (APD): … “The essential feature for the diagnosis is a pervasive pattern of disregard for, and violation of, the rights of others that begins in childhood or early adolescence and continues into adulthood. Deceit and manipulation are considered essential features of the disorder.” From the American Psychiatric Association (1994) in Diagnostic and Statistical Manual of Mental Disorders. Washington, DC: American Psychiatric Association. Cited on www.wikipedia.org

Shakespeare’s division of 1 Henry IV into the separate actions of the castle, the tavern and the rebels is certainly rich fodder for the cinema screen. Welles’s dramatic discrepancy between the long tall stone interiors of the castle and the rotund, labyrinthine vaults of the Boar’s Head lends these spaces a fundamentally representational value. They immediately introduce the foundational image of “straight” versus “crooked,” “direct” versus “convoluted” and “linear” versus “cyclical,” that is to be made central in Hal’s early speech on the sun:[15]

I know you all, and will awhile uphold
The unyoked humour of your idleness;
Yet herein will I imitate the sun,
Who doth permit the base contagious clouds
To smother up his beauty from the world
That, when he please again to be himself,
Being wanted, he may be the more wondered at
…
I’ll so offend to make offence a skill,
Redeeming time when men least think I will.
(Emphasis added, 1 Henry IV I.ii.188–210)

This crucial speech, being as it is the play’s only direct explanation (or excuse) for Hal’s bizarre behaviour, is referenced consistently throughout the play.
Furthermore, the prince’s use of the word “imitate” implies that we are not as of yet privy to the prince’s true motivations, but are only witnessing another layer of his hypocritical methodology. Moreover, the fact that he uses the language of imitation alongside the play’s consistent word-play on sun/son inescapably recalls his father’s wishful fantasy in the first scene of 1 Henry IV that he is not his son at all, but a changeling impostor. Indeed, this speech figures Hal as asserting himself as an impostor in every social world he partakes in, imitating a member of the tavern community only so he can better imitate a prince of the castle, and thus signalling a basic hypocrisy as his sole means of social success.

[14] For the purposes of this essay we will leave the world of the rebels mostly undiscussed, due to the excision of Hotspur for the most part from Idaho.
[15] As Anthony Davies notes, the castle/tavern are also oppositions of the phallic/yonic; this is all the more significant when we consider the patriarchal nature of hereditary kingship and the pansexual energy of the non-linear tavern.

Without a doubt, Welles loses no time in bestowing the speech with its full representational weight. He places Hal outside the tavern, looking towards the high stone walls of the castle; he is only a few feet from a watching Falstaff (who, incidentally, is not present in the original text), but has his back firmly turned away from him. Nearby is a crooked tree, gnarled and not yet in bloom, but nonetheless striving towards the heavens. The sky above obediently enacts the image of obscured brightness as the sun filters, accordingly, through a thick and unremitting layer of cloud. By placing him so literally between the two main “worlds” of the film, Welles is in fact removing Hal from either space, disassociating him from a context so as to better consider him in isolation. This moment of transition thus establishes him both as a dramatic unifier and as a “non-member” of either of these divorced social spheres. Hal will, of course, be envisioned later in the film as basking in the full light of the sun beside an eminently upright and leafy tree, not long before Poins christens him a hypocrite.

Though Van Sant’s film pays a great deal of homage to Welles’s version, not least in this scene, the effect of representational transition in Chimes at Midnight is entirely abandoned in My Own Private Idaho. Van Sant’s parallel “worlds” are that of an abandoned hotel, with cavernous halls and endless passages much in the manner of Welles’s tavern, and a mahogany-panelled office space, the only place in which Scott’s father is ever seen. These opposing cinematic worlds, though clearly referencing those of Welles, come to work on a decidedly different dynamic. The mahogany office, as an influence on Hal’s fate, is hardly ever utilised in the film.
It appears only in a handful of static, fleeting scenes, which function more like an allegory of convention itself than a world of its own. Indeed, in true allegorical fashion, it is filled with trophies, achievements and images of the past, establishing itself as a realm of the linear and progressive. Scott’s father, too, is envisioned as little more than a signifier of impotence; Van Sant cuts through Welles’s ironic low-angle shots of the throned king in all his usurping guilt, physical vulnerability and regal insecurity, and places him, firmly subordinate, in a wheelchair upon which the camera looks down. Indeed, both throne and wheelchair as reifications of the king’s dependence and insecurity are made the main parodic focus in the mock interview scene[16] in both film versions.[17] The first works to represent the pompous fallacy of presumption, the second the ultimate ineffectuality that lies behind it.

Neither in the original text nor in its film adaptations, it seems, are these dramatic worlds on an equal standing. While the castle is made a mockery of in the tavern, the tavern is feared in the castle as an aggressive and intimidating threat. The king’s agents can be dismissed from the tavern by the prince with a simple wave of the hand; while Hal bids the Sheriff “so let me intreat you leave the house” (1 Henry IV, II.iv.500), Scott utters a not so eloquent “fuck you.” In marked contrast, Falstaff’s intrusion on the Castle world threatens fundamental subversion. While only social incongruity is utilised as a transgression in the original text, the final scenes of both Welles’s and Van Sant’s films portray this royal interruption as having the power to strike fear and revulsion into the tenuous core of conventional society. Moreover, Welles’s directorial decision to introduce the uncontrollable Falstaff at the moment of Hal’s coronation suggests that this kind of disruption has the potential to destabilise the entire patriarchal world order. Ironically, the tavern is the realm of power in this play. That Hal should denounce this unusual territory of dramatic authority within the play for one that has been held up to the audience as ridiculous and deluded enacts yet another of his falling hypocrisies.

[16] In the original text at 1 Henry IV, II.iv.364–468.
[17] This scene was cut from the final edit of My Own Private Idaho, being deemed too long and apparently irrelevant. It is available on the DVD’s supplemental deleted scenes.

The recognition of the Wellesian cinematic worlds and the subsequent shifting of their focus has a dramatic effect on Idaho’s Hal character. Scott Favour delivers the same basic “sun” speech as his predecessor, albeit in a surprisingly effective modern rhetoric (“I will impress them all the more,” muses Scott, “when such a fuck up, like me, turns good”). Unlike in Chimes, however, the scene neglects to reflect his spherical ambivalence, focussing more on his relationship with the sometimes-Falstaff character, Bob. Though his overseen soliloquy takes place outside the abandoned hotel that stands in for the tavern, he looks through an open doorway that leads only onto a busy road. There is no tree, and there is little concern with the act of growing straight (so to speak). In fact, Scott’s already incongruous three-piece suit makes much more explicit from the outset where his true dedications lie.
His deficiency of equivocal language, too, makes all the plainer his indication of future abandonment to the watching Bob, and undermines the possibilities of spherical ambivalence that are so characteristic in Welles’s Hal. In Scott, Van Sant delivers a Hal character whose emotional journey (quite unlike that of Welles’s Hal) the audience is not invited to share. The cinematic ruminations on his facial expressions, indecisions, hesitations and even bewilderment that characterised Welles’s prince are cleanly excised from Idaho. The audience is given instead a mostly two-dimensional and largely unreadable character that seems as intractable as his ruthless ambition. It follows that the great weight of ambiguity and complexity in Hal’s character is shifted onto the impenetrable Mike. As a herald of Falstaffian reaction and a paradigm of motiveless desire, it is only through him that the film invites us to consider Scott at all.

Though the film introduces a pantomime Falstaff figure in the guise of Bob (who is, to be sure, heralded with renaissance music whenever he enters a scene), the film’s representation of Falstaff himself is entirely fragmented. It is no coincidence that in the very first scene of the film, in which Mike is seen standing on an empty Idaho road, he is wearing a tradesman shirt labelled with the name “Bob” on the breast pocket, nor that the beer that the “tavern” community drink is itself labelled “Falstaff.” If Bob is Falstaff, then so is Mike; if Mike is Falstaff, then so is every crumbling dimension of the abandoned Portland hotel. The character of Mike – most often identified somewhat facilely with Ned Poins – actually takes on a momentous breadth of significance in the film. He works both as a subversive unifier of the disparate worlds, and as the essence of a world that is itself fragmented. His position as a hustler brings him into the same conventional sphere of hypocritical patriarchal standards as Scott’s father, while he remains firmly part of a liminal and untouchable social caste. While he often does fill the shoes of Poins during some of the more direct allusions to the original text, his inscrutable depth of character and fundamental ambiguity mean that he comes to represent the world that Scott must reject more than any other character in the film. It is in him that we see that enigmatic element of Falstaff that has Hal so captivated – an indescribable sense of camaraderie and mutual dependence, a deep urge to follow and conserve, perhaps most succinctly portrayed in Van Sant’s positioning of Scott and the sleeping Mike in a pastiche of Michelangelo’s Pietà early in the film.[18]

Certainly, Bob on his own has no direct equivalence to Welles’s Falstaff: he is pathetic where Welles’s character is wryly self-aware; he is explicit where Welles conceals a galaxy of unspoken complexity within his expressions and tones; and he is decidedly peripheral where Welles proudly dominates the narrative. In Idaho, Bob comes to represent, solely, the object of Hal’s conscious, mindful sadism. He is introduced in a distinctly subordinate manner: “he was fucking in love with me,” Scott tells Mike, and though he goes on to declare his paternal love for Bob as greater than that for his father and mother combined, his already asserted feelings for his father give this statement no particular depth. In fact, there is no one scene in Van Sant’s film wherein Scott’s love for Bob is made evident in any demonstrable way. Quite the opposite. Bob is figured as the victim of relentless and unremitting hatred.
“Somewhere along the line,” says Nigel Wood, “the abuse of Falstaff becomes the allure of the prince…”[19] The scene in which this second father is comically quizzed as to the presumed failure of the highway robbery succinctly captures the conceptual differences between the two representations of Falstaff. Welles’s Falstaff is worked up into a frenzy by his mocking friends, before being allowed the dignity of his own characteristic hypocrisy. Falling back into good humour, he claims to have known about the robbery coup all along, while Hal’s reaction and his mischievous expression explicate that this is simply a silent agreement between the two that allows him to save face, if only at a superficial level. Scott’s management of the same scene is certainly not an occasion of jovial mockery; indeed it might be better likened to bear-baiting. Bob’s decreasingly credible explanations lead him into a psychologically tortuous corner of inescapable taunts and jeers. The camera moves dizzily through the callous heckling of the entire “tavern” community, encircling a furious and impotent Bob, who is offered no such rhetorical relief as his Wellesian counterpart. His attestations to having known all along are weak and pitiable, and are not in the least bit humoured by the rest of the crowd.

[18] Another feature of this tableau is the Portland monument in the background that reads “The coming of the white man,” an idea which is expanded throughout the film (Native American chanting heard in the distance during desert scenes, for instance) as a signifier of arbitrary sophistication and conventional authority.
[19] Wood, p. 49.

In the mock-interview scene where Hal and Falstaff both imitate King Henry, the same cruelty can be distinguished. Instead of a medium for the complex relationship of criticism and admiration that characterises Welles’s Hal and Falstaff, Van Sant’s scene yet again becomes a vehicle to explicitly mediate the prince’s underlying hatred for his companion. As he takes the stage to enact his real father’s description of his adopted patriarch, his jocular digs descend into malicious humiliation. By the time he finishes the scene, there is no more semblance of pretence, and he is simply shouting obscenities at his crestfallen friend. This conflation of hatred and love is of course a central feature of the Hal character’s profound hypocrisy.

In Chimes the complex relationship between the men in the film is unquestionably given a great deal of attention.[20] As James Naremore has noted,[21] in a world where women are remarkably scarce, Welles’s tavern stands in as a sort of feminine bubble, Falstaff’s round, affectionate figure serving as a binary foil for the masculine duties of the castle. However it is conceptualised, the rapport between Hal and Falstaff is certainly not a kind of affiliation between men that is tolerated within the castle walls. It follows that the question of which sorts of male-male relationship society approves, and which are to be disposed of at all costs, has great resonance in both films. The aforementioned dialogue between Welles’s Poins and Hal is just one of many instances where it is suggested that they may, in fact, be lovers. The alternating close-ups of them looking knowingly into each other’s eyes certainly have the appearance of being far from innocent. As Ace G. Pilkington notes, Welles did not refrain from including even the most explicit of the play’s references to homosexuality in the original shooting script.
In noticing the servant of Master Shallow, for instance, Falstaff remarks sardonically: “Thus Davy serves you for good uses; he is your serving man and your husband.” To this Shallow replies: “When flesh is cheap and females dear, and lusty lads roam here and there so merrily, and even along so merrily!”[22] Undoubtedly, the character of Davy survives into the film as a paradigm of the post-war homosexual stereotype: near androgynous, lisping and mincing across the room in a devious, even conspiratorial fashion. The inclusion of this character imports a wealth of suspicion into the bawdy conduct of the tavern and gives a prominent voice to major implications of pansexual relationships. Indeed, Rubinstein notes that the original text utilises Davy (whose having offered Falstaff “a cup of wine sir?”[23] links him directly to the classical figures of Jove and his boy lover Ganymede the cup bearer) to suggest a sexual element in the relationship between Falstaff and Hal. Most notably, when Falstaff accosts the newly coronated Hal he declares: “My king! My Jove! I speak to thee my heart!” The language he uses, writes Rubinstein, “leaves little doubt about their relationship.”

[20] Robin Wood states fallaciously and rather oxymoronically, “it might be argued that a strong homosexual undercurrent runs through Welles’s work, the more potent perhaps for never being exposed to light.” Cited in Pilkington, p. 185, note 105. As we will see, the homosexual theme is quite overt in the film, as in the play – and has been “exposed to light” quite a bit since the film’s production.
[21] “In a sense…Falstaff acts both as a displaced mother, being associated throughout with softness, earthy affection, and nourishment… He is the figure who stands for the child’s need of love, intimacy, and human contact.” James Naremore, ‘The Magic World of Orson Welles’, cited in Davies, p. 127.
[22] 2 Henry IV, V.iii.56.
[23] 2 Henry IV, V.iii.53.

Whatever the inference, the implications of unconventional sexuality in Chimes are difficult to ignore. During a scene in which Hal is berating Falstaff from the rafters,[24] Welles shoots a curious sequence that has both Poins and the prince carousing with Doll Tearsheet, who just moments before had been exchanging words of love with Falstaff. Here too, the implications of orgiastic tendencies run side by side with an implication that Doll may merely stand in as a correlative for the desire that exists between the three men themselves. The suggestion of a sexual relationship between Hal, Poins and Falstaff has the power to intensify Hal’s abandonment on a grand scale. It implicates his dalliance with the seedier side of life in a whole web of personal and emotional involvement that questions both his base desires and the nature of his ambitious motivations. If Hal is giving up a lover, or even a sense of a utopian physical intimacy with men that he will evidently never know again, it adds a whole new dimension to the loss he will suffer in kingship.

The theme of homosexuality in both Bob and Mike as signifiers of a Falstaffian energy further fragments the spirals of power and pleasure that attract and repulse Hal/Scott. In Idaho, homosexual behaviour is figured as one of the main transgressions that constitute Scott as a “fuck up.” In this film too, Scott’s “homosexuality” is expounded as his most explicit centre of hypocrisy and self-delusion. “His presence on the streets,” writes Greenberg, “is provisional, a function of bristling oedipal revolt.”[25] If even.
In a dream-like sequence that depicts the Oregon hustlers as models on the covers of pornographic magazines – a grim rumination on the commercialisation, commodification and ultimate de-humanisation of prostitution – Scott’s position as a voluntary sex worker is incongruous and even odious. While Welles’s Hal’s condescension to the Boar’s Head seems to be met only with delighted acceptance, the underlying inference of Marie Antoinette playing milkmaid in Scott’s “slumming it” is explicitly criticised by the other hustlers in Idaho. Furthermore, Scott is not, as he asserts on numerous occasions, in fact homosexual. “It’s when you start doing things for free,” he says from the window of his magazine, “that you grow wings… grow wings and become a fairy.” Conversely, the character with whom the audience has been led to identify, the narcoleptic Mike, is positioned in direct contrast to Scott’s hypocrisy. His innocent, unrequited love is faced with a cynical, disparaging, and intractable sense of sexuality. “Two guys can’t love each other,” Scott informs his clearly besotted friend during the Idaho fireside scene. “Yeah,” Mike replies, pausing before he makes the film’s most poignant comment on the Hal character’s insidious hypocrisy: “…well, I dunno. I mean, for me, I could love someone if I, you know, wasn’t paid for it… I love you, and, you don’t pay me.” The cloud-obscured Scott is bringing his conventional sun-bright ideology into a dark world that he only frequents in order to make for a vivid contrast. Unsurprisingly, it has a devastating effect. Placing the conventional ideology that underlies his faux-rebellion alongside Mike’s simple and disinterested assertion of real emotional significance showcases both the ridiculousness of a conventional morality and the hypocrisy that is instrumental in its deployment. Indeed, it is Scott’s own heterosexually sanctified “sexual tourist”26 mentality that is equated in the film with paid-for sex. “The film thereby suggests that compulsory heterosexuality is a form of self prostitution for the sake of normalcy.”27 His abandonment of a sphere wherein he is loved unconditionally in favour of preconceived assumptions of normative social order exposes him as no better nor worse than the father he criticises so much, and just as ruinous. It is little surprise that this heterosexually conventional self-prostitution is exactly what Scott enacts with Carmella. In an interesting movement of Van Sant’s film, the war against the rebels in the original play – which is utilised by Welles as the moment of the young prince’s eventual beaming through the clouds of tavern life – is replaced by a long deliberation on Scott’s decided assumption of heterosexual normativity. Utilising the language comedy and wooing fodder of the closing scenes of Henry V,28 and replacing war-time France with rural Italy, this latest enactment of hypocrisy sees the fragmentation of worlds descend even further into obscurity.

24 In the original text at 2 Henry IV, II.iv: “Let’s beat him,” suggests Poins at 248, “before his whore.”
25 Greenberg, p. 24
“At this junction of Van Sant’s film, Mike is as much Hotspur as he is Poins, and the killing of Hotspur, robbing him of his youth, is here, as it is with Falstaff/Bob, a matter of breaking a heart.”29 Indeed, like Hotspur, Mike is sacrificed on the altar of Scott’s hypocritical assumption of normative, masculine convention; a spoil of the prince’s self-pleasing little game. All the more, it is at this point in the film, where Hal traditionally rises up from the gutter to bask in the abundant sunlight of patriarchal kingship, that Scott is most completely portrayed as the film’s villain. On receiving notice of what is presumably his father’s demise (in a taxi decorated with his conventional, entirely inheritable and cash valued queen) the film allows us to see in him a fleeting moment of remorse, only to be overtaken by a decidedly Machiavellian smirk that leads him directly to his next scene as a suit-clad ‘yuppie’ in a limousine. In his portrayal as villain, perhaps the last piece of paradise this Hal loses is the shredded vestiges of his own empathetic humanity. Finally, one of the greatest cinematic representations of Prince Hal’s losing by hypocrisy is to be found in the temporal arrangement of the films. Welles’s castle signifies succession and linear progression not just from father to son and from prince to king, but from king to death. It is in his assumption of the patriarchally sanctified roles of king/businessman that the Hal character’s central tragedy becomes most evident. In Welles’s film, Hal’s almost literal “becoming” his father – that deluded, lacklustre, easily duped usurper – is no great joy for the audience. Furthermore, having seen him ridiculed in the play within a play, the audience has been led by another audience on stage to actively disrespect this social position; to see it as pretentious, detached, presumptuous and above all – hypocritical. No matter how regal and materially wealthy he is, the audience is overtly invited to reflect on when Hal appears to have been most happy. The life before him seems devoid of any sense of self-affirmation in the present. The greatest part of King Henry’s concern throughout the play has been taken up with his own impending death, how he will be remembered and the linear progression of his line. The castle, it seems, has the power to accentuate mortality. Likewise, the scenes in Idaho that depict Scott’s father consistently assert his subjection to impermanence. Photographs of him as a young man are used as an almost comical contrast with the helpless old man in the office. His vulnerable physicality, exemplified by his being wheelchair-bound, is expounded graphically when the film allows us to hear the beating of his failing heart. The ultimate suggestion of both tropes is that social conformity holds death at its core. Conversely, a total conceptual subversion of linear time characterises both films’ depiction of the tavern world. Welles’s tavern is a temporal zone of the infinite present.

26 To borrow a term from Tony Howard, in Jackson ed., p. 309
27 Jonathan Goldberg, ‘Hal’s Desire, Shakespeare’s Idaho’ in Wood ed., p. 52
28 Interestingly, it is Mike’s mother who has taught Carmella English, putting her in the position of Katherine’s nursemaid Alice in Henry V
29 Ibid., p. 52
Falstaff is nearly defined as living “in a timeless, fleeting moment [that] precludes fulfilling future promises.”30 No one in the Boar’s Head is concerned about what is going to happen in the future, and when Falstaff is dying, far from dwelling on it, the community often refuses to acknowledge its possibility.31 Even the promise of Hal’s impending kingship is looked towards only as a means by which the community can continue to live as they already do. Indeed, it is Hal’s forcing of Falstaff into the world of the linear by way of his brutal denunciation, his “redeeming time,” that leads to his death at all. As Toliver notes, central to the Prince’s transition is his conceptual subversion of both his and Falstaff’s timelessness, thereby effectively, and all too knowingly, killing his friend. His rejection speech is rife with assertions of predominant linear time:

Presume not that I am the thing I was; …
When thou dost hear I am as I have been,
Approach me, and thou shalt be as thou wast
(2 Henry IV, V.v.60)32

30 Harold E. Toliver, ‘Falstaff, The Prince, and The History Play,’ from Shakespeare Quarterly XVI (Winter, 1965), in Sanderson ed., p. 177
31 Master Shallow’s feeble suggestion that Falstaff pay him back some money before he dies (2 Henry IV, V.v.84) could be read as a parody, if anything, of how much the tavern community, unlike their castle superiors, don’t prepare for the future.
32 Emphasis is Toliver’s, in Sanderson ed., p. 177



Idaho takes the cyclical nature of the tavern even further. Mike’s narcolepsy is used as a temporal frame for the film’s action in its entirety. His drifting in and out of sleep, dreams and memories undermine any sense of a dependable temporal linearity. Like the tavern community of Welles’s film, he and the other hustlers live hand-to-mouth, subsisting in a fragmented and deconceptualised sense of the present, itself represented in the constantly reappearing and never ending roads of the Idaho plains. Implicitly, he, like the whole tavern community, can never die. In Idaho, it is only the figurehead representation of Falstaff who succumbs to (appropriately symbolic) death. The double funeral scene at the end of the film offers two vastly differing images of death, which depend wholly on a subjection or rejection of a patriarchally defined time scheme. As Scott attends his father’s burial, having become, like Hal, an embodiment of the patriarch who he now must follow into the grave, the riotous funeral of the “tavern” community rises to drunken splendour in the background. The whole assemblage join in the lawless cries of “Bob!” neither in grief or lament, but in assertion of an irrepressible energy that they have all come to represent. While the superficial, caricatured and vilified signifier of Falstaff has been cast off as planned by the triumphant Scott, the enigmatic and magnetic appeal of the Falstaffian world, one which was the golden source of his own happiness has now been denied him forever. As Scott watches on, immobile and impotent, Hal has lost his little game. The price he must pay is his own immortality. The tragedy of Hal as hypocrite is the triumph of the traditional regal hero, with all his trappings of patriarchy, superiority, self preservation, anxiety and above all that linear spectre of impending death. 
In fact, that Hal loses so much in becoming a great and well-remembered king endures as the greatest hypocrisy of so many that riddle the plays – he is a perverse tragic hero parading himself as a valiant survivor in a heavenly office that the audience cannot help but feel whiffs all too much of Hell; his virtuous kingship is intrinsically linked with malicious humanity. Instead of falling from greatness, he rises to it. Instead of succumbing to a fatal error of judgement, he resists it. And instead of dying as King, he is reborn as Henry V. In doing so, he falls from greatness of a different kind: that of his own happiness; succumbs to the fatal error of conventional expectation; and dies as the rebellious, carefree Hal of the tavern. Indeed, the triumph of Hal’s role as an English king is showcased as being intrinsically linked with his tragedy as a human being. “History is enlarged here to make room for taverns and trollops and potations of sack and the heroic voice is modified by gigantic mockery, by the roared voice of truth.”33

33 Mark Van Doren, Shakespeare, cited in Milton Crane, ‘The Worlds of Prose and Verse in Henry IV Part 1’ in Sanderson ed., p. 350





GENETICS & MICROBIOLOGY PANEL

Panel members
Prof. Tony Kavanagh (Trinity College Dublin) – Chair
Dr. David McHugh (University College Dublin)
Prof. Kevin Devine (Trinity College Dublin)
Prof. Noel Lowndes (NUI Galway)
Dr. Emmeline Hill (University College Dublin)

Judges’ commentary
Our immune system protects us from infectious diseases by enabling recognition of a vast array of potential pathogens, mobilizing effective defence responses and retaining a long-term memory of the encounter. This “adaptive” aspect of the immune response prepares us for future challenges and enables an even faster and stronger protective response if the same pathogen is encountered again. Adaptive immunity critically depends on a gene shuffling mechanism that can generate, in each of us, an essentially unlimited repertoire of receptors that recognize specific pathogens and other antigens. It appeared suddenly, some 500 million years ago, during vertebrate evolution at the point of divergence of the more advanced “jawed” vertebrate lineage from the primitive “jawless” lineage. Various hypotheses have been advanced to explain how the adaptive immune system arose during evolutionary time. The most important of these, the so-called “immunological ‘big bang’ theory”, is discussed in the following essay.


Genetics & Microbiology

The evolution of the adaptive immune system Darren Fitzpatrick

Abstract

Approximately 500 million years ago a divergence event within Vertebrata occurred. This resulted in two classes of vertebrate: Agnatha, the more primitive of the two, and Gnathostomata. This divergence event marked both a speciation and the subsequent radiation of vertebrates. Somewhere within the estimated 500 million years since divergence, the adaptive immune system appeared. Enquiry into the origin of this system has opened disparate fields of investigation. This review focuses on the “big bang” theory, which is an attempt to bring coherency to the multifaceted field of evolutionary immunology.

Introduction

Two patterns of immune system function in jawed vertebrates have evolved: the innate system and the adaptive system. The latter, the focus of this review, theoretically enables an organism to respond to an infinitude of antigens. Much is known regarding the structural components and mechanism of action of the vertebrate adaptive immune system. It is defined by the presence of clonal antigen receptors (immunoglobulins, T-cell receptors), recombination-activating genes (RAG1/2), organised lymphoid tissue, class I and class II major histocompatibility complexes (MHC I/II) and immunological memory as an emergent property of clonal selection (Flajnik, 2004). Until recently, research on the presence of adaptive immunity in invertebrates has focused on sourcing evidence for structural and functional homologies/analogies to the vertebrate adaptive immune system. Such research has been invariably in vain (Flajnik, 2004).

Immune-like responses have been documented across taxa. In Protozoa, Amoebae show intolerance for the transplantation of a foreign nucleus and self recognition in the form of remerging of severed pods (Tartar, 1970). In Porifera, the most primitive of the Metazoa, both allografts and xenografts are rejected and the rate of rejection increases for second set grafts (Hildemann et al., 1979; Pfeifer et al., 1992). The increase in rate of rejection is implicitly analogous to immunological memory in vertebrates.

Fig. 1. Dendrogram showing evolutionary relationships between selected animal phyla and classes according to the current model (http://phylogeny.arizona.edu/tree/phylogeny.html). Speculative origin times of adaptive immune structures are indicated, as are documented histocompatibility systems. Divergence times are not to scale unless indicated. (Cited from Laird et al., 2000)

An elegant argument has been put forward by Vaclav Vetvicka and Petr Sima (Vetvicka and Sima, 1998) drawing a correlation between the evolution of complex structure, from single cells to multicellularity and organised tissue types, and the complexity of immune responses. In light of the role of lymphoid tissues in lymphocyte maturation, such an argument has merits on phenomenological grounds. Structure is indeed a “conditio sine qua non” for function. However, phenomenological conclusions derived from phenomenological premises are not adequate in determining scientific theories when mechanistic details have been omitted. Similarity of function does not translate as homology (Klein, 1997). Considering the search for homology/analogy to the adaptive immune system in jawed vertebrates, it is clear that this system is thought of as the paragon of excellence for adaptive immunity (Zasloff, 2002). Hauton and Smith have determined criteria, the fulfilment of which they argue is necessary prior to defining adaptive immunity in any species. The criteria were of course delineated a posteriori, i.e. deduced from an understanding of gnathostome adaptive immunity. These criteria I will quote:

(1) Clear, unambiguous and reproducible evidence of at least some specificity and memory that cannot be attributed to anything other than an active response on behalf of the host, (2) a description of the likely mechanism(s) underpinning the response, and (3) extensive experimental testing of these “new” hypotheses. (Hauton and Smith, 2007)

The significance of the many failures to determine homology between the jawed vertebrate system of adaptive immunity and the adaptive-like features of invertebrate immunity is such that no shift in thinking with regard to the origin of the “bona fide” adaptive immune system has occurred. The prevalent theory is still that adaptive immunity has its origins in the divergence of Agnatha (jawless vertebrates) from Gnathostomata (jawed vertebrates) during the Cambrian explosion (Forey and Janvier, 1993). This postulation is justified by the abrupt appearance of the Ig/TCR/MHC components (Fig. 1) in the most primitive of Gnathostomata, viz. Chondrichthyes (cartilaginous fish), and their absence in Agnatha (Marchalonis et al., 1998, Rast et al., 1997, Rast et al., 1994). The abrupt appearance of Ig/TCR/MHC components has been referred to as the immunological ‘Big Bang’ (Schluter et al., 1999). This will be discussed later. Throughout this review, reference will be made to adaptive immunity as it is currently understood both in Agnatha, in particular Petromyzon marinus, and in Chondrichthyes, in particular Carcharhinus leucas and Carcharhinus plumbeus. These species are used as models because of their phylogenetic antiquity. They are the most basal of the vertebrates. In the absence of Cambrian genetic material, the above species are the closest to the last common ancestor of Agnatha and Gnathostomata, the divergence of which is the point of origin of the RAG based system of adaptive immunity.

Somatic Diversification: adaptive immunity as rearrangement

Differential gene expression usually accounts for the phenotype of varying cell types and functions arising from intercellular genetic uniformity within a multicellular organism. The adaptive immune system operates contrary to this axiom of genetics. Genomic rearrangements via RAG-mediated recombination are responsible for the generation of diversity (GOD) of antigenic receptors and, as such, lymphocyte phenotype. An overview of the mechanism of somatic diversification as it occurs in Gnathostomata, and a comparison to the recently uncovered mechanism in the agnathan Petromyzon marinus (sea lamprey), will ensue. The necessary machinery for somatic diversification in jawed vertebrates is as follows: (V) variable, (D) diversity and (J) joining gene segments, the lymphocyte specific recombination activating enzymes (RAG 1, RAG 2), recombination signal sequences (RSS) and the regular DNA repair pathway (Market and Papavasiliou, 2003). Each of the V, D and J segments is flanked by an RSS to which RAG 1 binds (Fugmann et al., 2000). The RS sequences are composed of a heptamer, a nonamer and a spacer of either 12 or 23 bp. In vitro, the recombination reaction will not proceed in the absence of RS sequences, and it proceeds efficiently only between an RSS with a 12 bp spacer and an RSS with a 23 bp spacer, the 12/23 rule (Akira et al., 1987; Lewis, 1994).
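The structure of the RSS and the pairing constraint of the 12/23 rule can be illustrated with a minimal sketch. It is not drawn from the cited papers: the class `RSS` and the function `can_recombine` are hypothetical names introduced here, and the heptamer and nonamer strings are the commonly quoted consensus motifs, used only as placeholders.

```python
from dataclasses import dataclass

# Commonly quoted consensus motifs (assumption: used purely for illustration;
# real RSSs tolerate variation outside the most conserved positions).
CONSENSUS_HEPTAMER = "CACAGTG"
CONSENSUS_NONAMER = "ACAAAAACC"


@dataclass(frozen=True)
class RSS:
    """Hypothetical model of a recombination signal sequence."""
    heptamer: str
    spacer_length: int  # expected to be 12 or 23 bp
    nonamer: str

    def is_well_formed(self) -> bool:
        # A heptamer has 7 nt, a nonamer 9 nt, and the spacer is 12 or 23 bp.
        return (
            len(self.heptamer) == 7
            and len(self.nonamer) == 9
            and self.spacer_length in (12, 23)
        )


def can_recombine(a: RSS, b: RSS) -> bool:
    """12/23 rule: one partner carries a 12 bp spacer, the other a 23 bp spacer."""
    if not (a.is_well_formed() and b.is_well_formed()):
        return False
    return {a.spacer_length, b.spacer_length} == {12, 23}


rss_12 = RSS(CONSENSUS_HEPTAMER, 12, CONSENSUS_NONAMER)
rss_23 = RSS(CONSENSUS_HEPTAMER, 23, CONSENSUS_NONAMER)
```

Here `can_recombine(rss_12, rss_23)` is true while `can_recombine(rss_12, rss_12)` is false, which is all the rule asserts; the sketch says nothing about recombination efficiency or sequence tolerance.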


Fig. 2. Diagrammatic representation of RSS. The nucleotides represented in bold print are conserved. Changes in the other nucleotides are tolerated (Lee et al., 2003). (Diagram cited from Market and Papavasiliou, 2003)

The evolutionary significance of the RS sequence will be discussed later. Following the binding of RAG 1 to the RSS, RAG 2 is recruited and a site specific cleavage occurs between the V(D)J segments destined to become Igs or TCRs and the RS sequence (Fig. 3). Cleavage results in the exposure of a 3’-hydroxyl group on the coding region. The products of the reaction, resulting from a Mg²⁺-dependent nucleophilic attack on the target phosphodiester bond, are the coding joint (CJ), i.e. the somatically diversified exon, and a signal joint (SJ) (Agrawal, et al., 1998). CJs result from the opening of the hairpin and the addition of nucleotides by terminal deoxynucleotidyl transferase. Some mechanistic features of the reaction have been omitted, most significantly the modifications generating P nucleotides (Bassing et al., 2002). Such details are outside the scope of this review. The generation of diversity that differentiates adaptive immunity from innate immunity occurs at multiple points in the recombination reaction. The number of gene segments available for incorporation into the recombinant DNA results in combinatorial diversity; junctional diversity arises from insertions and deletions at the joining sites of gene segments and from the addition of non-templated nucleotides to the coding joints (Market and Papavasiliou, 2003, Bassing et al., 2003). The V(D)J segment then generated bestows a unique phenotype on the lymphocyte wherein the gene is expressed.

Based on a sequence of 35 genes, Agnatha were shown to comprise a monophyletic group (Takezaki, et al., 2003). Extant Agnatha comprise both Myxinoids (Hagfishes) and Petromyzontids (Lamprey).
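The arithmetic behind combinatorial diversity, as described for V(D)J recombination above, can be made concrete with a short worked sketch. The segment counts below are hypothetical round numbers chosen for illustration and are not taken from any of the cited sources; junctional diversity, which multiplies the repertoire much further, is deliberately left out.

```python
def combinatorial_diversity(n_v: int, n_j: int, n_d: int = 1) -> int:
    """Number of distinct V(D)J combinations for one receptor chain.

    Chains without D segments (e.g. light chains) use the default n_d = 1.
    """
    return n_v * n_d * n_j


# Hypothetical segment counts (assumptions, for illustration only).
heavy_chain = combinatorial_diversity(n_v=40, n_d=25, n_j=6)
light_chain = combinatorial_diversity(n_v=40, n_j=5)

# Pairing any heavy chain with any light chain multiplies the two figures.
paired_receptors = heavy_chain * light_chain

print(heavy_chain, light_chain, paired_receptors)
```

With these assumed counts a few dozen germline segments already yield over a million pairings, which is the sense in which rearrangement, rather than gene number, accounts for receptor diversity.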
The justification for utilising the lamprey as a model is the existence of synapomorphies between it and Gnathostomata (Takezaki, et al., 2003), thus enabling the elucidation of a more accurate picture of their last common ancestor. Immune responses are well documented in the lamprey. Immunization induced agglutinin proliferation (Marchalonis and Edelman, 1968) and accelerated rate of second set allograft rejection (Perey et al., 1968) are but two adaptive-like responses that provoked interest in lamprey immunology. Morphologically, the cells associated with these responses resemble gnathostome lymphocytes (Mayer et al., 2002); they proliferate on exposure to antigens and express the transcription factors PU.1/Spi-B and Ikaros, which have a role in lymphocyte differentiation in mammals (Shintani, 2000; Mayer et al., 2002). These data combined enabled Pancer et al. (2004) to hypothesise that these lymphocytes were more likely to express genes involved in adaptive immunity (Pancer, et al., 2004, Pancer and Cooper, 2006). Following an analysis of the transcriptome of these lymphocytes, variable lymphocyte receptors (VLR) based on leucine rich repeats (LRR) were uncovered. Genomic analysis revealed the presence of a single germ-line VLR gene, termed gVLR, situated between upstream and downstream LRR cassettes. PCR analysis served to verify the hypothesis that insertion of varying LRR modules was the source of somatic diversification in lamprey lymphocytes and thus responsible for the observed VLRs (Pancer, et al. 2004).

Fig. 3. Overview of V(D)J recombination. 12RSS and 23RSS are represented as black and white triangles, respectively, coding sequences as rectangles and proteins as shaded ovals (Fugmann, et al., 2000). The diagram illustrates the general mechanism of RAG recombination, i.e. RAG binding, synapsis, cleavage and generation of the resulting genomic rearrangement, the coding joint. (Diagram cited from Fugmann et al., 2000)

The genetic and evolutionary relation of agnathan lymphocytes and gnathostome lymphocytes suggests that the last common ancestor of Agnatha and Gnathostomata had lymphocytes or lymphocyte-like cells. The aforementioned mechanism of somatic diversification and lymphocyte clonal selection in Petromyzon marinus fulfils the criteria of Hauton and Smith. Lamprey have a mechanism of adaptive immunity. The consequence is that, during vertebrate evolution, two modes of immune system related somatic rearrangement evolved (Pancer and Cooper, 2006). Questions regarding selective pressures and variances therein result from this point. It must be emphasised that somatic rearrangement in the lamprey is not RAG dependent. RAG is unique to Gnathostomata, and enquiry into its mechanism of reaction, sequence, RSS dependency and inheritance has served to elucidate questions regarding the evolution of adaptive immunity in jawed vertebrates. RAG will occupy the next point of discussion.

Fig. 4. Comparison of RAG catalytic domain from Carcharhinus leucas to 186, P2, P22, E. coli Fim A and Fim B, transposons 2603 and 554, Human RAG 1 phages φ80, P1, λ and P4. The alignment was made using CLUSTALW and PRETTY. Black indicates amino acids similar to shark, human proteins are shaded and amino acids of neutral exchange value are boxed (Bernstein et al., 1996). (Diagram cited from Bernstein et al., 1996)

RAG: The Origin of a ‘Cut and Paste’ Complex

As discussed, the RAG 1/RAG 2 complex is responsible for the somatic rearrangement of germline V(D)J genes and the subsequent creation of unique Igs and TCRs during lymphocyte development. The reaction mediated by RAG is akin to a transposition reaction, i.e. it involves the translocation of genomic elements. RAG has similarity to transposable elements, viz. RAG 1/2 have a compact genomic organisation, they are syntenic and adjacent, are transcribed simultaneously and generally lack introns (Agrawal, et al., 1998; Gellert, M., 1996). The transposition-like features of RAG recombination are the Mg²⁺-dependent nucleophilic attack by the post-cleavage 3’-hydroxyl group on the target phosphodiester bond and also RAG’s continued synapsis (binding to the RSS) post-cleavage (Agrawal, et al., 2006). In vitro, RAG carries out legitimate intermolecular and intramolecular transpositions (Agrawal, et al., 2006; Hiom et al., 1998). The primary difference between RAG recombination in vivo and transposition reactions is the intra-molecular binding of the products. Transposition products are bound by the transposase to the target molecule whereas the recombinant signal joints in RAG recombination are ligated by the DNA repair pathway (Bogue and Roth, 1996). The similarity of RAG recombination to transposition has inspired research into the sequence similarity of the RAG genes and microbial transposases. A comparison of the RAG 1 catalytic domain sequence to recombinases from phages 186, P2, P22, E. coli Fim A and Fim B, transposons 2603 and 554, Human RAG 1 phages φ80, P1, λ and P4 respectively is shown (Fig. 4). In this sequence segment, the RAG proteins have 48% similarity and a 60% overall similarity (Bernstein et al., 1996). Similarly, RAG 2 was compared to the IHF proteins Him A and Him D from both E. coli and S. typhimurium (Fig. 5). The overall similarity is 44% and 43% for Him A and Him D, respectively. Most significantly, the similarity of the active domain, i.e. the N-terminal, is 84% (Bernstein et al., 1996). The conservation of functional regions in homologous proteins is a general occurrence. These homologies, when considered with the absence of RAG elements in Protostomata and Deuterostomata inclusive of Agnatha, support the hypothesis that RAG originated via the horizontal transfer of a transposable element into an already existing receptor gene (Marchalonis and Schluter, 1998; Schluter, et al 1999). It has been hypothesised that aberrant phylogenetic patterns concerning gene distribution may be indicative of a horizontal transfer event (Li, 1997). RAG complies with this. The insertion of transposable elements is a well documented phenomenon (Marchalonis et al., 1998; Zapata et al., 1990), e.g. >100,000 retrotransposons in the human genome (Smith and Riggs, 1996). Considering RAG’s dependency on the RS sequence, it has also been hypothesised that this sequence formed part of the primordial transposon (Thompson, 1995). The conservative nature of the RSS supports this. RAG proteins are also conserved across Gnathostomata; 90% similarity exists between the entire shark sequence and the corresponding human segments (Marchalonis and Schluter, 1998). This, coupled with the synteny of RAG1/2 and their inheritance as a cluster, is indicative of positive selection.

Fig. 5. Comparison of RAG 2 to Him A and Him D from both E. coli and S. typhimurium. Alignments made using CLUSTALW and PRETTY (Bernstein et al., 1996). (Diagram cited from Bernstein et al., 1996)
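To make plain what kind of quantity the percentage figures quoted above summarise, the following minimal sketch computes percent identity between two pre-aligned protein sequences. It is an illustration only: the function name and the toy sequences are hypothetical, and identity is a stricter measure than the "similarity" reported from the CLUSTALW alignments, which presumably also counts conservative substitutions.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of identical residues over aligned, ungapped columns.

    Both inputs must come from the same alignment, hence have equal length.
    Columns where either sequence has a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == gap or residue_b == gap:
            continue  # skip gapped columns
        compared += 1
        if residue_a == residue_b:
            matches += 1

    return 100.0 * matches / compared if compared else 0.0


# Toy aligned fragments (hypothetical, not real RAG or Him sequences).
toy_identity = percent_identity("MKT-LV", "MRTALV")
```

In the toy pair five columns are ungapped and four of them match. A similarity score would additionally require a table of residue groups or a substitution matrix, which is why identity and similarity percentages for the same alignment generally differ.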

The Ig Superfamily: Ancient and Modern

The Ig domain characteristic of the Ig Superfamily is present in many metazoan taxa outside Vertebrata, e.g. there are 38,000 known varieties of the Down Syndrome Cell Adhesion Molecule (Dscam) in Drosophila, and Ig domains are known to exist also in Porifera and Coelenterata (Rougen and Hobert, 2001). These Ig domains are IgV-like domains (Marchalonis et al. 2006). It is not known if these IgV-like molecules are the prototypes on which the “bona fide” IgV was based. The absence of a CDR3 antibody has led to the proposition that these are statistical accidents resulting from gene duplication (Marchalonis, et al., 2006). Further, comparison of TCR Vγ chain with IgV in man has led to the hypothesis that TCR Vγ is the primordial Ig in vertebrates (Richards and Nelson, 2000). The plausible suggestion that the Ig domain was present in the last common ancestor of Agnatha and Gnathostomata has been put forward (Schluter et al., 1999). The Ig molecules are thought to have undergone a period of intense selection. This has been described as their achievement of a “canonical” form (Marchalonis, et al. 1998). The conservation of the Ig domains supports this due to their functional role in dimeric stabilisation and their place as a foundation for antigen binding domains (Marchalonis, et al. 1998). It has been proposed that Igs underwent a second phase of diversification via the generation of orthologous genes (Marchalonis et al., 1998). This corresponds to the varying repertoire of Ig types across the vertebrate phyla: Ig A, D, E, G, M in Mammalia, Ig M, Y, A, D in Aves, Ig M, N in Osteichthyes and, as expected, only Ig M in Chondrichthyes (Weiser et al. 1969). The frequency of occurrence of Ig subtype within species is of no evolutionary significance as this is determined by host-parasite interaction post-ontogeny. Gene duplication has been called upon to explain V(D)J multiplicity. This hypothesis is dependent on the 2R hypothesis or the hypothesis of tandem duplication of individual genes or segments (Hughes and Friedman, 2003). Both duplication hypotheses are a point of contention. It is accepted that the larger the genome, the higher the probability of mutation. On the basis of this, and incorporating Marchalonis and Schluter’s (1998) notion that pathogen recognition is a by-product of adaptive immunity, it has been hypothesised that gene duplication operating in response to selection pressures (Kondrashov and Kondrashov, 2006) resulted in a self surveillance system designed to detect non-self and cancerous self (Rolff, 2007). In light of the possible role of DNA methylation in epigenetic silencing post-duplication (Rodin and Riggs, 2003) and also the role of methylation in cancer (Baylin and Ohm, 2006), Rolff (2007) hypothesises that an increase in the frequency of cancer may have selected for gene duplication, resulting in V(D)J multiplicity, an efficient self surveillance system and pathogen recognition as a by-product.

Piece by Piece: The Proposal of a Model

Firstly, as a point of semantics, it has been argued that all “inducible immune responses” are adaptive. Therefore, it was proposed that “combinatorial immunity” is a more apt description for what has hitherto been referred to as adaptive (Schluter et al., 1999). Considering that there is now a plethora of research being conducted into possible “adaptive” immune responses, the lamprey being one such example, I concur that pedantic and semantic improvements are necessary for the purpose of clarity. The model proposed by Marchalonis and Schluter (1998) is stochastic and thus in keeping with their immunological “big bang” theory (Schluter et al., 1999). The model has three components. These I will quote:

(a) the rapid generation phase,
(b) a rapid phase of decay or evolution under stringent selective conditions during which the generated immunoglobulin rapidly evolved to “canonical” form,
(c) a rate of evolutionary change consistent with those of other proteins.
(Marchalonis and Schluter, 1998).

Data for the above model was estimated using first order differential equations and the results were in keeping with Kimura’s proposition that evolutionary rate is proportional to quantity of conservation between two orthologous sequences (Kimura, 1969; Marchalonis and Schluter, 1998). The results to emerge from the model of differential equations proposed are of interest. They concern rate of protein evolution and the units are in substitutions per site per year. The rates determined are as follows: (a) 10⁻⁷, (b) 10⁻⁸ and (c) 10⁻⁹ (Marchalonis and Schluter, 1998). The decrease in rate corresponds with the “big bang” theory. However, the model assumes that phases (a) and (b) occurred within 20 million years. This time span is based on divergence times deduced from the fossil record, thus a +/- 15% margin of error is expected (Ayala, 1997). This may be a weakness in the model. The “big bang” model has become “de rigueur” of late, partly I assume because of its orderliness, simplicity and logic. On a less aesthetic note, advances in immunogenetics are slowly serving to corroborate the model (Flajnik, 2007). An alternative model, or rather description, has been proposed (Klein and Nikolaidis, 2004). This model advocates traditional gradual evolution. It argues firstly that the lymphocytes, as described in the lamprey, are not genuine. The basis of this is the homology and not orthology between the Spi and Ikaros transcription factors. Clarity regarding the exact functions of Spi and Ikaros in the lamprey will settle this. The model acknowledges the functional and structural similarity of RAG to transposons yet merely concludes that the transposon insertion hypothesis is controversial. No alternative is proposed.
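To give a feel for the three rates quoted for phases (a), (b) and (c), the short sketch below multiplies each rate by a fixed time window to obtain the expected number of substitutions per site. It is an illustration only and not the differential-equation model of Marchalonis and Schluter (1998): the constant-rate (linear) approximation, the 10-million-year window and the names `PHASE_RATES`, `WINDOW_YEARS` and `expected_substitutions` are all assumptions introduced here.

```python
# Rates quoted in the text, in substitutions per site per year.
PHASE_RATES = {"a": 1e-7, "b": 1e-8, "c": 1e-9}

# Hypothetical comparison window (an assumption for illustration only;
# the text states only that phases (a) and (b) together fit within 20 million years).
WINDOW_YEARS = 10_000_000


def expected_substitutions(rate_per_site_per_year: float, years: float) -> float:
    """Expected substitutions per site at a constant rate (simple linear approximation)."""
    return rate_per_site_per_year * years


# Expected substitutions per site for each phase rate over the same window.
per_phase = {
    phase: expected_substitutions(rate, WINDOW_YEARS)
    for phase, rate in PHASE_RATES.items()
}

if __name__ == "__main__":
    for phase, subs in per_phase.items():
        print(f"phase ({phase}): {subs:.2f} substitutions per site")
```

Under these assumptions each phase accumulates change ten times more slowly than the one before it, which is the sense in which the decreasing rates fit a “big bang” followed by stabilisation.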

Conclusion

In the absence of a credible alternative model to pose an adequate challenge, the “big bang” hypothesis has merit in that it connects disparate phenomena and merges them to formulate a logical idea. Based on the findings in the lamprey, it can be justifiably declared that lymphocytes were likely present in the last common ancestor of Agnatha and Gnathostomata, possibly a Placoderm (Schluter et al., 1999). The research previously described, which highlights the similarities between RAG recombination and transposase activity, is convincing. However, I contend that the range of microbes to which the RAG sequence compares favourably is perhaps too broad to conclude on the occurrence of a single transposition insertion event. Alternatively, transposase similarity in general, and as such phylogeny, may render it impossible to determine the exact nature of the insertion event. Gene duplication in general is controversial with regard to its mechanism of occurrence. However, not even Klein and Nikolaidis (2005) disagree that such an event had a role in the evolution of combinatorial immunity. I agree with the proposal of Rolff (2007) that cross-phyla analysis of cancer occurrence in Vertebrata may serve to show that gene duplication inadvertently selected for combinatorial immunity via selection for rigorous self-surveillance following genome expansion. Increased metabolic rate, as is characteristic of vertebrates, may also have a role akin to genome duplication due to the greater likelihood of mutation associated with higher nucleotide turnover (Rolff, 2007). This returns somewhat to the phenomenological argument of Vetvicka and Sima (1998). Increased metabolic rates resulting from the evolution of complexity, and the associated probability of mutation as proposed, provide a possible, although incomplete, mechanism for the evolution of defence reactions.
It seems startling that the combinatorial immune system may have evolved pathogen recognition ability as an aside to that which selection resulting from genome duplication and increased metabolic rate acted upon, i.e. increased self recognition. It would be facetious to conclude that combinatorial immunity was a lucky waste product. Clarity on the above points is required prior to the announcement of any conclusions. However, Occam’s razor demands simplicity and the “big bang” stochastic model is more parsimonious and coherent than the empty dismissal of Klein and Nikolaidis (2005). Should the “big bang” theory be further verified, and I contend that it will, it will serve to contravene Darwin’s gradualism. “Natura non facit saltum” may very well be an adage, not a maxim.



GEOGRAPHY PANEL

Judging Panel
Prof. Patrick O’Flanagan (University College Cork) – Chair
Dr. Úna Ní Chaoimh (University College Cork)
Dr. Denis Linehan (University College Cork)

Judges’ Comments
This is a powerful and comprehensive essay. It is well written, engaging and thoughtful. It seamlessly deals with a series of extremely complex issues and reduces them to a highly interesting and readable format. The essay meets all the criteria as set out in your ‘guidelines’. It is extremely original in its remit. It demonstrates no significant weaknesses and his management of his chosen topic exhibits clear intellectual excellence. These qualities set it above and beyond all the other essays and which its nearest rival sadly displays.


Geography

The Earth’s disciples: Geographers & the reinterpretation of Space in the 21st Century

Drew Reid

Introduction

“The only constant is change” (Heraclitus 540 - 480 BC)

As an interpretative study of the discipline of geography, this paper will not and cannot categorically answer the question as to whether geography is an academically integrated or divided subject. The directions taken in this field of study in terms of the physical and humanistic perspectives can be made intelligible and analysed but will still be at the mercy of contemporary ideas. To clarify, whatever interpretation is argued and presented in this paper will inevitably be a point of view that is a product of this individual’s scholarly experiences and personal elucidations that have in turn been influenced by the works of preceding geographic thought. In the context of this assignment, an opinion will be expressed and substantiated with academic sources that will offer an individualistic exegesis on the modernistic to post-modern trends in geographic thought. This paper will then suggest that the hemispherical nature of geography will progressively blend together into a more “fuzzy” subject area that will combine the more quantitative nature of physical geography with that of the more qualitative stream of human geography. With this perspective taken, an increasingly integrated science is proposed to emerge through technology and a clearer understanding of the global impacts a global society is having upon this fragile Earth.

Although there have been a number of paradigm shifts over the centuries in the field of geography, the dialogue between the spatial and the temporal has always managed to “fit in” with the thinking of the time. An example of this to be considered was the preponderance of validity Environmental Determinism gained during the late 19th century and early 20th century in Europe (Hartshorne, 1960, 56). Eminent human geographers such as Friedrich Ratzel, influenced by the Darwinistic theoretics of the time, and later Ellen Churchill Semple in the United States took ideas derived from the natural environment and evolution to steer geography along a path in line with the accepted scientific and socio-political perspectives of that era (ibid, 56-57). Using this example still, a retrospective view can see the faults and fallacies in the Deterministic model as an explanation for how we develop as a society, but the idea of evolution had, and still has, a valid point to make in the context of how geography as a discipline would advance. In this context, geography itself has evolved throughout history to the discipline it is today and will continue to do so in accordance with society’s dictates (Holt-Jensen, 1988, 83-84). This explanation is supported by Harvey (1973) and Widberg (1978), who argue that progression within the field of geography “reflects the special interests of those who are in control of the means of production” (ibid, 84). In the opinion of the authors, the direction that both human and physical geography have taken has been a result of the discipline becoming politicised. Therefore, the evolutionary pattern seen is not one where the variables of interest are “left to their own devices” but more along the lines of a directed evolution from exterior forces.
In the contemporary setting, this politicisation of geography is evident in Johnston’s (2002) observation that (geography) “departments compete for resources, including students, within and between universities” (Johnston, 2002, 133). With this contestation between the physical and human emerging, the consequential effects are a forced divergence from within the discipline itself as it engages in strategic self-interest and self-preservation. Historically, geography has gone through a number of dramatic shifts. These shifts have acted to splinter the subject into definable and specialised areas, which has popularly been accepted as the reason why the discipline has split (Hartshorne, 1960, 68-69). Although this paper will not provide an in-depth analysis of them, reference to them as pivotal moments cannot be ignored. The modern and post-modern developments are of unique importance here and will be incorporated into the dialectic model of development, and particular attention will be drawn to the 20th century, where four key paradigm shifts in the discipline were seen:

1. Environmental Determinism
2. Chorological or Regional Geography
3. The Quantitative Revolution
4. Critical Geography
(Holt-Jensen, 1988, Preface, 9)


The development of the discipline has seldom been harmonious. A dualism (physical and human) has emerged from the application of differing views on how the field of study should progress, be it separate or together (Harrison, 2004, 435). This separation is best exemplified by the approaches taken with regard to the objectivity (factual, measurable and quantifiable) of physical geography and the subjectivity (personal, interpretative and experiential) of human geography. However, the collaborative instances have resulted in the imbrication of human and physical geography. This overlap is desired in some instances to understand human-environmental phenomena, for example groundwater pollution as a result of intensive agricultural fertilisation of arable lands. Although, as stated earlier, geography has managed to “fit in” with the constructs of polity and society, it has done so through a metamorphosis of contrasting ideas and outside pressures. In understanding this, geography can be thought of as having a dialectical or contested arrangement between the physical and human aspects of the discipline. Harvey (1995) advocates this in saying “the world is inherently dialectical” and “the dialectic is simply one convenient set of assumptions or logic to represent certain aspects of physical, biological, and social processes” (Harvey, 1995, 10). To develop upon this idea of the dialectic, what follows is a conceptual model of how dialectics can be used to describe the direction, growth and mobility of geography.

Dialectic Model of Development

I. Everything is made out of opposing forces/opposing sides
II. Gradual changes lead to turning points, where one force overcomes the other
III. Change moves in spirals, not circles
(Rosser Jr, 1998, 5)

I. Everything is made out of opposing forces/opposing sides. In a geographic context this refers to both the quantitative vs. qualitative and the physical vs. human sides of the discipline shaping geography into a divided subject, as Stoddart (1987) would argue by making reference to the increased divisions that have arisen due to the specialisation of sub-disciplines in geography (Stoddart, 1987, 327-336). According to the author, this has led to a fragmentation of geography that has rendered communication between the two sides almost irrelevant. This notion of communication is raised by Bracken and Oughton (2006), who state that the oppositional aspects of geography comprise “differences in epistemologies, knowledges and methods; different ways of formulating research questions; differences in communication and a range of attitudes across disciplines” (Bracken et al, 2006, 372). According to the authors, and substantiated by Hartshorne (1960), geographic study has developed along divergent lines as “each division would develop its distinct methods of observation and study” (Hartshorne, 1960, 65). The oppositional standpoints, however, as specified by Johnston (2002), may also be a “source of strength” which then offers to the discipline a diverse, broad yet highly specialised pool of knowledge from which geographers on both sides can draw (Johnston, 2002, 139). It is with Johnston’s (2002) belief that this paper is concerned, as the development of geography had to diverge in order to re-integrate. The opposing approaches, namely the objective view of the physical geographer and the subjective view of the human geographer, can be argued to have furthered the holistic nature of geography (Holt-Jensen, 1988, 128). The counter-arguments in geography have therefore acted to encourage research that explains geographic phenomena, which assists in the development of the subject.

II. Gradual changes lead to turning points, where one force overcomes the other. Paradigm shifts in the discipline have resulted in a dominance of one hypothesis over the other. An example of this is the Quantitative Revolution of the 1960s superseding the Regional or Chorological theories of the 1920s. Regional geography was itself a turning point over Environmental Determinism, which came about as a result of a reaffirmation that the proper topic of geography was the study of places (regions). Regional geographers focused on the collection of descriptive information about places, as well as the proper methods for dividing the earth up into regions, and not on the characteristics of a place (especially human) being defined by environmental considerations (Livingstone, 1992, 353-354). The development of geography as a result of the Quantitative Revolution was a reaction by geographers of a physical geography background to return geography to the “sciences” (Gould, 1985, 35). Its timing was due to the advances made in technology at that time, which enabled great leaps to be made through the advent of new theoretical modelling previously unavailable (ibid, 36). The predominance of the Quantitative Revolution (Spatial Science) during this time was also a reaction by more “scientific” physical geographers to establish geography as a harder, more quantifiable discipline and move away from the “softer” image it had gained from the Chorological studies that had dominated geography previously (Dalgaard et al, 2003, 50-51). In this example, the idiographic (human orientated) advocates were in the minority and the nomothetic (science orientated) geographers were in the majority. This “revolution” brought with it spatial and temporal modelling, geometric measurement, statistical analysis and developed positivistic thought (Livingstone, 1992, 355-356) (www.abdn.ac.uk). Another shift occurred in direct reaction to the Quantitative Revolution.
Critical Geography arose as a riposte to positivism in the 1970s – 1990s as human geographers saw the need to de-emphasize the significance attributed to geographic phenomena as being objective and quantifiable in nature (Livingstone, 1992, 357-358). Using ideas derived from Phenomenology and Existentialism, importance was placed firmly on “meaning of place” and the subjective view of the world (Holloway et al, 2001, 65). These postmodern approaches to geography are still influencing geographic research today through geographies of exclusion, femininity and marginalisation. In returning to positivistic thought, the Quantitative Revolution was the birthplace of modern technological geographic sub-disciplines such as Remote Sensing (RS) and GIS, which are central to the core argument this paper proposes.

III. Change moves in spirals, not circles. This is fundamental to the pattern of development in geography, as change does not necessarily mean that an entirely new perspective substitutes a previously accepted theory; instead, progression can be made on one side of geographic thought which then furthers the breadth or depth of both fields of study. This is critical to the influence that one particular sub-discipline can have on research on both sides. Change can occur in highly specialised geographies such as Remote Sensing and GIS but can then penetrate and help develop other sub-disciplines at the opposite end of the geographic spectrum. An example of such would be the RS and GIS digital mapping of onchocerciasis (river blindness) outbreaks in Mozambique conducted in 2000 by Aircare International (www.cdc.gov). This had implications for the study of settlement dispersion and agricultural practices in the region, which had to be studied to ascertain their risk level from the disease. The settlements and their inhabitants had cultural and traditional ties to the river, such as fishing, which became the focus of human geographic study; hence a crossover between sub-disciplines was seen (ibid). The rapidity of change may depend on a number of factors such as funding and/or special interest by governments or other influential bodies which in turn prioritise one form of research over another (Bracken et al, 2006, 372). But this can then point those specialised subjects (RS and GIS) into new areas of research as demanded by humanistic studies (Thrift, 2002, 296). Using this example still, “computing will add an extra layer of flexibility and possibility to most social sciences and humanities research” (ibid, 296). With this, it is possible to see that common ground between physical and human geography can be reached, as the impacts of new methods of observation and interpretation provide a platform from which collaborative research can be initiated.
Historically and contemporaneously, the debate has focused on the continued divergence of human and physical geography as proposed by the likes of Livingstone (1992) and Stoddart (1987), but it also can be said that integration looks more likely than ever with the inter-disciplinary opportunities that the likes of RS and GIS can provide.

The Future of Geography

The chance for an integrated discipline in geography is not a mirage. In the current age of global Earth-observing satellite constellations and digital mapping techniques, the facilitation of data presentation and analysis of large and small scale phenomena is the present and, in this individual’s opinion, the future prospect of geography. Thrift (2002) highlights the importance of technology and the role it will play in fusing geography together by indicating that “in physical geography, we can see large-scale simulation becoming a way of life. But, in human geography, the possibilities have been counted to be less when they may actually be more” (Thrift, 2002, 296). The application of such technologies is only one half of the story. The development of Earth System Science (ESS) came about from the recognition that “geographers must consider multiple environments (natural, built, interactional, socio-cultural, and cognitive)” (Golledge, 2002, 2). ESS stresses the interconnectivity of systems on the Earth and “accepts that biophysical sciences and social sciences are equally important in any attempts to understand the state, and future of the Earth System” (Pitman, 2004, 137). The most prevalent example of interconnectivity amongst systems is the current issue of climate change. Through the use of Remote Sensing platforms (satellites, aircraft, buoys, ships and land based sensors) to monitor and record information, and GIS to spatially represent that data, greater knowledge and comprehension of the causes and effects of climate change can be gained. Climate change is the most prominent exemplification of an integrated geography as it exhibits “Human-Environment Relations (HER)” (Golledge, 2002, 2). The author goes on to say that “the HER integrative approach that has a natural home in geography has resurfaced as an important knowledge seeking procedure” (ibid, 3). As the atmosphere is the most dynamic natural environment on the Earth, high temporal frequency monitoring allows for the impact of all elements, both human and natural, on the atmosphere to be measured. Belbin (2002) states that “in modern times, geographers have specialised in different geographies of space-time such as geomorphology, hydrology, climatology, urban and rural geography, political geography, demographic geography and feminist geographies” to name but a few (www.scienceinafrica.co.za).
But these human and natural geographies are necessary to understand the complexities of the world as it functions as a system of integrated systems, each containing their own detailed environmental and social issues (www.scienceinafrica.co.za). This argument as proposed by Belbin is one that I agree with. Belbin (2002) adds to this by saying that the splintering of geography into its component parts has enabled the modern geographer to communicate more clearly with contemporary issues and that “the true geographer is still an integrationalist” (www.scienceinafrica.co.za). Belbin’s perspective is, in my view, wholly correct. How else can a geographer truly call him/herself a geographer without incorporating ideas from both sides of the discipline? It is accepted that future research will not engage upon an equally balanced measure of human and physical perspectives, but an integrated discipline must be embraced to some extent. In assigning cause and effect to either human interference on the landscape or vice versa for any given geographic event, ignoring that geography is a two-sided coin whose sides complement each other is ignoring what Belbin professes about being a true geographer. How will geography re-integrate itself? This paper proposes that at a time when global geographic phenomena such as climate change are now being fully appreciated and understood, global spatial monitoring can now effectively observe global systems such as the atmosphere at high temporal frequencies and be able to relay information that can identify the inter-relations between human and physical factors. Drawing on the theories of ESS, systematic observation has only relatively recently been able to develop through technological advances in observation techniques (Pitman, 2005, 139). The systems approach is holistic in nature and can be explained in a tri-conceptual framework as suggested by Arild Holt-Jensen (1988).

1. A set of fixed elements with variable characteristics: These are the structural aspects of the system, for example multi-scale settlements (rural and urban/developing and developed) and people, flora and fauna. These are the variables of interest that exhibit unique characteristics that also display connectivity between each other (Holt-Jensen, 1988, 141-142).

2. A set of connections between the elements in the system: These are the functional aspects of the system. The function describes the “series of flows (exchange relationships) that occupy the connections” (ibid, 142). An example of this would be food chains in an ecosystem as an interrelated network.

3. A set of connections between the elements in the system & its environment: These are the developmental aspects of the system. Development represents the structural and functional changes that may take place over time (ibid, 144). Climate change is a clear example of a temporally active phenomenon that is impacting upon the structural and functional aspects of the world system.

The integrative approach of ESS and the use of technologies such as RS and GIS are bridging the gap between the human and physical divides, as it is only now in contemporary academia that human-induced climatic phenomena can be observed and measured. Widely reported in the media, recent catastrophic geographic events like the “El Niño” (the replacement of cold upwelling water off the coasts of Ecuador and Peru by warm tropical water) have been attributed to rising carbon dioxide levels in the atmosphere from air pollution from industrialised countries. This affects weather systems globally and has far reaching effects in terms of prolonged droughts and bush fires (as experienced in Northern Australia and South-East Asia), flooding (as seen in Brazil, Chile and Northern Argentina) and milder or harsher than normal seasonal deviations in other parts of the world (www.pmel.noaa.gov). The “El Niño” example is here for a purpose, to fortify a point: human/physical interactions and their effects are relevant, present, and spatially and temporally active, and integrated understanding is the only possible way forward for geography to remain a viable discipline. Therefore the effects of structural human-induced pollution, deforestation, river damming, urbanisation and resource depletion can be linked to the humanities and to the “harder” sciences of physical geography as they are, in essence, bound together in a unified whole. As Magilligan (2004) states, “both physical geography and human geography have mutual interests in understanding the impact of human agency, and unravelling the role of human activity is an area of concern of each of the sub-disciplines” (Harrison et al, 2004, 438). Therefore the boundaries within geography will become less defined and, as introduced in this paper, a more “fuzzy” discipline is proposed to emerge.

Conclusion

Integration or division poses an open-ended question depending on where your loyalties lie. It has been proposed in this paper that an integrated discipline will develop through a new inclusive approach to geography in the guise of ESS, as Pitman (2005) puts forth. Crossovers between the quantitative and the qualitative, the human and the physical, can be made more intelligible through ESS and technology, and a better awareness of the validity each can offer the other, as Belbin (2002) proclaims, can lead to a prosperous and valuable discipline. Geography or “Earth writing” is, in essence, these two practices of the human and the physical that define the study of Geography. The complex environmental entity that is our Earth is home to the tangible and the spiritual, and focusing on the ecological interaction between players and events at a variety of spatial scales defines the core focus of the discipline. Pursuing geography for the sake of knowledge then enables geographers, no matter what their specialization, to help identify solutions to many of the world’s problems that undermine our well-being as only one of a number of species that inhabit this space, this place we call home. Geography is therefore not sitting in an old fashioned dusty corner of the classroom, fragmented and ready to be consigned to an academic dustbin, but can be a modern, unified and contributory discipline that can do away with the diverse exclusivity it has become known for. The arrangement of geography has to embrace the new opportunity it has been given, a second chance if you will. In reference to Heraclitus, the only certainty is change and as for the future “the geographer will never be the master of his fate; his very reason always progresses by leading him into the unknown and unforeseen where he learns new things” (Bird, 1979, 24).





HISTORY PANEL

Panel Members
Prof. Alan Sharp (University of Ulster) – Chair
Prof. Roy Foster (Oxford University)
Prof. Richard Aldous (University College Dublin)

Judges’ commentary
The Panel was impressed by the sophistication of intellectual approach of this work. Using an extremely broad and demanding range of sources and demonstrating a remarkable ease and familiarity with Victorian intellectual and scientific history, it reached judicious conclusions in a very well written and presented piece of work. We congratulate Ms. Rowe on an outstanding example of undergraduate history at its very best.



History

Kingsley’s natural selection: The significance of Darwin in his works

Abigail Rowe

The second edition of On the Origin of Species (John Murray, 1860) contains an unattributed sentence regarding religious reaction to the publication. Darwin writes:

I see no good reason why the views given in this volume should shock the religious feelings of any one. A celebrated author and divine has written to me that ‘he has gradually learnt to see that it is just as noble a conception of the Deity to believe that He created a few original forms capable of self-development into other and needful forms, as to believe that He required a fresh act of creation to supply the voids caused by the action of His laws.’1

The ‘celebrated author and divine’ in question is Charles Kingsley, cleric, author and amateur natural scientist. The fact that Darwin felt it expedient to include this quote immediately alerts the reader to the controversy that the first edition had provoked within the religious establishment. The impact of The Origin, in the years following its publication, was multilayered and complex, provoking strong reaction from the scientific and religious communities, becoming visible in the arts and in philosophy and, in essence, fundamentally and irrevocably changing humanity’s view of itself. This essay seeks to investigate the impact of Darwinian theory on the works of one man, Charles Kingsley, who not only straddles the worlds of religion and literature throughout his career, but also continually reveals his fascination with the science of the day. Kingsley’s prolific and unstintingly didactic works reveal a purported acceptance and assimilation of Darwin’s theory of natural selection, with no antagonism posited between scientific fact and religious faith. Nevertheless, on closer examination, Kingsley’s professed agreement becomes more tenuous and starts to appear somewhat selective. Given that Kingsley was lauded as being ‘on board’ by Darwin himself, this essay will seek to explore the veracity of this reputation. To assess this it is necessary first to evaluate why Kingsley’s support was important to Darwin, and also to contextualise the publication of The Origin alongside Kingsley’s existing canon. Darwin understood that his theory of natural selection to all intents and purposes removed divine providence from the understanding of the development of life. Set against a two-millennia-old account of natural history which was based on a providential plan, including the most contemporary thought regarding evolution and geology, the evidence laid out so accessibly in The Origin suggested to many that this foundational tenet of Western Christendom was erroneous. Whilst in The Origin Darwin studiously avoided the implications of his theory for the origin of mankind, he knew that this question was implicit in his text.2 As a man who avoided confrontation throughout his life, and expecting a polarised response to The Origin, he prepared himself for altercation. His nervous anticipation shows in the covering letters he sent to recipients of the complimentary copies, which reveal a self-deprecation to the point of seeming unconfident of his book.

1 Darwin, Charles, On the Origin of Species by means of Natural Selection, or the Preservation of Favoured Races in the Struggle for Life (John Murray, 1860, 2nd Ed., 2nd Issue) p. 481, retrieved 22 January 2009 from http://darwin-online.org.uk/content/frameset?viewtype=text&itemID=F376&pageseq=508.
For example, to his former mentor, Henslow, Darwin writes ‘my dear old master in Natural History; I fear... that you will not approve of your pupil in this case,’ but further reading seems to suggest that Darwin’s nervousness was not about the content of the book but more about the fact that he felt it was published before he had time to complete it: ‘the book in its present state does not show the amount of labour which I have bestowed on the subject.’3 To Jenyns, Henslow’s brother-in-law, entomologist and cleric, he writes ‘please remember that my book is only an abstract, and very much condensed,’ and goes on to say that he ‘may, of course, be egregiously wrong; but I cannot persuade myself that a theory which explains (as I think it certainly does) several large classes of facts, can be wholly wrong.’4 Two years before publication, Darwin had written to his cousin Fox that he was working very hard at my Book, perhaps too hard. It will be very big & I am become most deeply interested in the way facts fall into groups. I am like Crœsus overwhelmed with my riches in facts and I mean to 2 His preface refers to the work on human variation published by Dr D.C.Wells in 1813 and 1818. 3 Francis Darwin ed., The life and letters of Charles Darwin, including an autobiographical chapter (John Murray, London, 1887) Vol 2, p. 218. 4 ibid., Vol. 2, p.220.



make my Book as perfect as ever I can.5

With such assurance in the solidity of his evidence it may rather be this drive for perfection, coupled with the pressure to go to press and an understanding of the implications for the prevailing worldview, which explains the anxieties and equivocations present in his letters. His thoughts regarding the opinions of those who were not primarily men of science show that he was aware that popular acceptance of his theory would hold some sway over the establishment; talking of ‘intelligent men, accustomed to scientific argument, though not naturalists’ he says ‘it may seem absurd, but I think such men will drag after them those naturalists who have too firmly fixed in their heads that a species is an entity.’6 In the context of the time, ‘such men’ would include the influential gentlemen and noblemen amongst whom a lay interest in natural theology, geology, botany, entomology and zoology would be fairly common. Kingsley was one such man.7 Known to society primarily as an author of some note and a cleric of questioned stance, he was avocationally drawn to natural history.8 He was known for his interest and collected specimens for Gosse whilst in the West Country.9 Self-deprecating regarding his own scientific knowledge, Kingsley later was to write: I know very little about these matters, and cannot keep myself ‘au courant’ of new discoveries, save somewhat in Geology, and even in that I am no mineralogist, and palaeontologist. Science is grown too vast for my head.10

Notwithstanding this, the respect Kingsley received from the scientific establishment is clear.11 It is imperative, however, to interpret Kingsley’s scientific interest within the context of his lifelong understanding of nature.

5 Charles Darwin, Letter 2049 — Darwin, C. R. to Fox, W. D., 8 Feb [1857] retrieved 22 January 2009 from http://www.darwinproject.ac.uk/darwinletters/calendar/entry-2049.html. 6 Darwin, Francis, Letters, Vol. 2, p. 245. 7 Charles Kingsley, (1819-1875) cleric, author, professor, social reformer and amateur scientist, had much in common with Charles Darwin (1809-1882). Both were gentlemen, though Kingsley was born into a genteel poverty that Darwin would never know, and both followed the gentlemanly pursuits of geology, entomology, zoology and botany. Furthermore, both had expectations on them to enter the clergy; for Darwin, this was seen as an expedient career given his distaste for medicine, but for Kingsley it was a true vocation. See Chitty, Susan, The Beast and The Monk: A Life of Charles Kingsley, (London, 1974), Colloms, Brenda, Charles Kingsley The Lion of Eversley, (London, New York, 1975), Desmond, Adrian and Moore, James, Darwin, (Penguin, London, 1992). 8 Kingsley had previously come up against the establishment in his work with Chartist reformers and his prominent role in advocating Christian Socialism. See Colloms, Lion of Eversley, p. 174. 9 Indeed Gosse used Kingsley’s samples as the basis of a whole chapter in The Aquarium (J. Van Voorst, 1854), cited in Colloms, Lion of Eversley, p.185. 10 Fanny Kingsley, Charles Kingsley: His Letters and Memories of His Life, (London, 1884) p. 293. This is from a letter from Kingsley to Rev. F. Maurice written in 1869. 11 Hanawalt writes that ‘He... obtained acquaintance with men of science who were impressed by

His school friend from Helston Grammar School, Cowley Powles, writes: His passion was for natural science and art. With regard to the former, I think his zeal was led by strong religious feeling – a sense of the nearness of God in His works.12

Kingsley succinctly makes this explicit when, at age 17, he writes home to his mother saying ‘I am reading my Bible and my Paley;’13 to Kingsley, the two went hand in hand, yet he was well aware of the tension this necessitated in light of unfolding scientific evidence: ...for if in any age or country the God who seems to be revealed by Nature seems different from the God who is revealed by the then popular religion, then that God, and the religion which tells of that God, will gradually cease to be believed in.14

This is evident in his literary works, all of which feature a man of science in a leading role. His protagonist in the 1848 novel Yeast: A Problem, for example, avers that: My only Bible as yet is Bacon. I know that he is right, whoever is wrong. If that Hebrew Bible is to be believed by me, it must agree with what I know already from science.15

This shows clearly that Kingsley believed that scientific findings should bolster religious and theological understanding and not undermine it. Kingsley was one of the first to receive a copy of the first edition of The Origin, having been sent a preview by John Murray at the behest of Darwin himself. He was also one of the first to respond to Darwin, and his positive response, quoted above, was genuinely welcomed.

his zeal and his sincerity’. Mary Wheat Hanawalt, Charles Kingsley and Science, Studies in Philology, Vol. 34, No. 4 (Oct., 1937), p. 591. She cites Chambers, Darwin, Gray, Huxley, Miller, Wallace and White as such men. 12 Ibid., p. 8. 13 Ibid., p. 10. 14 Charles Kingsley, The Natural Theology of the Future, read at Sion College in 1871, retrieved 8th January 2009 from http://www.online-literature.com/charles-kingsley/scientific/7/. 15 Yeast: A Problem, (Savill and Edwards, London, 1859) p. 162. This Baconian influence is evident within Kingsley’s work; his application of ideas of perfectibility arises from Bacon’s linking of progress and providence; scientific progression by empirical methods being underscored by scripture. Kingsley cites Bacon in sermons calling him ‘the wisest of all mortal men since the time of Solomon’; sermon 11, Solomon, from The Water of Life and Other Sermons retrieved January 28th 2009 from http://www.online-literature.com/charles-kingsley/water-of-life/11/, see also The Fount of Science, Sermons on National Subjects, retrieved January 28th 2009 from http://www.online-literature.com/charles-kingsley/sermons-national/12/.

Darwin’s biographers, Desmond and Moore, portray Darwin as a man emotionally affected by the responses of others, taking ‘the knocks very personally’ when reactions were negative.16 Kingsley’s positive response was therefore, at the other end of the spectrum, hugely important to him. Desmond and Moore describe him as ecstatic.17 Kingsley’s Darwinian credentials grew as he assimilated the theory of natural selection into his work. His lack of fear of controversy, following his notoriety regarding his Chartist engagements, meant he was highly vocal on the matter. Huxley, also a man who relished the sparring that ensued, writes of him that he ‘is an excellent Darwinian’ and, with evident pleasure, repeats an anecdote in which Kingsley defends himself against an accusation of heresy levelled by Lady Aylesbury with the words ‘what can be more delightful to me Lady Aylesbury, than to know that your Ladyship and myself sprang from the same toadstool.’18 Indeed Huxley and Kingsley, sharing such characteristics, embracing Darwinism and going on to correspond in highly personal letters, became good friends.19 The correspondence between Kingsley and Darwin shows the mutual respect held between them. Darwin’s response to Kingsley’s effusive reaction – ‘I am much gratified by your kindness.— At any future time I shall be delighted to answer any objections as far as lies in my power, or to receive any suggestions’20 – typically seeks to further engage with his correspondent on the subject matter. Soon afterwards, during this period of initial reaction, Darwin writes of Kingsley’s response in letters to Lyell, Huxley, and Gray.21 This in itself reveals the significance that Darwin attached to Kingsley’s reaction. 
He continued to respect Kingsley’s scientific input to the extent that his Descent of Man quotes evidence provided by Kingsley: ‘the fishermen of Rochelle assert “that the males alone make the noise during the spawning-time; and that it is possible by imitating it, to take them without bait.”’22 Further correspondence shows Kingsley reporting discussions he encountered socially, regarding The Origin: We have just returned from Lord Ashburton’s at the Grange, where the Bishop of Oxford, the Duke of Argyle, and I have naturally talked much about you and your book... The Duke is calm, liberal, and 16 Adrian Desmond and James Moore, Darwin, (Penguin, London, 1992) p.492. 17 Ibid., p.477. 18 Huxley to F. Dyster, 29 February 1860, Thomas Huxley Papers, Imperial College of Science and Technology, London ,from Desmond, A., and Moore, J., Darwin, (London, 1991), p. 488. 19 Indeed Kingsley’s communication with Huxley is significant as Huxley was so often the public face of the controversy, dealing with the confrontations that so terrified Darwin. Given Darwin’s reliance on Huxley, and Huxley’s relationship with Kingsley, part of the reason that Kingsley was important to Darwin is therefore through Huxley. 20  Letter 2561 — Darwin, C. R. to Kingsley, Charles, 30 Nov [1859] retrieved 21 January 2009 from http://www.darwinproject.ac.uk/darwinletters/calendar/entry-2561.html. 21 Darwin, Francis, Letters, vol. 2, pp. 237, 282 and 287. 22 Attributed: The Rev. C. Kingsley, in ‘Nature,’ May, 1870, p. 40. in Chapter 12, Charles Darwin, The Descent of Man, (John Murray, 1871) retrieved 21 January 2009 from http://www.literature.org/authors/darwin-charles/the-descent-of-man/chapter-05.html.



ready to hear all reason; though puzzled, as every one must be, by a hundred new questions which you have opened... [there follows a discussion regarding the identification of birds]... My own view is—and I coolly stated it, fearless of consequences— that the specimen before us was only to be explained on your theory, and that cushat, stock dove, and blue rock, had been once all one species; and I found—to show how your views are steadily spreading—that of five or six men, only one regarded such a notion as absurd... 23

One can easily see that Kingsley’s correspondence must have been gratifying to Darwin when many clerics were denouncing his theories vociferously. For his own part, Kingsley genuinely seemed to enjoy the controversy and his role within it; in the preface to the fourth edition of Yeast: A Problem (1859) he writes: ‘provided the leaven of sound inductive science leaven the whole lump, what matter who sets it working?’24 His enthusiasm for the excitement surrounding the publication is visible in his correspondence. He writes to F.D.Maurice that the ‘state of the scientific mind is most curious; Darwin is conquering everywhere, and rushing in like a flood, by the mere force of truth and fact.’25 Both men thus can be seen to have reaped personal benefits from their growing correspondence. Kingsley’s work before the publication of The Origin was, for the most part, written for the moral and social improvement of the readership. It often contained scientific terminology along with an expression of Kingsley’s beliefs regarding the mutually inclusive relationship of science and religion. He was aware of the problems associated with intertwining what was often polarised. He can be seen to express this through his protagonist, Lancelot, in Yeast: A Problem: He had the unhappiest knack (as all geniuses have) of seeing connections, humorous or awful, between the most seemingly antipodal things... If he wrote a physical-science article, able editors asked him what the deuce a scrap of high-churchism did in the middle of it? If he took the same article to a high-church magazine, the editor could not commit himself to any theory which made the earth more than six thousand years old. ... 26

In 1849 Alton Locke was published. Essentially a novel about the need for social reform, Alton Locke embraces ideas of perfectibility and Kingsley’s beloved Chartist 23 Charles Kingsley, 1877, in Fanny E. Kingsley, ed. Charles Kingsley: his letters and memories of his life. (London: Henry S. King, 1877) Volume 2 pp. 135-6. 24 Charles Kingsley, Yeast, A Problem, 4th ed. 1859 retrieved 22 January 2009 from http://infomotions.com/etexts/gutenberg/dirs/1/0/3/6/10364/10364.htm. 25 Kingsley to F.D. Maurice in Kingsley, Fanny E., ed. Charles Kingsley: his letters and memories of his life. (London: Henry S. King, 1877) Volume 2 p. 175. 26  Yeast, pp. 255-6.



reform, whilst commenting on the extreme difficulty faced by anyone attempting social mobility.27 Although the novel predominantly expounds on these themes, it also brings into its narrative the idea of science and transformation. The feverish dream of the eponymous Locke in chapter 36 shows a lengthy and descriptive sequence of evolution occurring: Locke goes from being ‘the lowest point of created life; a madrepore rooted to the rock’ through many stages until he reaches humanity.28 Klaver writes that ‘the idea of successive creations on a perfect (divine) plan’ intrigued Kingsley, hence ‘it is not surprising that he should have tried to link the animal and spiritual in an even more comprehensive theory of successive development in which a notion of improvement stands central.’29 Further to Locke’s physical evolution, he then enters a period of spiritual development and improvement with a Mosaic episode; Kingsley here is explicitly connecting the evolved man with the providential plan of God for mankind. For Kingsley, science and religion are not only related and requisite parts of the whole, but mutually bolster each other. Thus mankind, the evolved savage, with the word of God, and his own good intention, is perfectible: You went forth in unconscious infancy--you shall return in thoughtful manhood.--You went forth in ignorance and need—you shall return in science and wealth, philosophy and art. You went forth with the world a wilderness before you--you shall return when it is a garden behind you. You went forth selfish-savages--you shall return as the brothers of the Son of God.30

In this novel, Kingsley, ten years before the publication of The Origin, is using the most contemporary theories regarding evolution, at least at an allegorical level, to express his beliefs regarding the improvement of mankind. The correlation between will and transformation that Kingsley shows here is integral to the work of Lamarck, for whom the intention to transform is a vital part of the process. Kingsley’s assimilation of Lamarckian evolution retains a moral direction which keeps mankind at the centre of the story. Within four years of the publication of The Origin, however, Kingsley had produced a work which directly draws on Darwin’s theory of natural selection, revealing his genuine understanding of the theory, and also containing entertaining satire on the reception of the book and the scientific establishment. This new novel, 27 See footnote 15. 28 Kingsley, Charles, Alton Locke, Ch. XXXVI, retrieved 23 January 2009 from http://www.gutenberg.org/dirs/etext05/8allk10.txt. 29 Jan Marten Ivo Klaver, Jean Paul, Carlyle and Kingsley: The Romantic Tradition in Alton Locke’s Dreamland, retrieved 12 January 2009 from http://www.ledonline.it/linguae/allegati/linguae0103klaver.pdf 30 Kingsley, Alton Locke, Chapter 36, retrieved 23 January 2009 from http://www.gutenberg.org/dirs/etext05/8allk10.txt.



The Water Babies (1863) also contained elements of the popular understanding of natural selection, which was to lead to a social Darwinism far from its originator’s intent.31 As in Alton Locke, the protagonist experiences a falling experience into a de-evolved state. Tom, on falling into the river, becomes a foetal aquatic being, with full consciousness. The book charts his progress, ultimately into a ‘great man of science... and all this from what he learnt when he was a water-baby, underneath the sea.’32 The transformation of Tom is the main narrative. As ‘a fairy tale,’ the reader is not required to seek for realism; likewise, in Alton Locke, Kingsley feels the need to set Locke’s transformation into a dream, in order to maintain the realist integrity of the novel. His placing of evolutionary sequences into unreal domains allows him full rein with its metaphorical application. In The Water Babies, having thus located the entire narrative, we shall see that Kingsley freely applies his understanding of Darwinism. As we have seen, by this time evolution was, for Kingsley, the scientific sine qua non: in The Water Babies his adoption of the new theory of natural selection is assimilated throughout. For example, the last of the Gairfowl represents a non-adaptive species approaching obsolescence. Living on her lone rock, awaiting extinction she bemoans the adapted species which will continue after her end: ‘they must all have wings, forsooth, now, every new upstart sort of bird, and fly.... In the days of my ancestors no birds ever thought of having wings, and did very well without; and now they all laugh at me because I keep to the good old fashion.’33

Shortly after this Tom encounters the hoodie crows which peck to death a ‘ladycrow’ who does not behave in the expected manner.34 This example of the competition in nature and the strongest of a species surviving is brutal and shows the recognition of what would come to be called ‘the survival of the fittest.’35 (Whilst this was popularly understood from The Origin, Darwin himself later argued that humanity’s hegemony was due to its developed readiness to form community bonds that carry the weak.36) Tom’s encounter with Mother Carey is an exposition of the creation of species not being an act of God; she tells Tom that she is ‘not going to trouble myself to make things, my little dear. I sit here and make them make themselves,’ which Tom recognises as more wonderful than merely making things.37

31 For example, Herbert Spencer, in 1862, writes that ‘though these positions are not enunciated in The Origin of Species, yet a common friend gives me reason to think that Mr. Darwin would coincide in them... in many cases a group of races, now easily distinguishable from one another, was originally one race... The civilized European departs more widely from the mammalian archetype than does the Australian’. Spencer, Herbert, First Principles, retrieved 22/01/09 from http://socserv.mcmaster.ca/~econ/ugcm/3ll3/spencer/firprin.html. 32 Charles Kingsley, The Water Babies, Chapter VIII, retrieved 15th January 2009 from http://www.gutenberg.org/dirs/etext97/wtrbs10h.htm. 33 Water Babies, Chapter VII. 34 ibid. 35 Herbert Spencer, Principles of Biology (Williams and Norgate, London, 1864), vol. 1, p. 444. 36 Charles Darwin, The Descent of Man, (John Murray, 1871) retrieved 21 January 2009 from http://www.literature.org/authors/darwin-charles/the-descent-of-man/chapter-05.html.

This entirely fits with Kingsley’s initial response to Darwin on reading The Origin. Furthermore, Kingsley addresses artificial selection, with the story of a fairy who created butterflies and comes to Mother Carey to boast of this. Mother Carey’s response is ‘that anyone can make things, if they will take time and trouble enough: but it is not every one who, like me, can make things make themselves.’38 With the thesis of a progression of the development of species came popular fears regarding the possibility of regression. This was touched upon in the preface to The Origin and later theorised by Darwin’s cousin Galton.39 The Water Babies includes a lengthy sequence containing a salutary warning of this possibility. The Doasyoulikes, a civilised and hardworking people, become lazy with their achievements and gradually become more simian until they were all dead and gone, by bad food and wild beasts and hunters; all except one tremendous old fellow with jaws like a jack, who stood full seven feet high; and M. Du Chaillu came up to him, and shot him, as he stood roaring and thumping his breast. And he remembered that his ancestors had once been men, and tried to say, “Am I not a man and a brother?” but had forgotten how to use his tongue; and then he had tried to call for a doctor, but he had forgotten the word for one. So all he said was “Ubboboo!” and died.40 Tom is told, by the fairy Mrs Doasyouwouldbedoneby, a moral persona of Mother Carey, that I can make beasts into men, by circumstance, and selection, and competition, and so forth ...if I can turn beasts into men, I can, by the same laws of circumstance, and selection, and competition, turn men into beasts.41

Kingsley thus again aligns the scientific with the moral, keeping mankind at the centre of the evolutionary narrative. Gillian Beer describes this use of Darwinian theory as a propensity to ‘colonise it with human meaning, to bring man back 37 Water Babies, Chapter VII. 38 ibid, Chapter VI. 39 Darwin, The Origin, preface, quoting Saint-Hilaire: ‘En résumé, l’observation des animaux sauvages démontre déjà la variabilité limité des espèces. Les expériences sur les animaux sauvages devenus domestiques, et sur les animaux domestiques redevenus sauvages, la démontrent plus clairement encore’, retrieved 23 January 2009 from http://www.literature.org/authors/darwin-charles/the-origin-of-species/preface.html. For Galton see Jean Gayon, Matthew Cobb, Darwinism’s Struggle for Survival: Heredity and the Hypothesis of Natural Selection (Cambridge University Press, 1998), pp. 147-178. 40 Water Babies, Chapter VI. 41 ibid.



to the centre of its intent.’42 Kingsley had, previous to the publication of The Origin, already done this in the Alton Locke evolution sequence, and maintained this anthropocentric stance in The Water Babies when applying the theory of natural selection. Contemporary reaction to this narrative sequence is worthy of note. The Anthropological Review of November 1863 recognises it as an application of ‘the Darwinian laws to the supposed “degradation” of the ape from the human species’ and feels ‘bound to point out that the great flaw in the Darwinian theory, which Professor Kingsley, to a certain extent, we believe, advocates, is admirably illustrated in this passage.’43 A tenuous argument as to why ‘the further transmutation of the scansorial man into the ape would have been rendered functionally unnecessary’ disregards the moral allegory of the tale, focussing only on the scientific. This is pertinent as it reveals that the critical reception of the novel was not limited to the literary establishment; this work was being judged on a scientific level. Kingsley was, of course, an Honorary Fellow of the Anthropological Society of London, whose members would be expected to follow his work, but we can see that even his children’s fiction gained the attention of the scientific community. The Doasyoulikes sequence still reveals, albeit in reverse, Kingsley employing a Rousseauian acceptance of the educable noble savage. Darwin also wrote firsthand to Kingsley about this: That is a grand & almost awful question on the genealogy of man to which you allude. It is not so awful & difficult to me, as it seems to be most, partly from familiarity & partly, I think, from having seen a good many Barbarians. I declare the thought, when I first saw in T. del Fuego a naked painted, shivering hideous savage, that my ancestors must have been somewhat similar beings, was at that time as revolting to me, nay more revolting than my present belief that an incomparably more remote ancestor was a hairy beast.44

His experiences with the Fuegians had forced him to consider this in depth. One Fuegian who had spent time in England and become, to the Victorian mind, civilised, only to ‘regress’ to his former behaviour on returning, made a particular impact on Darwin: we could not recognise poor Jemmy. It was quite painful to behold him; thin, pale, & without a remnant of clothes, excepting a bit of blanket round his waist: his hair, hanging over his shoulders; & so 42 Gillian Beer, Darwin’s Plots, Evolutionary Narrative in Darwin, George Eliot and Nineteenth-Century Fiction, (Cambridge, 2000) p. 7. 43 Anthropological Review, vol. 1, No 3 (Nov., 1863) p. 474. 44 Letter 3439 — Darwin, C. R. to Kingsley, Charles, 6 Feb [1862] retrieved 22 January 2009 from http://www.darwinproject.ac.uk/darwinletters/calendar/entry-3439.html.



ashamed of himself, he turned his back to the ship as the canoe approached. When he left us he was very fat, & so particular about his clothese, that he was always afraid of even dirtying his shoes; scarcely ever without gloves & his hair neatly cut. — I never saw so complete & grievous a change.45

From the fact that Jemmy had been able to become Anglicised, Darwin recognised that all men are the same, and mused on this: Although essentially the same creature, how little must the mind of one of these beings resemble that of an educated man. What a scale of improvement is comprehended between the faculties of a Fuegian savage & a Sir Isaac Newton — Whence have these people come?46

Kingsley shows the opposite of such thought in The Water Babies, in his descriptions of the Irish gillie. When describing Dennis’ propensity for telling ‘your honour’ exactly what he wants to hear, Kingsley describes the gillie as simple, rather than expressing an understanding of the socio-political circumstances that render Dennis vulnerable to his employer: instead of being angry with him, you must remember that he is a poor Paddy, and knows no better; so you must just burst out laughing; and then he will burst out laughing too, and slave for you, and trot about after you, and show you good sport if he can—for he is an affectionate fellow, and as fond of sport as you are—and if he can’t, tell you fibs instead, a hundred an hour; and wonder all the while why poor ould Ireland does not prosper like England and Scotland, and some other places, where folk have taken up a ridiculous fancy that honesty is the best policy.

This representation of the Irish as comparatively uncivilised descends into evolutionary terms in a letter that Kingsley sent to his wife when visiting Ireland in 1860, in which he famously describes the Irish as ‘human chimpanzees’ and is unable to associate this with the economic effect of British rule. He says that he believes that ‘they are happier, better, more comfortably fed under our rule than they ever were.’47 The following day he writes of his distress at the post-famine devastation to whole villages, but in this case he excuses this on economic grounds and says nothing to suggest, in his usual mould, that the circumstances may have produced the 45 Charles Darwin, Beagle diary (1831-1836) retrieved January 15 2009 from http://darwin-online.org.uk/content/frameset?viewtype=text&itemID=EHBeagleDiary&keywords=fuegian&pageseq=430. 46 ibid, see also Descent, chapters 7 and 21. 47 Kingsley, Letters, p.236.



character. This blind spot is notable and shows a variance with Darwin’s thoughts regarding the Fuegians, and indeed, Darwin later questions the theorising of Irish inferiority as expounded by Greg and Galton.48 Kingsley, like many Victorians, associates less ‘civilised’ with less evolved. In an 1862 letter he writes to Darwin, discussing a hypothesis regarding missing links and states ‘we are not niggers, who can coexist till the 19th century with gorillas a few miles off.’49 This is implicit in The Water Babies, when the salmon is speaking to Tom about trout: My dear, we do not even mention them, if we can help it; for I am sorry to say they are relations of ours who do us no credit. A great many years ago they were just like us: but they were so lazy, and cowardly, and greedy, that instead of going down to the sea every year to see the world and grow strong and fat, they chose to stay and poke about in the little streams and eat worms and grubs; and they are very properly punished for it; for they have grown ugly and brown and spotted and small.50

In a lecture given in 1871, Kingsley does discuss popular fears about race and explicitly avers his agreement with Darwin: Next, as to Race. Some persons now have a nervous fear of that word, and of allowing any importance to difference of races. Some dislike it, because they think that it endangers the modern notions of democratic equality. Others because they fear that it may be proved that the negro is not a man and a brother. I think the fears of both parties groundless. As for the negro, I not only believe him to be of the same race as myself, but that--if Mr. Darwin’s theories are true--science has proved that he must be such. I should have thought, as a humble student of such questions, that the one fact of the unique distribution of the hair in all races of human beings, was full moral proof that they had all had one common ancestor.51

Given the eleven-year span of time since his bestialisation of the Irish, one may be tempted to draw from this that he no longer subscribed to his earlier views. However, within the same lecture, his continual extolling of the Englishman as the pinnacle of humanity leaves an ambiguity, or even a dissonance, in his opinion. 48 Darwin, Descent, chapter 5. 49 Letter 3426 — Kingsley, Charles to Darwin, C. R., 31 Jan 1862, retrieved 22 January 2009 from http://www.darwinproject.ac.uk/darwinletters/calendar/entry-3426.html. 50 Water Babies, Chapter III. 51 Charles Kingsley, The Natural Theology of the Future, read at Sion College in 1871, retrieved 8th January 2009 from http://www.online-literature.com/charles-kingsley/scientific/7/.



This dissonance is reflected primarily in the difference between his private correspondence and the opinions he presents in the public sphere. The free use of simian imagery, seen in letters to his wife and to Darwin, is not made explicit in his published work. His advocacy of Darwinian theory in general, however, is explicitly revealed in his preaching. In his famous sermon ‘The Shaking of the Heavens and the Earth,’ Kingsley lists prior times when the foundations of worldview and belief had been shaken, both theological and scientific, and aligns Darwinism with these pivotal points in history: The Copernican system shook them, when it told men that the earth was but a tiny globular planet revolving round the sun. Geology shook them, when it told men that the earth has endured for countless ages, during which whole continents have been submerged, whole seas become dry land, again and again. Even now the heavens and the earth are being shaken by researches into the antiquity of the human race, and into the origin and the mutability of species, which, issue in what results they may, will shake for us, meanwhile, theories which are venerable with the authority of nearly eighteen hundred years, and of almost every great Doctor since St. Augustine.52

His message, in this exhorting and emotive sermon, is drawn from his text, ‘Yet once more, signifieth the removing of those things that are shaken, as of things that are made, that those things which cannot be shaken may remain’ (Heb 12:26-7): that which science does not shake, that is, God himself, will remain solid. He continually reiterates the compatibility of science, and in particular Darwinism, with religion and theology, and advocates an embracing of science. Interestingly, he prefaces his 1874 publication of Westminster Sermons with the lecture quoted above, ‘The Natural Theology of the Future.’53 Kingsley clearly intends that his scientific lectures be considered alongside his sermons. Again we see that his religion and his science remain inextricable. His body of sermons and lectures shows exactly the same acceptance of Darwinian theory as his novels. He remains keen to promote scientific advancement, and specifically Darwinism, in every forum to which he has access.

Thus we see that Kingsley was indeed an ardent advocate of Darwinism. He became of personal importance to Darwin partly because of Darwin’s susceptibility to the opinions of others and loathing of confrontation, and partly due to Kingsley’s established status and reputation. An eminent cleric such as Kingsley gave Darwin a significant validation within the ecclesiastical establishment, a sphere which was for the most part hostile to Darwin, yet mattered to him.

Nonetheless, as a promulgator of Darwinian theory, Kingsley notably retains a providential view of natural history, with consistent reference to elements of Lamarck and Paley: he never moves away from either the intentional will of the species to change or a complete acceptance of an integral theistic greater plan. Thus, natural selection is continually presented by Kingsley as equivalent to providential history, and, further, each is depicted as symbiotically bolstering the other. His initial reaction to The Origin, encapsulated in his first letter to Darwin, remains his enduring stance, staying true to the sentiments so welcomed by Darwin in 1859, yet selectively assimilating elements which maintain his worldview. The specific application of Darwinism within Kingsley’s work is likewise selective, showing signs of following the popular interpretations of Darwinism rather than the work actually proposed by the man himself, as, for example, we saw with his treatment of the Irish in The Water Babies and in his private letters to his wife.

That being said, Kingsley was recognised by both followers and opponents of the theory as a Darwinian. His readiness to accept the theory of natural selection and his enthusiasm in promulgating it are clear from his personal writings and his published works. Yet had the theory not been published, it is clear that Kingsley would have continued to integrate the latest scientific theories within his work: the same application of contemporary scientific findings is seen in Yeast and Alton Locke as is found in The Water Babies. It is the theories which have shifted, not Kingsley’s use of them. Science is also highly visible in his sermons, with Darwinian theory plainly presented as a new and pivotal step in the incremental development of science since Copernicus. The mutual relationship of science and religion is one of the driving forces in works spanning his entire career.

The evidence presented by Darwin compelled and excited Kingsley. The highly public reception of The Origin fed Kingsley’s natural bent towards both controversial debate and championing a cause. All these elements combined proved compelling for Kingsley, who remained unwaveringly proud to have his name associated with the theory proposed in The Origin.

52 Charles Kingsley, The Shaking of The Heavens and The Earth, retrieved 23 January 2009 from http://www.online-literature.com/charles-kingsley/water-of-life/6/.
53 Charles Kingsley, Westminster Sermons (London: Macmillan, 1874).



IRISH PANEL

Judging panel
Dr Regina Uí Chollatáin (UCD) - Chair
Dr Pádraigín Riggs (UCC)
Dr Brian Ó Catháin (NUIM)

Due to space constraints the second winner in this category has been omitted from this journal. However, the essay is available online at www.uaireland.com

Judges’ Comments

Winner 1

Innéacs cuimsitheach, eolgaiseach a chlúdaíonn bailiúchán amhrán a rinne Cosslett Ó Cuinn i nGabhla, i dToraigh agus in Árainn Mhór atá anseo. Ábhar taighde bunaidh ar ardchaighdeán é a léiríonn tuiscint an-mhaith ar an ábhar féin, ar na foinsí béaloidis agus ar thábhacht na hoibre seo i gcaomhnú thraidisiún luachmhar na n-amhrán Gaeilge. Mar aon le treoir don léitheoir, tugann réamhrá an innéacs eolas ar chúlra Uí Chuinn agus ar a chuid oibre ar amhránaíocht na Gaeilge sna ceantair a ríomhtar san innéacs. Ceann de na gnéithe is spéisiúla faoin saothar seo ná an bhéim a chuirtear ar chúrsaí canúnachais agus ar an dóigh gur féidir taighde a dhéanamh ar chanúintí na Gaeilge trí staidéar a dhéanamh ar leaganacha éagsúla, logánta na n-amhrán. Chloígh an mac léinn le cur chuige straitéiseach a chuirfidh áis thaighde den scoth ar fáil do phobal na Gaeilge, áis ar féidir tógáil uirthi ag leibhéal iarchéime agus na bunchlocha leagtha sa saothar seo. Rinneadh eagarthóireacht an-mhaith ar an ábhar féin sa tslí gur baineadh barr feabhais intleachtúil amach ag an leibhéal seo. Dá bharr seo tá an Chéad Duais á bronnadh ar an iontráil seo i gcomhpháirt le hiontráil amháin eile.

This is a comprehensive, well-informed index which covers a collection of songs made by Cosslett Ó Cuinn in Gola, Tory and Aranmore. This original, high-standard research demonstrates a very good understanding of the subject area, of the folklore sources and of the importance of this work in the valuable preservation of the Irish song tradition. Alongside a guide for the reader, the foreword to the index provides information on the background of Ó Cuinn and his work on the Irish song tradition in the districts which are dealt with in the index. One of the most interesting aspects of this work is the emphasis that is put on dialectal traits, and on the way in which the dialects of Irish can be researched by studying the various local versions of the songs. The student maintained a strategic approach which will provide the Irish public with an excellent research tool, one which can be developed at postgraduate level now that the foundations have been laid through this work. The material was very well edited, demonstrating intellectual excellence at this level. As a result of all the above this submission is being awarded joint first.

Winner 2

San aiste seo déantar anailís chuimsitheach ar an eolas atá bailithe ar chillíní na bpáistí, ar bhunús stairiúil na gcillíní, agus ar thuiscintí an phobail ar ghné den saol a coinníodh rúnda ar chúiseanna áirithe ar feadh i bhfad. Úsáideann an mac léinn an t-eolas áitiúil mar fhoinse bhunaidh agus mar phointe tosaigh le plé a dhéanamh ar cheist dhomhanda. Luaitear an ‘briseadh croí faoi rún’ i dteideal na haiste agus is léir go dtugtar fóram agus guth do ghné shainiúil a bhain le pobal na Gaeltachta, go háirithe don chuid sin de phobal na Gaeltachta nach raibh guth acu, na páistí seo a cuireadh gan aitheantas ceart. Tugann comhthéacs an taighde mionsonraí chun solais a léiríonn tuiscint dhomhain ar an ábhar. Baintear úsáid as foinsí ilghnéitheacha chun an taighde a chur i láthair i slí éifeachtach, chumasach agus léirítear fianaise luachmhar i gcomhthéacs acadúil ar thuairimíocht an phobail chomhaimseartha. Cé go bhfuil cuid áirithe den taighde bunaidh lonnaithe in iarthar na hÉireann, ní leasc leis an mac léinn dul i ngleic le comhthéacs domhanda na gcillíní a léiríonn cur chuige léannta, intleachtúil. Éiríonn leis an mac léinn barr feabhais a bhaint amach a chuireann ábhar bunaidh os comhair an léitheora. Mar thoradh air seo ar fad tá an chéad duais á bronnadh ar an aiste seo i gcomhpháirt le hiontráil eile.

This essay presents a comprehensive analysis of information which has been collected about children’s burial grounds, about the historical background of the burial sites and about the community’s understanding of an aspect of life which, for particular reasons, remained a secret over a long period of time. The student uses local knowledge as an original source and as a starting point to discuss a global issue. The ‘secret heartbreak’ is mentioned in the essay title, and clearly a forum and voice is given to a particular aspect of the Gaeltacht community, especially to that part of the community which had no voice: the children who were buried without proper recognition. The context of the research brings minute details to light which show a deep understanding of the subject. Various sources are used to present the research in an effective, competent manner, and there is valuable evidence of contemporary public opinion in an academic context. Although a certain amount of the research is based in the west of Ireland, the student embraces the global context of the burial grounds, which shows a scholarly, intellectual approach. The student achieves an excellent standard which presents original material to the reader. As a result of this, this submission is being awarded joint first.




IRISH

Innéacs de bhailiúchán amhrán a rinne Cosslett Ó Cuinn i nGabhla, i dToraigh agus in Árainn Mhór
(An index of a collection of songs made by Cosslett Ó Cuinn in Gabhla, Toraigh and Árainn Mhór)

Kayla Reed

This is an extract from a longer essay written as a project for the BA degree in Modern Irish. That original essay attempted to compile a comprehensive index of the songs collected by the Reverend Cosslett Ó Cuinn in Gabhla, Toraigh and Árainn Mhór in Tír Chonaill, so that they could be identified and compared with other versions. The full index contains one hundred songs, arranged alphabetically. This short version gives 29 of the most notable of them. Under the name of each song, the type of song is stated, along with any note Ó Cuinn wrote about it. It is stated from whom Ó Cuinn got the song, and where. The song's identifying details are given: the diary in which Ó Cuinn wrote it, the number he gave it, and the page number. Details of the song itself are given: how many verses it has, how many lines are in each verse, and whether any line is missing. After that, the first line of each verse is given.

Cosslett Ó Cuinn was born on 27 February 1907 in Doire Achaidh, Co. Antrim, and lived until 6 December 1995. Ó Cuinn earned an honours degree in Classical Languages from Trinity College in 1929, and afterwards entered the ministry. His interest in Irish was awakened while he was at university, and he began visiting the Gaeltacht. Ó Cuinn went on cultivating Irish and earned a reputation as a scholar, a translator and a collector of folklore. He filled 39 diaries with folklore material that he collected, mostly in Ulster. This index concentrates on three of those diaries, which contain material he obtained in Gabhla, Toraigh and Árainn Mhór.1

Gabhla: 39 of the songs came from Gabhla, along with 11 fragments. Ó Cuinn gathered those songs when he was in Gabhla in 1930 and again in 1932.2 They are shorter than the songs he obtained in Toraigh and Árainn Mhór. He got the songs from Frainc Mac Fhionnlaoich, Charlie Neansaidh, S. Ó Gallchobhair and Cathal Éamoinn Rua.3

Toraigh: Ó Cuinn obtained 29 songs in Toraigh while visiting the island in July 1939. He got the songs from John Tom Ó Mianáin, Cáit Tom Ní Mhianáin, Séamas Ó Dubhtháin Pádraig (Jimmy), Éamonn Ó Dubhtháin, Séamas Mac Ruaidhrí and Pádraig Mac Ruaidhrí.4

Árainn Mhór: Ó Cuinn visited Árainn Mhór in August 1940. He collected 32 songs and two fragments from Bidí Chaitlíne Nic a’ Bhaird, Róise Ní Ghrianna, Uilliam Ó Ceallaigh and Bidí Fransaí Ní Dhomhnaill.5

I did very little editing of Ó Cuinn's manuscripts, in order to provide the most faithful version of the material as it was recited to him. When Ó Cuinn first went collecting material in Gabhla, he had been learning Irish for only a little over a year. It is clear from the material he collected there that he had difficulty understanding the dialect: there is a large number of words about which he was in doubt.6 Some years later, when he was noting down songs and stories in Toraigh and Árainn Mhór and had a good command of the language, he still had difficulty understanding the people of Toraigh.7 He tried to write those sounds out regardless; the original version therefore has phonetic importance, even where the word itself can be guessed.

It appears from the material Ó Cuinn collected that he was trying to show the small differences in speech from district to district. He wrote smaoitighim and muitir instead of smaoiním and muintir, exactly as they are spoken. He also showed the differences between the dialects of the three islands: he wrote théann and choidhe in Árainn Mhór, and théid and chaoidhche in Oileán Thoraí. He had three different forms of the prepositional pronoun dom: mostly domh in Gabhla, dom in Toraigh and damh in Árainn Mhór.8 Accordingly, it is important for the record of the language that these forms are not hidden from the reader.

I made minor amendments to the punctuation. For the most part, the original text had no hyphen where t- and n- came before a vowel, but had one where h- came before a vowel. I changed that so that it agrees with present-day practice. Ó Cuinn did not always give the name of the song. In that case I used the first line of the song, except where it had a readily recognisable name, for example Éirigh Suas a Stóirín.


Ó Cuinn did not set out deliberately to collect songs, but he left an important collection behind him. He obtained versions of songs that are not found in other sources, and he showed exceptional fidelity to the versions he heard.

Index

Ag Siubhail Amach Dé Dómhnaigh
Description: love song. English is mixed through the 3rd verse.
Bidí Chaitilín, An Phlochóg, Árainn Mhór.
Diary 25, song 16, p. 29, 4 verses of 4 lines.
“Ag siubhail amach Dé Dómhnaigh a d’fhag mo chroidhe trom”
“Cé (gaidé) dheanfa mé i márach is cé dheanfa mé aríst”
“Go hIreland ma théighimsa true fancy must go”
“Mar an lán mara ag éirghe fá thaobh an Chnuic Bháin”

Amhrán na Scadán9
Description: fishing song.
Jimmy, Oileán Thoraí.
Diary 14, song 34, pp. 82-89, 6 verses of 8 lines. The 5th verse has only 7 lines and the 6th verse only 4 lines.
“Chuaidh John Ó Míonáin na nDúinnibh”
“Thainic an Tír Chonaill ‘na Chámuis”
“Thainig an sagart ‘un tosaigh”
“Da mbeitheá ar an Tír Mhór agus ar aontaigh”
“Nach truaigh é a fhear Maighistear Sinkan”
“Tá bóchall beag beadaidhe ar an bhaile seo”

An Bhanaltra
Description: love song. There is a version of this song in Folksongs of Britain & Ireland.10
F. Mac Fhionnlaoich, Oileán Ghabhla.
Diary 2, song 3, pp. 9-12, 8 verses of 4 lines. The 5th verse has only 3 lines.
“Nuair a fuair mé féin san bhanaltra amach go bruach loch Éirne”
“Míle m’anam ar maidín thú ‘s aríst ag teacht na hoidhche”
“Ní theachaidh síol i dtalamh ariamh níos deise ná síol aorna”
“A chailíní ‘s a chailíní an méid agaibh tá le pósadh”11
“Ar a theacht abhaile damh féin ar a hocht a chloig san oidhche”
“Is deas a’ fear i mbaile mé agus níl dúil agam i seanchainnt”
“Má thugann tú mil ar maidin damh, ná bé an leanbh aimhréidh”
“Tá cailleach i bpáirt na hUilinne agus bhí sí díthe comharsanach”

An Bhean Ba Mheasa Liomsa Faoi an Ghréin
Description: love song. Ó Cuinn wrote a note at the bottom of the last page: “Féach Píaras Feirtéir.” (“See Píaras Feirtéir.”) It is indeed a poem by Feiritéar, and the original is found in Dánta Phiarais Feiritéir.12
Róise Ní Ghrianna, Creag an tSeabhaic, Árainn Mhór.
Diary 25, song 6, pp. 14-15, 8 verses of 4 lines.
“An bhean ba mheasa liomsa faoi an ghréin”
“Nach fáda fáda bhí tú amuigh”
“Is truagh nach dteachaidh an ghaoth ó dheas”
“D’fhan mé leatsa bliadhan nó dhó”
“Bean adaigh ba tanaidhacha na taobh a ghloine”
“Da ba mise an t-éan a rachadh i bhfad”
“Seanfhocal agus é bheith fíor”
“Fádódh téine le loch”

An Lá Sin a D’fhág Mise Toraigh
Description: song of exile. There is a version of this song in The Irish of Tory Island.13
Jimmy, Oileán Thoraí.
Diary 14, song 26, pp. 58-59, 6 verses of 4 lines. There is only one line of the 5th verse, but the 6th verse has 5 lines.
“Lá sin a d’fhág mise Toraigh ba deas mo cheapóg nótaí”
“Lá sin ag dul go Glascú dom dhul ar bord an California”
“An chéad fhear a thainig orainn le siosur a bhain dúinn ar gcuid gruaige”
“Ag eirghe domsa ar maidín ‘s mé istoigh ag bean a’ lóistín”
“Siud a rabh na fideógaí atá ag cruinniú isteach na hoibridheannaí”
“A Mhuire agus a Rí nach mise an fear bhí amaideach”

Bláth na gCraobh14
Description: love song. There is a version of this song in Dhá Chéad de Cheoltaibh Uladh.15 The 5th verse is also found in the song An Buachaill Deas Óg.16
Oileán Ghabhla.
Diary 2, song 23, pp. 45-46, 6 verses of 4 lines.
“Rachaidh mé na phobal i mbárach”
“San uair nach tú a bhí i ndán damh”
“Is craobh ar a’ phobal a dfhag tú”
“’S is ro dheas a brollach ‘s a braghaidh”
“Dá mbéadh ‘s ag do mhuitir í Sheaghain”17
“D’fhag mé annsin í i naoi dtraigh”

Conall Cearnach
Description: fairy abduction / lullaby. The story of the song as Ó Cuinn received it: “Tugadh duine isteach sa bheinn uair amháin a rabh Conall Cearnach air. An bhean a bhí sé a gul a phósadh bhí sí istoigh róimh.” (“A person called Conall Cearnach was once taken into the hill. The woman he was going to marry was inside before him.”)
Oileán Ghabhla.
Diary 2, song 59, pp. 90-91, 3 verses of 4 lines.
“Spré ort nach mise Meadhbha”
“Bhí dhá ghearáin déag agam”
“Huis o hú agus ohó mo leanabáin”18

Cuach na Finne
Description: love song. Some of these verses belong to Siún Ní Dhuibhir, which is in Dhá Chéad de Cheoltaibh Uladh and in Amhráin Chúige Uladh, but if this is the same song, it is an unusual version.19
Oileán Ghabhla.
Diary 2, song 21, pp. 41-43, 10 verses of 4 lines. The first verse has only 2 lines.
“A Chuach na Finne ma d’imigh tú i ndiaidh do nead”
“A gheirrseach bheag bheadaigh bhí i reathaí i ndiaidh na bhfear”
“Dá mbeadh ‘s ag mo muitir gur i gCuileann a tharluigh mé”
“O bhí bean agam a’s caithfeadh sí an píop a ghrádhadh”
“Is saighduir sighilte mé d’migh as garda an ríogh”
“D’éirigh mé ar maidín agus ghluais mé ‘un aonaigh Iubhair”
“‘S a Shiubhan Ní Dhuibhir an misde leat mé bheith tinn”
“O Maire leanna a tharluigh an óigbhean chiúin”
“O thíar i nGaillimh tá searc agus rún mo chléibh”
“Gan bo gan gamhan mo leanbh gur fagadh mé”

Dailtín Toighe Móir
Description: love song.
Árainn Mhór.
Diary 25, song 21, pp. 37-38, 6 verses of 4 lines.
“Dailtín toighe móir ní phósfad go deó”
“Is ag Neidí tá an cú is deise ar a lúth”
“Racha mé ar cuairt na bhaile udaigh suas”
“Caidé an fáth damh bheith ag coimhit ar sgeimh deas na mná”
“Is aici atá Leitir ‘Ic a Baird”
“Tá mé mo luighe le corradh ‘s le mí”

Dán an Bháis
Description: religious poem. A version of this song can be heard on the tape Seal Mo Chuarta by Caitlín Ní Dhomhnaill of Rann na Feirste, but it does not contain all of these verses.20 Some of the other verses are found in Dánta Diaga Uladh.21
Róise Ní Ghrianna, Creag an tSeabhaic, Árainn Mhór.
Diary 25, song 5, pp. 12-13, 9 verses of 4 lines. The second verse has only 2 lines.
“Éistigidh liomsa anois a pheacthach”
“Tiocfaidh sé go colbha do leaptha”
“Tiocfaidh sé ag bun na coise”
“Cuirfimuid duine fá choinne an tsaghairt”
“Nach minic a bhí tú do luighe ar meisce”
“Bréagach thú ars an cholann”
“Tiocfaidh Mícheál as na Flaithis”
“Dhéanfar draichead réidh dhé a ghloine”
“Béidh na haighle22 ar uachtar uisce”

Dán an Túir
Description: religious poem. This song is in print in Cosslett Ó Cuinn by Risteárd Ó Glaisne.23 Another version is found in Dánta Diaga Uladh.24
Bidí Chaitilín, An Phlochóg, Árainn Mhór.
Diary 25, song 1, pp. 1-6, 26 verses of 4 lines.
“A ghiolla atá faoin tsiabadh”
“Ó chuir tú an cheist orm in onoir Iosa”
“Ceist eile agam ort aríst”
“Fiche bliadhan go Dómhnach s’ chuaigh thart”
“Is doiligh liom sin a chreidbheáil uait”
“Creid thusa go dtiocfaidh an uair”
“Ní mise a rinn a’ t-olc”
“A chromádaigh mo chroidhe”25
“Bheirfinn comhairle duid”
“Nuair a bhí mise ar shaoghal shalach na gcathuighthe”
“Nuair a rachainn ‘un Aifrinn Dé Dómhnaigh”
“Nuair a thiocfainn na bhaile tráthnóna”
“Bhí easnaidheach mhór ar mo theaglach”
“Bhí mé sanntach súl-radharcnach”
“Níor leig26 mé ríamh i gcoisde an bháis”
“Annsin a chonnaic mé bunadh Chríosta”
“Labhair an t-Údás Nimhe”27
“Cé gur do sheirbhís bhí sé a dhéanamh”
“Níl a fhios agam arsan t-athair síorrthaidhe”
“Annsin a labhair Muire faoi umhlaidheacht”
“Ba é a thug an deoch nimhe”
“(Maise) a mháthair nach dtearn an t-olc”
“A Mhic mo chroidhe na leig leis é”
“Sin tús agus deireadh mo scéil”
“Ach go bé guidhe Mhuire ar a héanmhac”
“Ní hé an uair a béas rosg na súl á mbriseadh”

Dán na hAoine
Description: religious poem. There are 19 versions of this song in Caoineadh na dTrí Muire,28 but versions DA3 and DA4, taken down from Caitlín Óg Ní Dhomhnaill, Croith Uí Bhaoighill, and from Anna Nic Eiteagáin, Mín a’ Ceachan, respectively, are the closest to this version.29
Bidí Chaitilín, An Phlochóg, Árainn Mhór.
Diary 25, song 3, pp. 8-10, 10 verses of 4 lines. The last verse has 6 lines.
“Sí seo an Aoine thursach bhrónach”
“A Rí mhór na hAoine nach tursach a bíos tú”
“Nuair a chualaidh an Mhaighdean gur gabhadh a héanmhac”
“Casadh fuil Íosa uirthi go híseal sa ród dí”
“Le sin tháinig smúid as cuimse ar na réaltaí”
“Chuir siad culaith amadáin ar Éanmhac Íosa”
“Sin agaibh ár Slanuigheóir agus é ag dul ‘a chéasadh”
“Thóg siad suas ar chrann dólais na páise é”
“Thug siad cupan bhinéagra dó a Dhia nárbh a deoch ghéar é”
“Bhí a gcuid claimhtheach leóbhtha is iad rólíomhtha”

Dán na Sgabail
Description: religious poem. A song called “Sgaball Mhuire” is found in Dánta Diaga Uladh, but there is little resemblance between that version and this one.30
Bidí Chaitilín, An Phlochóg, Árainn Mhór.
Diary 25, song 4, pp. 10-12, 8 verses of 4 lines.
“Tá an mhaighdean faoi thuirse is tá an mhaighdean faoi bhrón”
“A mhic a tsoluis agus a athair a’ truaigh”
“A chairde gaoil faithchilligidh an bás”
“A chairde gaoil faithchilligidh an bás”
“Ag éirghe dúid ar maidin dean do chasaoid le Dia”
“Ag éirghe damh ar maidin bhí mé i dtrioblóid a’ tsaoghail”
“Tabhair an sgabal is chaith í gach lá”
“Peacach bocht mise a pheacaigh go mór”

Éirigh Suas a Stóirín
Description: love song. An uncommon version of this song. The usual version is found in Ceolta Gael 2.31
Jimmy, Oileán Thoraí.
Diary 14, song 19, pp. 44-45, 7 verses of 4 lines. The 5th verse has only 3 lines.
“I ngleanntán na Coilleadh Uaignighe is lag brónach mar bhíos”
“Is nuair a éirighimse ar maidín as amharcaim uaim”
“Bheirimsa mo bheannacht thart romham insa tslighe”
“Rise up my darling nó nach bhfuil tú do shuighe”
“Na fagaidh droichmheas orm ma tá mé bog óg”
“Bheirimsa mo mhallacht don mhnaoi óg a dtigh32 mo dhéidh”
“Racha mise i márach go hAonach a’ Ghleanna”

Gleann Éinigh33
Description: marriage song. In the story Ó Cuinn was given, it was said that a builder from Connacht and his son came to Dún na nGall to build a house, and that there was a girl in that house. The father wanted his son to marry her, but the son was in love with a girl at home in Connacht. They gave the son strong drink, and the pair were married while he was drunk. When he came to himself, he made a tormented song about the first woman, whom he would never have.34
Árainn Mhór.
Diary 25, song 23, pp. 40-41, 5 verses of 4 lines.
“Tá oileán beag i lár na hÉirne a bhfásann ‘féar air go luath”
“Go Connachta síar sé mo bhrón nach dtéighim a choidhe”
“Is truagh nach bhfuil mé agus mo chéile beag óg”
“Dá mbéadh a fhios ag mo mhuirnín go cinnte mar tá”
“Dar a nglacfadh mo dhá láimh ‘leabhair bána ‘s chan aon bhréag”

Inis Oirthir a’ Port a D’fhág Mé
Description: fishing song.
Oileán Thoraí.
Diary 14, song 37, p. 87, 3 verses of 4 lines.
“Inis Oirthear a’ port a d’fhág mé”
“Chuirfinnse leitir inns ortsa a Dhálaigh”
“Má chluineann Mag Gaoith é ní bhíonn sé sastaí”

Is Uait Aon Phóg Amháin35
Description: love song.
Jimmy, Oileán Thoraí.
Diary 14, song 24, pp. 54-55, 5 verses of 4 lines. The first verse has only 3 lines.
“Is uait aon phóg amháin sé d’iarfainn ‘e mhalairt leat”
“A bhailintín ná meall36 orm níl geall agam ar aon duine ach thú”
“Fásfaidh féar agus fásas fríd an fhairrge uilig go léir”
“Ach go bé gur tú bhí i ndán dom gheobhainn árus ag mo mhuintir féin”
“Seo cluanaidhe beag dá luaidh liom anoir i dtús mo shaoghail”

Luan an tSléibhe
Description: religious poem. A version of this song is found in Amhráin Hiúdaí Fheilimí and another version in Na Cruacha: Scéalta agus Seanchas.37
Bidí Chaitilín, An Phlochóg, Árainn Mhór.
Diary 25, song 2, pp. 6-8, 9 verses of 4 lines. The 4th and 6th verses have 6 lines, and the last line of the 9th verse is missing.
“Sé Luan an tSléibhe luan an léirsgrios”
“Oiread do spéic do chlog Mhic Dé”38
“Tiocfaidh an uair go mbeidh muid buadharthaí”
“Annsin a thiocfas a Déagh Mhac ceart a dhéanamh”
“Annsin a thiocfas a mhaighdean dheas barramhail spéireamhail”
“Sin na maolruisg a shíl do dhéidhsa”
“Diomchar sí a héanmhac trí raithche na héanbhruinn”
“A’ té chuireas a dhóchas as rí an Dómhnaigh”
“Do chroidhe a dhortú don aithrighe ró-gheall”

Máire an tSeóid39
Description: love song.
Éamonn Ó Dubhtháin, Oileán Thoraí.
Diary 14, song 13, pp. 29-30, 3 verses of 8 lines.
“Ag dul fríd Chonnla40 dom do casadh an cúilfhionn orm”
“Stad an ainnir seal beag baoideach”
“Fuadh mé un seancharr le fear ón chomhursanacht”

Máire Óg na gCiabh
Description: love song.
Oileán Ghabhla.
Diary 2, song 33, p. 52, 3 verses of 4 lines. The second verse has only 2 lines.
“A Mháire Óg na gciabh fá impidhe ó Dhia”
“Le lamhachta gunnaí móra, teinnte crábhacha na dhiaidh”
“Bhí cíos agus géarrtha agus ministéirí gallta”

Naoi gCnó ar a’ gCnoibhínn41
Description: Fenian lay. A sciathóg (wicker basket) is asked about the days when it was first made, and it gives an account of all the heroes who came to look at it.
Uilliam Ó Ceallaigh, Léidhb Gharbh, Árainn Mhór.
Diary 25, song 7, pp. 16-17, 7 verses of 4 lines. The 4th verse has only 3 lines.
“Sgíachóg ó abhall amach”42
“Táim annseo le haimsir”
“Lá dá rabh Fionntún agus é ag cnuasach”
“Ag éirghe dó na sheasamh”
“D’fhas mé mo lúrthainn”43
“Clann na Baoisc, Clann na Bóinne, Clann Cormaic ‘ic Airt”
“Le linn Chormaic Mhic Airt”

Nár Dheas a Bheith in Oileán Thoraigh
Description: song of exile. Composed by Séamas Mac Ruaidhrí around the year 1924 while he was abroad in America.44 There is a version of this song in The Irish of Tory Island.45
Séamus Mac Ruaidhrí, Oileán Thoraí.
Diary 14, song 1, p. 1, 3 verses of 4 lines.
“Nár dheas a bheith in Oileán Thoraigh”
“Is truagh nach bhfuil mé ag iollaidh a’ mhónaidh”
“Is é ruaigeadh an smúid dubh go luath as m’aigne”


Níl Ór ar Pláta na a’ tSeóid ‘a’ Bhreághachta
Description: love song.
Árainn Mhór.
Diary 25, song 22, pp. 38-39, 4 verses of 8 lines. The 3rd verse has only 6 lines.
“Níl ór ar pláta na a’ tseóid ‘a’ bhreághachta”
“A Mháire a ruansearc éalaigh liomsa”
“Mo chreach a Mháire mhodhthamhail ó chonnaic mé tú ‘óigbhean”
“Diarr mise ciúmhas uirthi de’n phlúid a bhí faoithe”

Ógánaigh na gCarad
Description: love song. There is a version of this song in The Irish of Tory Island.46
Bidí Chaitlín Nic an Bháird, An Phlochóg, Árainn Mhór.
Diary 25, song 12, p. 24, 4 verses of 4 lines.
“A Ógánaigh na gcarad car chodhla tú aréir”
“Ag éirghe damh ar maidín is paidir damh an deór”
“Dá mbéinn ar an éanlaith udaigh éirigheas go hárd”
“Is truagh nach bhfuil mé is mo mhíle grádh”

Padraic Mac Ruaidhrí48
Description: fishing song.
Oileán Ghabhla.
Diary 2, song 14, pp. 29-30, 4 verses of 8 lines.
“A Phadraic ‘ac Ruaidhrí a chorp an duine uasal”
“Ó Nansaidh tá an craobh leat i dToraigh go siorraidhe”
“Tá coirce agus aorna chomh fairsing le mónadh”
“Tá ‘n rigging in ordú ní baoghal da cuid seoltaí”

Tabhair Sgéal Uaim go hÁran49
Description: marriage song.
Oileán Ghabhla.
Diary 2, song 39, p. 57, 4 verses of 4 lines. The 4th verse has 6 lines.
“Tabhair sgéal uaim go hÁran”
“Tá Séamus ions’ a’ leabaidh”
“Bhuail Cutaidh an doras”
“Thíos i mBaile an Uachtar”

Tá Raiche ar an Bhaile seo ag Eoin
Description: satire.
Cait Tom Ní Mhíonáin, Oileán Thoraí.
Diary 14, song 32, pp. 78-79, 3 verses of 8 lines.
“Tá raiche ar an bhaile seo ag Eóin”
“gCuala sibh raiche an chinn bháin”
“Deir Ned ‘ac Colla más fíor”


Tarlach Beag Scrábach
Description: marriage song. Ó Cuinn wrote that this song was made about a man from Rann na Feirste who married a woman from Gaoth Dobhair.50
Oileán Ghabhla.
Diary 2, song 35, p. 54, 2 verses, of 4 lines and 5 lines.
“Tarlach beag scrabach a bhfuil air dath a’ leaban”
“Taobh thall ó Ghaoth Dobhair gan bean insa’ tír”

Toraigh na dTonn
Description: song in praise of Oileán Thoraí. Cait Tom Ní Mhíonáin told Ó Cuinn the story of the song: a sailor was left stranded in Toraigh when his boat left without him. The people of the island gave him a generous welcome, and he made this song.
Cait Tom Ní Mhíonáin, Oileán Thoraí.
Diary 14, song 8, p. 11, 3 verses of 4 lines.
“Toraigh na dtonn agus na dtom agus na mbeann a bhí árd”
“Is mise bhí gan chéill rinn an éagcaointe thart faoi mur gcoim”
“A Thoraigh ó thuaidh go buan go mairidh tú beó”



References

1. Ó Glaisne, 1996: 2, 4-5, 22-23, 436.
2. Ó Glaisne, 1996: 22, 50.
3. Ó Canainn, 1994: 28; Ó Cuinn, 1930-1932.
4. Ó Cuinn, 1939; Ó Glaisne, 1996: 79.
5. Ó Cuinn, 1940; Ó Cannainn, 1994: 28; Ó Cnámhsí, 1988: 194; Ó Glaisne, 1996: 81.
6. Ó Glaisne, 1996: 4-5, 15, 22; Ó Canainn, 1994: 24.
7. Ó Glaisne, 1996: 78-85.
8. Ó Cuinn, 1930-32; Ó Cuinn, 1939; Ó Cuinn, 1940; Ó Glaisne, 1996: 80, 425.
9. Note written by Ó Cuinn: “Éamonn ‘ac Ruaidhrí a rinn.” (“Éamonn ‘ac Ruaidhrí made it.”)
10. Kennedy, 1975: 79.
11. This is not the verse found in An Ghiobóg, although it begins with the same line.
12. Ó Duinnín, 1934: 109.
13. Hamilton, 1974: 55.
14. Malaidh an tSléibhe Bháin; Ó Muirgheasa, 1974: 137.
15. Ó Muirgheasa, 1974: 137-136.
16. Ó Laoire, 2002: 344.
17. “’S dá mbeadh fhios ag d’athair a Sheáin” was the Toraigh version of this line. Ó Laoire, 2002: 344.
18. Ó Cuinn wrote a note before this verse: “Bhí fear ag tarraingt uirthi fear mór fáda agus é ar mire.”
19. Méith, 1977: 42; Ó Muirgheasa, 1974: 53-54.
20. Ní Dhomhnaill, 1992.
21. Ó Muirgheasa, 1969: 172-175.
22. na haingle; Ní Dhomhnaill, 1992.
23. Ó Glaisne, 1996: 82-85.
24. Ó Muirgheasa, 1969: 176-178.
25. chomráda, Ó Glaisne, 1996: 83.
26. loic, Ó Glaisne, 1996: 84.
27. Iúdas, Ó Dónaill, 2005.
28. Partrige, 1983: 236-253.
29. Partrige, 1983. Version DA3 is on pages 238-239, and version DA4 is on pages 239-240.
30. Ó Muirgheasa, 1936: 122-123.
31. Ó Baoill, 1997: 54.
32. údaigh/ úd.
33. Cuaichín Ghleann Néifín, Tógfaidh Mé mo Sheoltaí.
34. Ó Cuinn, 1940.
35. This song is very similar to the song A Valaintín, which Ó Cuinn obtained in Árainn Mhór.
36. Feall.
37. Ó Baoighill, 200: 94-95; Ní Dhíoraí, 2009: 178.


38. “Ar an treas béid de chlog Mhic Dé” Ó Baoighill, 2001: 94.
39. This song is called Máirín Seoighe in Connacht. Ó Cuinn, 1930/1932. A close version of it is found in Ceol na nOileán. Ó Ceallaigh, 1993: 38.
40. Chonga, Ó Ceallaigh, 1993: 38.
41. Ó Canainn, Innéacs Garbh.
42. Sciathóg ó abhaill amach? That is, a sciathóg made of wood from an apple tree?
43. Ó Cuinn wrote “lúireach” in the margin of the page. Ó Cuinn, 1940.
44. Ó Cuinn, 1939.
45. Hamilton, 1974: 73.
46. Hamilton, 1974: 59.
47.
48. Aodh Ó Canainn wrote that Pádraig ‘ac Ruaidhrí was from Toraigh and that he was a pirate. Ó Canainn, Innéacs Garbh.
49. Note written by Ó Cuinn: “Rinn Hiudaigh ‘ac Cathmhaor.” (“Hiudaigh ‘ac Cathmhaor made it.”)
50. Ó Cuinn: 1930-1932.
Note written by Ó Cuinn: “Caitilín Ní Mhíonáin (Cait Tom) a chuala é 17 nó 18 mbl ó shoin ó Mháire Dhonnchaidh .i. M. bean Mhic Ruaidhri bhí os cionn 100 bl d’aois.” (“Caitilín Ní Mhíonáin (Cait Tom) heard it 17 or 18 years ago from Máire Dhonnchaidh, i.e. M., wife of Mac Ruaidhrí, who was over 100 years of age.”)
