Whole Genome Approaches to Complex Kidney Disease
February 11-12, 2012 Conference Videos

Ethnicity and Community: Impact of Genetic Findings and Disclosing Results
Malia Fullerton, University of Washington

Video Transcript

SARA HULL: So, now I’m going to introduce Stephanie Malia Fullerton, who’s an Associate Professor of Bioethics and Humanities at the

University of Washington School of Medicine. She holds adjunct positions in the departments of genome sciences and epidemiology and is a co-

investigator of the University of Washington Center for Genomics and Healthcare Equality. Her work explores scientists’ understandings of

human genetic variation and its relationship to disease risk, the use of racial and ethnic constructs in the conduct and interpretation of

genetic research, and the responsible incorporation of genomic methodologies into broader programs of health disparities research.

She’s also very involved in exploring participant perspectives on data sharing, research use, and result return in the context of genomics. And so,

she’s going to add an additional nuance to this conversation, talking with us about ethnicity and community and its impact on these issues.

STEPHANIE FULLERTON: Thank you very much, Sara, and thank you so much to the organizers for inviting me to participate, and thank you to all

of you who are here in the audience. Maybe it’s just because I’m on West Coast time, but I’m normally sound asleep at this time on Sunday morning. So,

I’m very pleased that you’re here and I hope that I will be able to keep you awake with some of my thoughts. I want to sort of preface this by saying,

as Sara mentioned, I am involved in a range of research ethics-related research within and around the conduct of genomic research, and I’m

used to expounding from the position of having actual data. I think in this domain where we’re thinking in particular about the implications of

findings from exome and whole genome-level research for communities, we’re actually in a bit of a data void at the moment; we don’t have much

good data. And so, most of what I’m going to be talking to you about today is speculation and basically a call for more research as you all move

forward doing the very important science that you’re trying to do. So, just by way of a road map, I’m going to try to do three things in today’s

presentation. I’ll keep an eye on the time here. First, I want to…oh, and I wanted to say at the outset, so we’ve had a genetic counselor talk to

us about informed consent, we had a lawyer talk to us about return of results, and now you’re going to have a population geneticist talk to you

about community, because that’s what my primary training is actually, in human population genetics, and that sort of colors my take on this

topic and I wanted to sort of say that at the outset. I’m going to talk a little bit about the nature of the findings that are anticipated from whole

exome and genome sequence analysis and its implications for recruitment and result return as it impacts communities. I’m going to talk a little bit

more about another aspect of the kinds of anticipated results that might be generated, focusing in particular on the population

distribution of those anticipated findings, and talk a little bit about the possible implications for study design and communication with communities, as

well as the ultimate translation of the information that we’re going to be generating by these approaches. And then in the last few minutes of

my presentation, begin to kind of very briefly address potential strategies that we might wish to employ in addressing these challenges that are

posed by this new scale of genomic science. Okay. This was basically based on a review of science as I was aware of it prior to yesterday’s

presentations. I tried very hard not to fiddle with these slides, but fortunately I think they’re commensurate with a lot of stuff we heard

yesterday. So, what are we going to expect to find from whole exome and whole genome sequencing? As we’ve already heard, there’s

going to be a deluge—a Niagara Falls, a fire hose, all those water analogies—of genetic information that’s going to be generated. As we also know,

with the switch to sequencing, particularly in the exonic regions of the genome, the vast majority of the genetic variation that’s going to be

identified is going to be rare. So, we’re moving out and away from this focus on common genetic variations that might be shared across

populations, and the variants we’re looking at in any given population are going to be typically in fractions of a percent, certainly less

than half a percent, and in much of the data we heard about yesterday, less than a tenth of a percent in population relative frequency. And in

addition and very relevantly and importantly to Ben’s presentation that we just heard, a significant fraction of the variation that is going to

be identified, particularly in the exonic regions—not all of it but a significant amount of it—is going to be functionally relevant, with functional

relevance variably and incompletely and imperfectly defined and understood. So, let’s just stop and think about those two aspects of the

kinds of information that are likely to be generated from whole exome and whole genome sequencing. What are some of the implications?

As we heard yesterday, in a turn to looking at rare variation and trying to understand the phenotypic effects of rare variation, we are

going to need to increasingly look at very large numbers of research participants. You thought we needed large sample sizes in GWAS; it’s

going to be even more acute to have large populations as we turn to this brave, new world of exome and whole genome sequencing. And

there’s a really big problem from a research ethics standpoint and the really big problem here is that the populations that we have available to

us and that we have studied for decades now in the genome sciences are very skewed towards populations of European ancestry. Work that was

done by Anna Need and David Goldstein and then subsequently commented on by Carlos Bustamante and colleagues last summer

describing the population distribution of the GWAS studies that have been conducted to date, 96% of GWAS studies have been conducted on

populations of European descent, 4% on non-European samples. So even as we need to be increasing sample sizes, it is not going to be

possible if we are interested in understanding diverse contributions to genetic risk to simply go to our freezers and start with what we have.

There’s going to be an acute need for the recruitment of diverse populations from varied population backgrounds. What are some other

implications? A very interesting one. I don’t really have any direct data that I can show you but, we know this. This is a feature from population

genetics and it’s rather different from the situation with common genetic variants. As we increase the number of the people that we studied, our

problem is going to get harder, not easier, because as sample size increases we’re going to identify more and more and more rare variation,

okay? This is not the case with common variations: the more you look, the more you see what you already know is there and the easier it

becomes to make sense of it. When we switch to looking at rare sequences, rare variations, there’s going to be a significant signal-to-noise problem

and we’re going to have to figure out how to sort that out. And although, as we heard yesterday, there are some really neat-o, cool statistical

ways of making sense of and increasing statistical power surrounding, in particular, the analysis of an aggregate of rare variations, it’s not at all clear

how we’re going to make effective clinical use of this information. We’re moving away from thinking about genetic risk variations as something that

we might be able to kind of use from a broad public health context, moving much more into the clinical domain, and it’s not exactly clear how

we’re going to make use of this information, so I just want to kind of flag those. We can talk about that more in questions-and-answers. So, those are

some implications of the fact that we’re going to identify a lot of very rare variation. Another feature of the data, as I indicated, is that we’re

going to identify a very large number, a very large number—hundreds and likely, thousands—of variations which our algorithms tell us are

functionally relevant, and this is not in total, this is per person, okay? And that’s a very interesting and important scientific problem but it makes this

business of thinking about the return of information to research participants that we know there’s a lot of interest in and excitement

about currently in the genomics community, enormously complicated, because it’s been talked about and, as we heard in a very nice

presentation by Dr. Solomon yesterday, variant effects are going to need to be understood and prioritized with regard to their clinical salience

prior to communicating individual findings to research participants. And as Ben has already indicated, and I don’t want to spend a lot of time

on this but actually, we don’t quite know how to do this yet. There’s a lot of disagreement in the bioethics community. Even in the clinical genetics

community I have come to understand there are proposals on the table. This particular paper was alluded to in Dr. Solomon’s presentation yesterday

by Berg and colleagues, a way of actually sort of categorizing or binning possible types of incidental findings that might be generated from

whole exome and whole genome approaches, which basically relies on a joint adjudication of the clinical utility and validity of this information in

conjunction with whether we have some prior knowledge or understanding of whether variants are deleterious or presumed deleterious, and

depending on how information falls out in terms of those two bits of kind of decision making, this gives us an indication…let me see if I can figure

out how to do this…never mind. This gives us an indication of which findings we might likely wish to return. This already is complicated and begins

to start giving me a headache as someone who’s interested in this issue. What has been very sobering to realize is you put three medical

geneticists in a room and they will not even agree with regard to the clinical utility of a lot of this information, and it gets very tricky. So, there’s

going to be a lot of work to be done before we can actually meet, even potentially as understood currently, ethical obligations with regard to return,

and I just wanted to make the point that functional relevance, particularly as we were talking about it yesterday where we were talking about sort of

computer-based algorithms which will tell us about the likely effect on protein function, even those algorithms do not necessarily agree with

each other. Functional relevance and an adjudication of functional relevance which might possibly be algorithmically tractable, is not the

same as clinical utility. As I’ve already said, there’s a lot of disagreement in the medical-genetics community about clinical utility. We came

face-to-face with this issue in the context of deliberative work as part of the Electronic Medical Records and Genomics Research Network that

I’m involved in. We have a paper that is about to come out next week in Genetics in Medicine

talking about our experience with deliberating on findings in the context of that research network, and these were incidental findings generated in

the context of genome-wide association studies, not exome or genome scale investigations—sequencing investigation—where we determined

after much discussion and debate and deliberation that there were four possible classes of findings that might rise to the level of return,

and then after we finally came to that agreement and a consensus of the network, people took that back to their individual sites, looked very carefully

at medical record data for the individuals who were affected by the particular genotypes, and where invariably at every single site and due to a

confluence of factors including informed consent, including the understandings of the institutional review board, and other considerations,

community interactions, no decision was made to return any finding. And in a classic statement—understatement—that I attribute to my many years

of living in the United Kingdom, we wrote, “Although a criterion of ‘clinical actionability’ suggests a clear threshold for identifying results

that should be considered for return, in practice, the identification of specific clinically actionable findings generated from the eMERGE studies was

not straightforward.” This was a very difficult conversation. It took a very long time to figure this out for four incidental findings and we are about

to enter in a brave, new world where we’re thinking about this now not for four, but for hundreds and possibly thousands of potentially

functionally relevant variants, so lots of food for thought there. Okay. So, that is one class of finding, one way of thinking about the data. The

data is generating many rare variants, a fraction of which are going to be likely functionally relevant and may pose

obligations, but exactly how to act on those obligations is going to be extraordinarily difficult to sort out. Where do ethnicity and community

figure in here? Well, this brings me to the second way in which I want to think about the potential nature of the findings likely to be generated from

the whole exome and whole genome data. This is just a snippet of the very considerable data that were reported out of the 1000 Genomes Project,

a project to sort of sequence in considerable detail the whole genomes of a large number of samples collected from around the world and in

data that was reported in 2010 and the emerging data from the Exome Sequencing Project which was talked about yesterday, but unfortunately,

those data are not yet available publicly. We know that when we go in and look at populations of individuals sampled from different parts of the

world and we look in particular in the exonic regions of the genes where there’s lots of this rare variation, that what you identify that has

previously been described tends to be more often shared between populations and what is rare and new and being seen for the first time is

invariably population-specific, and this has very profound implications for how we’re going to actually communicate with communities about the

conduct of this research. So, a little bit more on expected findings. Rare variation not only will require large sample sizes and have lots of

functional relevance, but as I just indicated, rare variation will frequently be geographically restricted, will be population-specific, and in

addition and this is something which was not talked about very much yesterday—it’s something that I don’t think has received nearly enough

attention but I want to put it on the table for discussion—rare variants are going to be non-randomly distributed amongst individuals and

families. Some individuals and families will have more of this rare novel variation than others. Why? Because of differences in background

polymorphic variation that have to do with human evolutionary history, and that are largely a function of genetic ancestry, which, as we know

and as we saw in data presented yesterday by Suzanne Leal and others, is often correlated with self-described ethnicity. Okay, let’s think about the

ethical implications of some of this stuff. Population-Specificity. Well, population-specificity, I think, is going to be an inevitable feature of much

of the data being generated by these methodologies. We have to be very careful of how we talk about such results because

population-specificity could frequently be misunderstood. It might lead us to overemphasize race- or community-specific genetic

vulnerabilities in preference to other shared factors, and in an interesting way and a way that has not at all been addressed or explored, it

could have some profound effects on the identity of people. For example, one could imagine the situation where an individual of one

self-described ethnicity is told that they harbor a variant that has otherwise been described as specific to a different population, a different

ethnicity, a different region of the world. Let us not underestimate the extent to which this is likely. There are some truly remarkable population

genetic data out here. This is one of the most colorful and exemplary examples that I’m aware of, work by John Novembre and colleagues

looking at patterns of genetic variation as assayed from sequencing in large numbers of people from Europe and where, basically, these

are not…the colored circles have to do with sort of composite estimates of patterns of genetic variation and first and second principal

components, but where there’s a very clear tendency for individuals whose heritage basically hails from particular places in Europe to have

their genetic variation very closely resemble one another. So this propensity to have rare variants mapped very closely at specific geographic

locations, particularly for individuals and families who have been in situ for several generations, is very high and we’re going to need to be thinking

about this carefully as we begin figuring out how we’re going to communicate with participants about the kinds of findings we’re generating. In

addition, population-specificity could, frankly, be not only misunderstood but potentially misused. It could distract us from the consideration of other

relevant public health remedies in the context of the diseases and traits that we’re interested in studying and understanding, and it might open up

the potential for stigmatization or discrimination in particular groups on the basis of the apparent population-specificity of findings that are

indicated. There’s no evidence that this has happened to date, necessarily, but there are interesting little suggestions of what possibly

could occur. I just want to make reference to this paper which came out several years ago by Carlos Bustamante’s group, doing a resequencing

study of a very small number of individuals, self-described Europeans or European Americans or African Americans and going in and making some

inferences in reference to an out-species—chimpanzee—about the likely deleterious nature of genetic variation and coming to, at the time, the

surprising and somewhat controversial conclusion that there was proportionately more deleterious genetic variation in European as

opposed to African populations. Now, leaving aside the fact that they came to this conclusion on the basis of analysis of 15 Europeans and 15

Africans, which may or may not be problematic, I think it’s very easy to imagine if the headline had been reversed—if they had found that there was

more deleterious variation in Africans than in Europeans—how we might think about the social salience of that, particularly in the context of

really pronounced and profound health disparities which exist in the United States context. There’s just an example here of cancer disease burden

by different racial and ethnic categories and sexes and incidence and death, and it would be very easy to jump to the likely erroneous

conclusion that that deleterious variation is explaining a broader disease burden that many, many researchers believe has a much more

complicated and multifactorial basis and a basis in social determinants of health. So obviously, there’s a need for great caution. That’s the

population-specificity aspect of the data. There’s also this issue of the fact that the ways in which rare variations are going to be found in individuals

that we sample are going to be non-randomly distributed and that indeed, population genetics tells us—and we already have data to support

this observation—that individuals of sub-Saharan African ancestry are likely to have more rare and consequently more novel variation. What does

this mean for us as investigators? Does this make individuals of sub-Saharan African ancestry interesting objects of investigation, or instead,

difficult to study? I don’t know how to answer that question, but the difficult to study explanation has been one of the explanations provided for

the fact that we have such an uneven distribution of samples currently in human genetics and genomics. For sure, there are going to be fewer

definitive results to be communicated to research participants from such population genetic backgrounds in the near term, and this is already

something that we have begun to encounter in the context of clinical genetic testing that is already clinically available. These are data from a

particularly…they’re indicative data but it’s from a relatively small study of women who’ve developed breast cancer early in their lives,

showing the genetic population mutational distribution of BRCA1 and BRCA2 variants and basically—I know this is very hard to see—but

what I want to draw your attention to is the row describing variants of uncertain significance showing that approximately four times as many

variants identified in the BRCA1 and BRCA2 genes are found to be of uncertain clinical significance in individuals of African American

ancestry, in part because they have simply been less well-studied to date. We can take what we’re experiencing in the clinical domain and

bring that into the research context, add to that the fact that we’re going to have a lot more rare and novel variation to begin with and that we’re not

going to be able to tell people about, and I think this poses some particularly profound ethical concerns. In addition, and this goes to the

question that I had for Dr. Rich at the end of his presentation yesterday and in particular, his clear indication and I believe that the writing is on the

wall, that the kinds of designs that are going to work best for the interrogation of rare variation turned up in exome and whole genome

sequencing studies are family-based, pedigree-based investigations with the expected return to such study designs in the context of this kind

of research. We’re increasingly going to be looking to the need to recruit large and extended pedigrees from all sorts of populations, and yet

we don’t really know very much about how easy it is. I was very gratified to hear that there are a large number of pedigrees already available from

underrepresented ethnic minority communities in the kidney disease domain; that’s wonderful. More will need to be collected and yet we don’t

know, really, how to think about recruitment or even the communication of findings in a family-based context in minority communities. The most

definitive report that was published a couple of years ago now on the use of family history in clinical research, making some recommendations

for how to do this work well, unfortunately noted that there’s actually very little evidence to suggest the role that race or ethnicity, cultural

background, religious belief or other characteristics might have on one’s willingness or ability to report on their family history, or

presumably, to participate in research that would involve family-based ascertainment. These are just big, huge lacunae in our understanding and

we’re going to have to be grappling with these issues, I think, as we move forward, of necessity in order to do the science that we want to do. So,

just to quickly re-cap on my sort of summary of the data and where it leads us in terms of its ethical implications. The need to study and

understand rare variation is going to, out of necessity, require the recruitment of new—not simply just using what we have to

hand—recruiting new large cohorts. Understanding uncertain functional significance is going to require us to recruit and analyze diverse

cohorts that are adequately powered. I didn’t really talk about that very much but it’s not just enough to sort of include minority representatives

in our research, but we need to have enough of them in order to make robust inferences; so adequately powered to identify local population

and possibly familial effects. Because of the population-specific nature of much of this variation we’re going to have to exercise extreme

care in describing findings that are apparently restricted to particular social strata, and because of the non-random distribution of rare variation

we’re going to have to acknowledge lots of complications inherent to the participation of certain communities. Even as we are inviting the

participation, we’re going to have to make clear that we may not have the same kinds of information to return to them. What’s a researcher

to do in the face of these somewhat uncomfortable facts about where the science is leading us? Well, obviously we need to be paying

attention to the role of reciprocity in research. We’ve had some wonderful discussion already this morning about the role of informed consent

and relationship-building and partnership-building with participants; I am all for that. As we do this work I think we have to acknowledge that the

longer term public health benefits of much of what we’re doing in the near term are unclear and are going to be hard to explain to research

participants. I cannot emphasize enough that altruistic research participation, which is what we all depend upon, is a luxury. It is easy for

certain groups of people and it is not for others and we need to begin to acknowledge that, and rather than talking about populations that are hard

to recruit, we need to be going out of our way in order to collect the kinds of people and information that are going to make clear that

public health benefits are going to be equitably distributed, while we recognize that this distant promise of public health benefit may not suffice

as a rationale for participation for many communities. Near-term benefits are also likely to vary. The fact that a lot of the individual research

findings generated from research are going to be indeterminate—and they’re going to be differentially indeterminate in particular

communities and families—is important and is going to need to be acknowledged. We’re going to have to start having some honest

conversations about the reason why we don’t know as much about certain groups is because we haven’t studied you to date and we need to

be figuring out other ways to demonstrate reciprocity and to reward participation that might be something other than returning individual

findings. And then, recognizing that all of this conversation, all of this careful work, all of this engagement with populations and communities is

basically the research community asking participants for their trust, and that if we’re going to ask participants for trust, we need to be

demonstrating trustworthiness, and that is quite difficult and complicated, but I think it’s doable. I am very privileged to be a part of a study out of

Morehouse University called MH-GRID, Minority Health-Genomic Research Infrastructure Development project, led by Gary Gibbons, which

is going to be looking at African Americans, doing exome sequencing in African Americans in the context of hypertension risk and where we’re

going to be doing some ethical research, talking to the participants as part of that study about their attitudes towards and preferences with regard to

participation in exome sequencing research. The data that we have to hand, other data about just general African American attitudes towards

participation in genetic research is that African Americans are as likely as whites to express their willingness to participate, and yet, also are

more likely to report feeling that genetic research is going to possibly result in higher insurance, to not benefit their communities, to promote racism,

and to use minorities as guinea pigs. These perceptions are real; they must be adequately addressed and dealt with in the context of our

research engagements. At the same time, we need to recognize that in many cases the greatest barriers to the participation of the communities

that we want to involve may not be due to an inherent mistrust of research, but rather to a lack of access to healthcare and research

opportunities, and so we need to be going out of our way to make ourselves available. And we also need to be thinking about research

participants as true stakeholders in the research process. Based on some work that we did out of Group Health Cooperative, not with ethnic

minority communities—with white, middle-class, well-off, well-insured participants—we’ve come to some conclusions about ways in which we

might or steps we might take in the context of genomic research to enhance respectful engagement. A lot of this has to do with things

that Julie talked about in her presentation this morning of maintaining ongoing communication with participants; thinking of informed consent as

a process; making sure that we stay in regular contact; we tell participants what we’re doing, why we’re doing it, and the ways in which we

hope it will be beneficial to themselves and, ultimately, to their broader communities. We need to have ways to provide access to the

information about how samples are being used. We need transparent, accountable oversight processes, which is why some of the information

that Laura Rodriguez talked about last night is so incredibly important. Even as we’re asking people to sign on to have their data be placed in federally

controlled, semi-public access repositories, we need to be explaining to them why that is necessary and how their data are going to be

protected and not misused, and we need to continually provide opportunities for research participants of all stripes to provide direct input on

the stewardship of their data. And I know that’s something that we’re not really used to in the genomic research community, but we need to be

moving in that direction. And then in those cases where such ongoing engagement or re-contact for particular research uses is not feasible, we

need to be developing other effective methods to communicate with the public about responsible realistic study procedures, including thinking

comprehensively about ways to educate the public about the need to conduct research of this kind prior to any research ask in the line of the

question that we had earlier today. So, not a lot of hard data, I know, mostly speculation, but what I hope you will go home with are three conclusions

that I have come to as I’ve begun to think more deeply about the implications of this research for different ethnic groups and communities,

particularly in the United States context. Whole genome approaches to complex kidney disease are going to require the participation of large and

diverse cohorts. We are not simply going to be able to go to our freezers; new recruitment is going to be needed and new recruitment and

engagement strategies are going to be required. “Making sense” of the information that we are going to generate in the context of our research

is not going to be straightforward and we’re going to have to exercise great care in describing those findings and returning results to individuals

and to their families. It’s posing a whole new set of issues, and yet ironically in the way that was pointed out in the question-and-answer at the

end of the day yesterday, in some ways returning us to sets of ethical concerns that we grappled with a long time ago when we were

first working in the realm of linkage analysis. And in my opinion and in the opinion of my colleagues at the Center for Genomics and Healthcare

Equality, ongoing respectful engagement which is attentive to and attends to community-specific concerns is ultimately the way in which we’re

going to enhance research participation and reduce the perception of harm and actual harms involved in the conduct of such research. So,

thank you very much for your attention…just some of the various organizations that I’ve been involved in. I’ve also had a consulting role in the

Exome Sequencing Project as well as MH-GRID as I mentioned. So thank you very much. I wowed you all. Yeah?

ROBERT KLETA: Robert Kleta, University College London. If I may, I just want to make a comment about whole genome approaches, complex

kidney disease and large, large cohorts because my impression here is that you and others yesterday left the community in the room a little bit

under the impression that we can’t move forward for rare diseases and the point I’m trying to make is, I think there’s a little bit of disconnect with rare

variant and rare disease. So what I’m trying to say is, if you do linkage studies, then 4 to 10 samples—even recessive or dominant

fashion—can net significant findings and you have the locus and you find the gene and you know what’s going on and that can even be true for

complex kidney diseases. If you go to genome-wide association studies—and I think that’s what now many people in the room think about—then I

take the liberty also of saying actually you just need a well-defined cohort and it can be as complex as it be, but 100 samples are enough to

find the locus and the genes of interest. Even two genes can be involved and can be very complex. So what I’m afraid is happening from the

advent of genome-wide association studies and common diseases is it’s in everybody’s mind, oh, you need huge, huge, huge cohorts, and I think it

would just be sad if this paradigm is now continued for complex kidney diseases which may just be complex because of the nature but

not because of the approach that should be taken. So again, to try to make the point, I think nothing of what has been said yesterday and

today is wrong, but there’s a disconnect in understanding using these tools in terms of what do we do and what’s right in terms of the biology

or the theoretics behind it? Again, in my opinion, you need well-defined cohorts but then actually you need small cohorts to move forward. You do

not need 1,000 or 2,000 or 10,000 samples. If you study common disease, where you, indeed have lots of noise and many rare variants playing to

different pathways, then sure. That was the disaster of the past years of genome-wide association studies, but I think this should now be

over and we should understand how we could use the tools. So again, no offense intended. If I would know anything—and I take pride, actually,

being trained here at the National Human Genome Research Institute in genetics—I would now have the message, go home, okay, we have 100 or

200 samples of a disorder I’d like to study. I understand that’s my cohort but I’m saying, look, the first Science paper, genome-wide association

study 2005 by Klein and colleagues, 86 samples. I will be presenting tomorrow in the NEPTUNE study our approach to membranous nephropathy;

people in the room probably agree that’s a complex disease. 75 samples we find the first locus, 150 we find both loci. Sorry, I just thought I

want to put this here in front of everybody. Thank you.

STEPHANIE FULLERTON: Yes, thank you, and I would actually really invite those of you who are more sort of up on the methodology to sort of

comment on that point. I will say, though, where I am coming from as a bioethicist who interacts with communities around making kind of the case

for research participation and as someone who is enormously concerned that as we generate genomic knowledge, we do not generate

knowledge which will preferentially benefit certain populations and not others. The fact that from a methodological standpoint we can adopt a

study design which only requires a small number of samples and which will get us definitive information on a small number of genes, does not

begin to address the population burden, the pronounced disparities in kidney disease outcomes that are present in this country. And so

even while such designs as you talk about might be tractable from the point of view of identifying particular rare etiological contributions, that is not

going to make a dent in that public health problem, and that is the problem that the communities that I interact with care about, right? And so, we have

to start figuring out how to do both at the same time and I think this is a complicated issue.

MALE: That was a lovely talk and I want to bring you back to, I think, what was the title of the talk about ethnicities and communities and populations

and I worked with Jeffrey Kopp on a beautiful study where we identified an important locus for renal disease in HIV infection. When we designed

that study we were concerned about the ethnicities and we asked that all patients—because it was proposed to be African American

patients—have four—I think that’s as many grandparents as you’re entitled to—four African American grandparents. We’ve also been

involved in the FIND study, which when we started it was particularly looking at different ethnicities because we thought that the

expression of renal disease might be very different, both in its genetic susceptibilities and bases and in its clinical expression in different

populations. Now I’m wondering, as people from the 50s and 40s are dying out and we have a new sort of multi-ethnic character in populations

in the United States, what’s going to happen to our principal component analyses? What’s going to happen to our idea about using identified—self-

identified—ethnic backgrounds when we don’t know what those mean anymore or people are fluid in their identification of ethnicities? So, that

should be a fun question.

STEPHANIE FULLERTON: Thank you. I mean,

that’s a great question and we had a preliminary answer and I know you weren’t able to be with us for a lot of the day yesterday. There was a

preliminary answer from our analytical people, and Suzanne Leal in particular, showing the ways in which we use the aggregate genetic

information to say things about population genetic background, and increasingly in the genomics community, we use this information in preference

to self-reported ethnicity. And yes, that’s absolutely relevant and important from the point of view of controlling for background population

allele frequency differences that might be confounding an association analysis. It becomes more problematic, and this gets to topics that I

talked about a couple of years ago when I last spoke at an NIDDK meeting, this whole issue of it is not bags of genes that walk into doctors’ offices, but

it’s people who are ascribed to particular racial and ethnic identities. And so even as we can use the genetic information and people’s backgrounds

to kind of put them in the right bin for our genetic analyses, there’s still very, very hard questions about then how that translates back out to the

people of mixed ethnicity who are making up an increasingly large part of the population, and we had this sort of interesting, from my point of view,

sort of hopeful response that as invariably in this work there is a return to a more family-based ascertainment in investigation, that those

particular problems of population background confounding go away, because we can actually look very precisely at patterns of variation in the

family and that there’s less of a concern about at least controlling for that background population in genetic variation. I don’t exactly know how to

evaluate those claims. I still think they’re very interesting. I still think we’re going to have problems with recruitment and with making sense

of the functional significance of these findings in a body that has a collection of genes and that walks through the world with a particular

understanding of its place in the world, and that’s where it gets very complicated.

MALE: [inaudible question from audience]

STEPHANIE FULLERTON: Sure. Yes, exactly, and

it’s very complicated and I think this is…as you know, my preference is to kind of keep self-reported ethnicity as a variable—as a covariate—

in our analyses even while we’re controlling for genetic population background. But still, then how to use this information in a public health context

becomes very, very complicated. Yes?

GERJAN NAVIS: Gerjan Navis, Groningen. I

recognize and appreciate the complexity of this self-reported ethnicity but there are relevant biological variables that go along with ethnicity

and that’s lifestyle, and of course, it might be more easy to grasp the lifestyle differences between people of varying ethnicity, whatever

that may be, and that might also give us tools to intervene with, let’s say, risk behavior, and that also relates to the remark that was being made

about use of small, very well-characterized populations because, of course, it’s a lot of work to bring lifestyle and to document it properly and

to document it in a reliable fashion. But it might be a way around this very complicated issue of “what is ethnicity?” At least it’s something you

can measure and that’s actionable.

STEPHANIE FULLERTON: Yes, absolutely, and I

wholeheartedly agree. I think we need to be measuring very well, very comprehensively all sorts of factors that might be contributing to

disease incidence, and lifestyle factors are a very important consideration. There is also the very interesting issues of…the interesting issue

is: can we basically take ethnicity and sort of disaggregate it into its component pieces, its genetic pieces, and its lifestyle pieces? And I

know there’s a lot of interest in doing that. I and others are interested in that but would like to retain the social identifiers as well because we

also have this problem, which may be less of a problem in the Netherlands, I’m not sure, but it’s certainly a problem in the United States of racism.

And so, even as we account for and take advantage of information on lifestyle factors and on genes, there still might be value in holding in

play in our analyses the ways in which people walk through the world so that we understand better the ways in which social interactions intersect

with lifestyle and other behaviors, as well as genetic influences. We need to have it all.

GERJAN NAVIS: I think we agree and unfortunately yes, in the Netherlands racism is a problem also and a big concern, yes.

STEPHANIE FULLERTON: Yes, thank you.

SARA HULL: Thank you. That was a very

interesting presentation. It was great and some important points are being made about the impact of self-reported ethnicity and race on science

and interpretation and I think you also hinted at the inverse point that some of this research is going to have an impact on how people self-

identify. I just wanted to mention a very interesting and complicated ethics consult that we had recently that I guess I would characterize

as misattributed ethnicity and a finding where a SNP that had once been classified as being found only in a certain ethnic or racial group was

reclassified later on, and whether that kind of result should be communicated back to an individual who is receiving results. It launched a

very interesting conversation about self-perceptions and whether it’s racist to ask such questions in consults, but I think that’s just a

glimpse at the feedback and how this is going to go in both directions and the importance of being aware of the impact of emerging research of the

fact that, when you have initial results, that evolving results are going to change over time as you generate more and more information—your

very first point—and what that’s going to mean for how we communicate with people will have really important identity implications.

