I-DEEL: Inter-Disciplinary Ecology and Evolution Lab
  • Home
  • People
  • Research
  • Publications
  • Blog
  • Open Science
    • Registrations
    • Registered Reports
    • Published Protocols
    • Preprints
    • EDI
    • Other
  • Opportunities
  • Links

Behind the achievement: a tale of luck and privilege

1/7/2023

0 Comments

 
​By Patrice Pottier

​​Recently, I was awarded the E&ERC Outstanding Postgraduate Researcher award, which commemorates exceptional achievements from PhD students at the Evolution & Ecology Research Centre (E&ERC). 
 
While overwhelmed with pride and gratitude, I believe it was also time for me to reflect on the circumstances and privileges that paved the path to this accomplishment. It is rather common to bask in the glow of success, yet rarely do we show our weaknesses, and the bumpy journeys that often lead to these accomplishments. I invite you to interpret my reflections in your own way, while acknowledging that every path is different and that the role of luck and privilege vary greatly between individuals.
 
Let's start from the beginning. I was born a mentally and physically abled white male in the vicinity of Lyon, France. My mother was a first-generation immigrant from Russia, and my father was born in Algeria and later grew up in France. Shortly after my birth, my parents decided to relocate to Pointe-Noire in Congo, an environment that would soon be marred by the Republic of the Congo civil war. My sister was brought to the world in the midst of a civil war, etching a profound influence on our upbringing. After four years in Congo, my parents, fearful of escalating conflicts, moved us back to France.
 
Like many scientists, my fascination for nature was innate. However, personal events distanced me from this passion until early in my adulthood. I moved cities every few years, preventing me from building lasting social connections. My father's work also took him away for months, sometimes years, culminating in a drawn-out, emotionally draining divorce. This turbulent period forced me to prioritise my mental health over all else – my quest for understanding biology took a back seat. The divorce also placed a financial burden on my mother, making daily sustenance a recurring struggle. This financial insecurity also separated my mother from her other son in Congo - a rift that, two decades later, remains.
 
Undeniably, my upbringing was fraught with economic and emotional trials. However, this telling isn't a bid for comparison. Each of us encounters significant difficulties, some silently destructive despite their lack of obvious tragedy. I'm not insinuating that my journey was harsher than most - quite the contrary. Frankly, I was deeply lucky, and privileged. My upbringing in a secure country with a great health and educational system was a luxury denied to many. University was not a conscious and motivated choice, but more a refuge from indecision. I gravitated towards Biology simply because it resonated with me, and as university education was virtually free, it seemed like the logical choice. I chose this path by default.
 
While my affection for biology now is unwavering, this was not an immediate realisation. My initial two years of university did not captivate me as much as I had hoped. My interest was selectively inclined towards courses concerning ecology and evolution, and the image of myself as a scientist was a blurry vision at best. I also did not think I was good enough to become a scientist. Yet, I stayed because I lacked other options. Luckily, accumulating tuition debt was absent in my journey. If I did, I certainly wouldn't have chosen a university path, and for this, I bear gratitude to the French public university system. Eventually, I fell in love with biology, and it is hard for me to imagine how I could possibly break this bond. I crossed paths with exceptional academics who sowed the seeds of belief that I too, can make a significant contribution to the world. The consistent guidance and encouragement from these remarkable scientists were truly transformative, and for that, I am eternally grateful.
 
You might wonder, how does someone who just stumbled upon his interest in science end up pursuing a PhD? The answer is simple: luck and privilege. Yes, I was dedicated, hardworking, and increasingly captivated by my field, but so are many others. I was not working harder or smarter, nor was I more talented. Most of my opportunities came from being at the right place at the right time, meeting the right people.
 
Towards the end of my undergraduate degree, I faced a daunting uncertainty about my future. My understanding of research was obscure as I was a first-generation scientist, and a PhD seemed an insurmountable challenge. Yet I was pushed to explore this option. The turning point came when I was offered an internship with Marlene Goubault, an associate professor at the University of Tours. This experience sparked a passion for research, and fully re-ignited my genuine interest in Biology.
 
Upon this realisation, I recognised that a master’s degree and a PhD were my next steps. I pursued a master’s program at the same University, and I genuinely loved every second of it. This degree led me back to working alongside Marlene and her PhD student, Anthony Mathiron. During this time, I met a visiting professor from the University of Alabama, who happened to be Anthony's PhD co-supervisor, Ryan Earley. After a few beers at a pub and some good laughs, Ryan invited me to work with him. I was thrilled, but there was a hick. How could I possibly afford moving to the other side of the world for 6 months, with no income? Luck and privilege were again on my side. A bank agreed to lend me a loan, and Ryan pressured his university to pay me a small, yet life-changing, compensation. The stars aligned, and I was ready for more research.

This certainly wasn't easy, however. My English proficiency was not great, and I had to run a research project, supervise undergraduate students, write, and build a social life in a new language. Yet I was privileged enough to have lived in a country where English is taught from a young age. I cannot imagine having to face these challenges with little to no understanding of English - it was already hard enough. Despite the challenges, Ryan’s continual support was my beacon of light. Thank you, Ryan.
 
After this internship, I had no doubts I wanted to do a PhD. I was lucky to stumble upon an advertisement for a PhD program in the I-DEEL lab with Shinichi Nakagawa. To be honest, I was not too familiar with meta-analyses, nor was I too familiar with Shinichi's work. However, I was armed with curiosity and eagerness to learn. I applied, although I had little confidence in the outcome. To my surprise, Shinichi shortlisted me, I believe, thanks to recommendations from my previous supervisors. In parallel, I also tried for an RTP scholarship at the University of Sydney, only to soon realise I had little chances because my university wasn’t ranked highly enough. I was encouraged to apply regardless, but doubts clouded my hopes and determination. I never thought of university rankings until then, and this realisation made me doubt my chances of success for the UNSW scholarship. I felt trapped because of the university I had chosen, something now beyond my control. All I could do is wait and hope for the best.  
 
Unexpectedly, I was awarded the scholarship at UNSW, which filled me with happiness, excitement, and above all, gratitude. Shinichi gifted his energy and skills to put the odds in my favour. He devoted hours to ensure my application was competitive. I was also lucky my previous research experience with Marlene and Anthony resulted in two co-authorships. It's not lost on me that numerous individuals dedicate an often-higher level of effort in other labs without receiving the same recognition. I was at the right place, and surrounded by the right people, and I recognise that many did not have this chance. Thank you, Marlene, Anthony, for being so inclusive. Thank you, Shinichi, for your help in the application and for believing in me. 
 
There I was, ready to embark on my PhD journey in Sydney, yet financial constraints surfaced again. I had to buy a ticket to Australia, cover visa fees, and pay for an overseas student help cover upfront. I did not have this money. Though, I was lucky enough to be financially supported by my father to follow this journey.
 
My luck stretches further. My supervisors, Shinichi and Szymek, proved to be extraordinary individuals, far removed from the horror stories of negligent or abusive mentors. Thank you both for your dedication and empathy. Another turn of good fortune was being granted a generous scholarship, which allowed me to focus solely on my research without having to juggle additional jobs. With this funding, I could organise workshops, attend professional courses, and go to conferences without applying for extra funding. I could also do most of my work remotely, which was a huge privilege when covid hit. Therefore, it is no surprise that I had more time to focus on my research and be productive. Frankly, I believe such an exclusive scholarship should not exist - all PhD students deserve these privileges. We all deserve a living wage.
 
Reflecting on my journey, the evident narrative is not of an individual striving against odds, but rather a story of privileges and supportive networks. The cascading series of fortunate circumstances stemming from my French upbringing, the right connections, and sheer luck, guided my trajectory. If anything deviated along this path, I would not be here telling this story. 
 
Comparing my journey to my Congolese half-brother, Roderick, illuminates these privileges in action. Despite working harder than I ever did, Roderick never achieved financial stability due to his background and lack of opportunities. None of my experiences compare or outweigh the struggles others with less privileged backgrounds face. Roderick is one of the most dedicated people I know and is sacrificing nearly everything to live from his passion, animation. If he had decided to follow an academic path, I am convinced he wouldn't have made it - even if he worked orders of magnitude harder than any other students. Perhaps I should ask, how many Congolese academics do you know? Do you think there aren't many because of a lack of abilities, or a lack of privileges and opportunities? 
 
Perhaps the take home message here is to recognize privilege. Academic expectations must be recalibrated to account for individual circumstances rather than a sole focus on university rankings, prestige, and research output. True achievements are only understood when viewed through the lens of privilege. I hope for a future where equal opportunities are extended to all, irrespective of their background. For this change to happen, we require a collective change in perspective, a genuine understanding of the myriad of challenges individuals face. I firmly believe in equity over equality, a sentiment I hope we can achieve in Academia and beyond.
Picture
Roderick and I in Congo. Same mother, different privileges.
0 Comments

The moral conundrum of Generative AI, is transparency the answer?

17/6/2023

0 Comments

 
by Kyle Morrison
Picture
Image source: Pixabay.com
Generative AI technologies like OpenAI's ChatGPT and Google's Bard have taken the world by storm since the end of 2022. These tools have transformed the way we approach many tasks, from your weekly shopping trip to drafting academic manuscripts. Generative AI has the potential to democratize knowledge for budding scientists by bridging knowledge gaps and enhancing language translation. 
​
As science explores this new frontier, it is evident that generative AI is revolutionizing not only scientific manuscript writing but also the way we think of authorship and integrity of scientific research. Along with the endless implementations of these powerful tools, the use of Generative AI in science poses novel ethical and practical challenges. For instance, how do scientists with dealing with sensitive data use AI? How can science prevent the potential misuse of AI to generate misleading information or potentially data fabrication?

The rapid emergence of generative AI has sparked debate within the scientific community, as to how and if we should use these tools in academia. This issue has left many scientific journals in a quandary about how to regulate the use of generative AI in their publications. Whilst at this uncertain time it is difficult to predict the correct approach as to how to regulate the use of Generative AI. Some Journals have implemented policies requiring authors to disclose their use of generative AI and to take responsibility for the accuracy of the content they generate. For instance, Water Research has stated “Where authors use generative artificial intelligence (AI) and AI-assisted technologies in the writing process, authors should only use these technologies to improve readability and language”. And they have asked authors to provide a declaration of generative AI in scientific writing with all articles who have used it, for example:
 
Declaration of Generative AI and AI-assisted technologies in the writing process
During the preparation of this work, the authors used GPT 4.0 and GPT 3.5 created by OpenAI, and Bard by Google to enhance clarity, readability, and flow of writing. After using the tools, the author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the publication.

 
Moving forward, it is important that science navigates this transitional time with caution, fostering fruitful discussions across multiple disciplines. In these discussions, we must weigh the remarkable potential of generative AI in science against its possible risks to scientific integrity. Although, it is too early to say how science will ultimately regulate the use of AI, it is clear that transparent reporting is a useful way for balancing innovation and ethics. As scientists it is a collective responsibility to tame the beast of generative AI, and our actions now will define the future of AI in the scientific method.
 
And here is my declaration:

Declaration of Generative AI and AI-assisted technologies in the writing process
During the writing of this blog, the author used GPT 4.0 by OpenAI to enhance the readability and flow of the writing. The author reviewed and edited the content as needed and takes full responsibility for the content of this work.
​
0 Comments

Using GitHub Desktop to work with RStudio and GitHub

31/5/2023

0 Comments

 
by Shinichi
Picture
We earlier wrote this post to show how we can use SSH key to connect GitHub with RStudio. This post was quite popular, but I think I give up teaching how to use SSH to connect to GitHub now, as I had too much trouble with it, especially teaching the course I coordinate (Big Data in BEES). For example, to directly access GitHub via RStudio on a Mac computer, we need to have Git installed; this means one needs to also install the XCode which is a big piece of software, and not everyone’s computer has space left for it.
 
Now, I know,it is so much easier to connect to GitHub with GitHub Desktop – no SSH key required.
 
Thus, my new workflow is:

  1. Create a GitHub repo on the GitHub webpage (my profile page).
  2. Clone the repo my computer using GitHub Desktop to create a local folder (note that you need to log in GitHub Desktop using your GitHub username and password).
  3. Open a RStudio project using this local folder.
  4. Then, you can just use RStudio normally but commit, push, and pull using GitHub Desktop.
 
In RStudio, it was not so easy to resolve problems, for example when a committed file was too big to push. In GitHub Desktop it is more intuitive to resolve. Also, it is easy to open the GitHub repo page via GitHub Desktop.
 
Now, I teach the above workflow at my class too and it seems students could start using GitHub and RStudio together quite smoothly via GitHub Desktop!
 
Here is a good YouTube video more on Git, GitHub and GitHub Desktop (note this video dooes not use RStudio but it is still useful to watch).
0 Comments

Diversity in academia: are we doing enough?

29/4/2023

0 Comments

 
​ by Pietro Pollo
​

Picture
From https://www.tlnt.com/diversity-is-not-just-about-gender-and-race/

Following the last I-DEEL blog post (by Samantha), I’d like to continue the conversation about biases we see in academia (and elsewhere). I am glad to see this issue getting attention, especially because discrimination hits too close to home. I am even happier that I am part of a lab that is particularly open to this conversation, discussing what we can do through equity, diversity, and inclusion initiatives. Fostering an environment in which everyone feels comfortable is, after all, vital. However, is it enough to ensure our workplace is a safe place for all, if not all can get here?

Academia is full of people studying or working in a country they were not born or raised in. Australia is similar, as around half of its citizens’ parents were born elsewhere, making it a place filled with distinct cultures. However, the diversity in academia and in Australia ends up being superficial: the more powerful a position is (socially and professionally), the more likely it is to be filled by people of a certain phenotype and background (e.g. cis heterosexual white men from rich countries). This is a pattern seen everywhere, including in the country where I’m from (Brazil). This, in my opinion, ends up being the true root of the problem. Is there anything we can do about it?

First, research centres, universities, and PIs hold substantial power as they are responsible for hiring and recruiting new professors, researchers, and postgraduate students. Instead of relying on “objective” metrics (e.g. number of papers, journal impact factor, university rankings) to award those that have “done more/better”, they could consider candidates’ diverse backgrounds (in a meaningful and positive way). The metrics currently used to assess performance are not only problematic on their own (do they really reflect advances in science?), but they are also related to one’s phenotype and background for multiple reasons. Being born in a non-English-speaking country already represents a major constraint to people’s productivity, as they first need to dominate the language before being able to read the literature and write their research manuscripts. Thus, creating new ways to assess academics’ journeys that minimises bias is essential to build a diversity community.

Junior scientists are not exempt from responsibility either. Biases can occur when networking or reviewing manuscripts, even if unconscious. For instance, Fox et al. (2023) performed randomised trials in which they compared single- and double-blind reviews (i.e. reviewers did or didn’t have access to author information, respectively) in the journal Functional Ecology, showing that authors from developed countries received better review scores in single-blind reviews than in double-blind reviews. In other words, authors from rich countries were favoured by reviewers when their information was disclosed. Fox et al. (2023) also showed that editors, who can always see the authors’ information, sent manuscripts from developed countries’ authors for review much more often than manuscripts from others. These results were not because of language differences, as authors from both English-speaking and non-English-speaking developed countries were equally favoured. This means that even individuals with less power can perpetuate — or break — cycles of biased decisions.

Promoting diversity is a difficult task: there is no magical solution for the bias we see in academia. However, it is imperative to improve the current situation. I do not believe it should be a contest about who deserves more pity, but I also do not think we should ignore the barriers imposed on minorities and the clear advantages given to already privileged groups. It might seem unfair to deviate from a “pure meritocratic system” to seek alternatives, but meritocracy is an illusion when society is so unequal. If all of us could have the same opportunities that a rich, white British man like Charles Darwin had, perhaps more of us could advance science as much as he did.
 
References:
Fox, C.W., Meyer, J. & Aimé, E. (2023). Double-blind peer review affects reviewer ratings and editor decisions at an ecology journal. Functional Ecology, 1–14.
​
0 Comments

Happy Women’s History Month

26/3/2023

0 Comments

 
by Samantha Burke
Picture
Melbourne march in solidarity with US overturning Roe v Wade PC: Matt Hrkac 2021
This Women’s History Month, a friend asked me to contribute an audio recording about being a woman for a podcast. Anastasia Shavrova runs the podcast Conversations with Chordates (aka Convos with Chordates). Chordates is one of many science podcasts that I recommend to people looking for a new podcast, but Chordates has a much more relaxed and chatty style and sometimes delves into topics that are more tangentially related to science.

I’m incredibly honoured to contribute to this podcast alongside many amazing women. Please check out the episode Convos with Women.

Women’s history is incredibly interesting and integral to the growth of countries and society as a whole. However, like many minorities, much of women’s history has been left out from general historical education. I-DEEL is committed to open and transparent science though transparency shouldn’t be limited to science. I wanted to use my audio recording to highlight some of these moments in history to bring them back into the narrative. Please check out a blog post on my website that goes into this history in more detail.

This Women’s History Month, please take a moment to think about the transparency of women’s influence on the development of the world. In particular, consider the records (or lack thereof) of women’s involvement in science both historically and currently. Many women were not recognized as they should have for their involvement in scientific developments. See this list of the untold stories of women in STEM and the Netflix documentary Picture a Scientist.

I-DEEL has recently updated our policies around Equity, Diversity, and Inclusion initiatives to better our support for each other within the lab, as it’s incredibly difficult to separate one’s identities from their work. We encourage others to do similarly. Acknowledging differences marks the first step to equality (such as Scripps Institute publishing an internal review on gender disparities in lab space).

Happy Women’s History Month, and thank you to all the women who’ve advanced science to what it is today.
​
0 Comments

The non-consumptive effects of academia

28/2/2023

0 Comments

 
By April Martinig

From the start of my academic journey, I have found myself at a crossroads. Research is supposed to be all consuming, and it is, if you let it be. My problem, or rather, the problem I am told by others to have, is that it is not my only passion. 
Picture
I didn't quite grasp that this was an issue until a specific Wednesday during my PhD. Several hours into a lab meeting, I had to leave “early” to go to my varsity wrestling practice. I delayed going for as long as I could, but finally at 5 pm, already late, I had to go otherwise I would miss practice entirely. A lab-mate turned to me and very loudly said to the room how inappropriate it was that I was not prioritizing my graduate studies over all else. I still left the meeting, but their remark highlighted a sentiment[1] we don’t discourage often enough…

My lifestyle is not entirely different from pursuing a work-life balance; something many of us know, regardless of vocation, is already hard to maintain. Unlike what we might think of when hearing the latter, juggling two passions, fully and completely, means having very little time for life outside of them. But, without wrestling, I wouldn't be able to conduct research and without research, I wouldn't be the wrestler and coach I am today.

However, this dichotomy poses a fundamental issue as I pursue success in academia. As I continue to move up the academic ladder, the duality within me (i.e., being wildly passionate about something else in addition to research) does not seem to exist in “successful” academics. To be successful - to make it in the field - I find myself confronted with examples of researchers that work long hours and make sacrifices I deem too costly. This culture, which seems to continue to be glorified and perpetuated, is an example I won't buy into.
Picture

We shouldn't have to fight for space to be a balanced human. I can be a good academic and not make academia my whole life.

The crossroad is where I meet my passions, rather than follow them.
​
Picture

​[1] As the old adage goes: one that chases two rabbits ends up with none.
​

0 Comments

Can ChatGPT do screening for a systematic review? Yes and more!!!

2/1/2023

0 Comments

 
by Shinichi Nakagawa

Before my Xmas break, I met ChatGPT (Generative Pre-trained Transformer). Since then, she has been my teacher, wise but admits her mistakes. Also, she is humorous (when I ask her to be) and very patient.
 
I decided to see whether ChatGPT can actually do the first stage of screening, i.e. title and abstract screening. After negotiating with her for a few hours, I cracked the code and passed her a carefully worded selection criteria based on PECOS: Population, Exposure, Comparator, Outcome and Study design. And there she was. ChatGPT was telling me whether I should exclude or include a particular study after evaluating a study’s title and abstract.
                                        
I used lists of studies and criteria related to  this protocol: 
 
Vendl C, Taylor MD, Braeunig J, Gibson MJ, Hesselson D, Neely GG, Lagisz M, Nakagawa S. Profiling research on PFAS in wildlife: Protocol of a systematic evidence map and bibliometric analysis. Ecological Solutions and Evidence. 2021 Oct;2(4):e12106.
 
What amazed me was that ChatGPT matched the study with our criteria and summarized reasons. Wow, this is better than I can do (see examples: one recommending inclusion and the other recommending exclusion = both are spot on!)
Picture
Picture
Picture
I tested for around 15+ abstracts and ChatGPT was able to reproduce our decisions. So, I stopped there and then started to test whether she can extract some data from the text. This turned out to be more difficult as ChatGPT does not seem to take more than ~2,000 words as an input (although she claims there are no limits). Anyway, as long as I do not give her too much text, ChatGPT seems to be able to extract what animals were studied, PFAS chemicals and locations mentioned,  in a format below:
Picture
​That is all astounding. But some questions remain. How reproducible is this data? Can we make this process much more systematic?
 
I am hoping to work with a computer scientist and see whether some of these processes can be automated for multiple articles. We are entering an exciting but uncertain time. One thing I can say is that I will be trying to incorporate ChatGPT into some parts of my systematic review workflow from now on, not as a replacement for a human screener but as an addition for now.
0 Comments

DuPont loses challenge over cancer victim's $40 mln verdict in PFAS case

19/12/2022

0 Comments

 
by Lorenzo Ricolfi
Picture
Image by 3D Animation Production Company from Pixabay, modified by adding pfas picture from https://www.setac.org/page/PFAS

On the 5th of December, a federal appeals court of the United States released the verdict on the legal litigation between the American multinational chemical company DuPont and a cancer survivor.

The official verdict document is available here and the full article by Clark Mindock (Reuters) is available here.

Per- and polyfluoroalkyl substances (PFAS) are a group of manufactured chemicals used in various industrial and consumer products. They are also known as "forever chemicals" because they do not break down easily in the environment or the human body. Some PFAS chemicals, including perfluorooctanoic acid (PFOA) and perfluorooctanesulfonic acid (PFOS), have been linked to specific health effects, including cancer, immune system effects, and developmental effects in infants and children.

In this case, the plaintiff, Travis Abbott, claimed that prolonged exposure to PFOA in his drinking water caused him to develop testicular cancer twice. A jury awarded him $40 million in damages after finding that PFOA was likely the cause of his illness. DuPont, the chemical manufacturer, had argued that Abbott's level of exposure was unlikely to have caused his cancer and had challenged the verdict, claiming it had been unfairly kept from raising defences based on the specifics of Abbott's alleged exposure.

The 6th Circuit Court of Appeals has upheld the jury's verdict, stating that DuPont could not challenge the decision, which relied on a finding in related cases that PFOA was linked to the man's cancer. This case is one of the thousands consolidated in multidistrict litigation (MDL) in Ohio, which claims that DuPont poisoned drinking water by discharging PFOA into waterways from its plant in West Virginia.
Picture
Picture from https://btlaw.com/insights/blogs/fast-facts-what-is-pfas
Legal disputes over the adverse health effects of environmental contaminants can be tricky for several reasons. One of the main challenges is the difficulty in proving a causal link between exposure to a particular contaminant and developing a specific health condition. This is because many factors can affect an individual's health, and isolating the effects of a specific environmental pollutant can be challenging. Additionally, the impact of environmental contamination may not manifest for many years, making it difficult to determine the exact cause of a specific health condition. In many cases, the burden of proof is on the plaintiffs to demonstrate that their health problems were caused by exposure to a particular contaminant.
Picture
Photo: M.Lagisz
Scientific research plays an essential role in all of this. By studying the chemical and biological properties of specific contaminants, researchers can better understand how these substances interact with living organisms and their surroundings. This information can be used to develop strategies for addressing and mitigating the effects of environmental contamination, such as identifying the source of contamination, developing methods for cleaning up contaminated sites, and implementing policies to prevent future contamination. Additionally, research can help identify the specific health effects of different contaminants and provide guidance on protecting individuals from exposure.
​
By providing a solid evidentiary base, scientific research can help establish the link between environmental pollution and adverse health effects, which can be crucial in determining the responsibility of polluters and the appropriate remedies to be taken.
0 Comments

Split reference list helper for pilot and collaborative screening rounds

30/11/2022

0 Comments

 
by Coralie Williams

When screening for a systematic review or meta-analysis, we conduct several pilot screening rounds. Pilot screenings help us refine our search string, decision tree, and increase the overall accuracy of our screening for literature reviews [check out this nice guide from the I-DEEL team for more info: Foo et al, 2021].

During a pilot screening, we want to select a random subset of references that would be a representative sample of the full set. When possible, screening rounds are conducted in collaboration with another reviewer. To speed up the screening process, we sometimes want to randomly allocate a subset of papers to a collaborator by splitting a reference list into subsets.

There are two reasons we’d want to automate the selection and splitting of a reference list:
  1. It is time consuming to randomly select papers (>100 papers is tedious to select by hand!)
  2. We are not really good at selecting things at random (actually computers aren’t really good at selecting truly at random either*)

​Below is the R (www.r-project.org) code to run two functions that may come in useful when conducting your pilot and collaborative screenings with Rayyan (https://rayyan.ai/), or any other software where you can upload your pilot reference list.

1. Select random pilot set:

First, load the getpilotref function below in your environment:
​
# -----------------------------------
# getpilotref function 
# -----------------------------------
## Description: 
#     Function to obtain a random subset of references for pilot screening.
#
# Arguments
# - x: data frame with reference list
# - n: number of papers for pilot subset (default is 10)
# - write: logical argument whether to save the pilot list as a csv file 
#   in the current working directory (default is FALSE).
# - fileName: name of file (default is "pilot")

getpilotref <- function(x, n=10, write=FALSE, fileName="pilot"){
  
  if (length(n) == 1L && n%%1==0 && n>0 && n<=nrow(x)) { 
    
    # sample randomly the vector n of row indexes and remove id column in the final dataset
    x$ids <- 1:nrow(x)
    pdat <- x[which(x$ids %in% sample(x$ids, n)),]
    pilot <- pdat[,-which(colnames(pdat)=="ids")]
    
    } else {
      # error message n value provided is not valid 
      stop("Incompatible value n supplied, please check. 
      #n must be a positive integer no higher than the total number of references provided.") 
    }
  
  if (write==T){
    
    # save generated pilot list in working directory using the name provided
    write_csv(pilot, paste(fileName, ".csv", sep=""), na="")
    
    # print out summary of saved file name
    cat(paste("Pilot random sample set of ", n, " articles is saved as: ", fileName, ".csv", sep=""))
    
  }
  
  return(pilot)
}
​Let’s try it out
Load example csv file that was exported from Rayyan (a reference list of papers in Ecology & Evolutionary Biology having the word “butterflies” in their title):

# Read example butterfly reference list
articles<-read.csv("https://raw.githubusercontent.com/coraliewilliams/2022/main/data/articles_butterfly.csv")
​First, let’s obtain a random set of 10 papers without saving it as a csv file:
p10 <- getpilotref(articles)
Now, let’s obtain a subset of 100 papers for a pilot screening and save the subset as a csv file called pilot100.csv. Make sure you have the readr package installed and loaded in your environment.
library(readr)
p100 <- getpilotref(articles, n=100, write=T, fileName="pilot100")
## Pilot random sample set of 100 articles is saved as: pilot100.csv
This will save a csv file pilot100.csv in your working directory. If you are unsure where is your working directory run this command getwd() in your console.
2. Split reference list with another collaborator​

Load the splitref_prop function in your environment:
# -----------------------------------
# splitref_prop function 
# -----------------------------------
## Description: 
#     Function to split in two a reference list based on input proportions.
#
## Arguments: 
# - x: data frame with reference list
# - p: vector of two numerical proportions for each split, it must have two positive numerical values that sum to 1.
# - write: logical argument whether to save the pilot list as csv in current working directory.
# - fileName: name to give to the suffix of the two split csv files.

splitref_prop <- function(x, p=c(0.5, 0.5), write=F, sname="split") {
  
    if (length(p) == 2L && is.numeric(p) && sum(p) == 1 && all(p > 0)) {
      
      # randomly allocated a numerical id to each reference
      rids <- sample(1:nrow(x))
      
      # get index of row to split on using the proportion values provided
      spl <- floor(p[-length(p)] * nrow(x))
      
      # get indices of two data frames based on split ids
      indx1 <- rids[1:spl]
      indx2 <- rids[(spl + 1):nrow(x)]
      
      # save split subsets in two separate datasets
      split1 <<- x[indx1,]
      split2 <<- x[indx2,]
      
      # print out summary message
      cat(paste(c("Reference list was randomly split into",length(p), "proportions of", p[1]*100, "% and", p[2]*100, "%")))
      
      if (write == T) {
        # save files
        write_csv(split1, paste(sname, "_set1", ".csv", sep = ""), na ="")
        write_csv(split2, paste(sname, "_set2", ".csv", sep = ""), na ="")
        }
      
      } else {
      # error message if provided n value is not valid
      stop("Incompatible values for p (proportions) supplied, please check.
           Proportion values must be positive integers less than 1, and the total sum of all proportions should equal to 1.")
        
    }
}
​Let’s try it out
Using the example butterfly reference list, let’s first split the reference list in two equal splits (50% each):
splitref_prop(articles)
## Reference list was randomly split into 2 proportions of 50 % and 50 %
This will give you two separate data frames to share between two reviewers: split1 and split2.

Now let’s get 30% of references in the first subset (split1) and 70% in the second subset (split2), for example if one reviewer has more time to spend on the screening:
splitref_prop(articles, p=c(0.3,0.7))
## Reference list was randomly split into 2 proportions of 30 % and 70 %
Let’s save the 30% and 70% split list of references as csv files with the suffix “testsplit”:
splitref_prop(articles, p=c(0.3,0.7), write=T, sname="testsplit")
## Reference list was randomly split into 2 proportions of 30 % and 70 %
This will save two csv files, testsplit_set1.csv and testsplit_set2.csv, in your working directory.
Picture
*computers aren’t really good at selecting truly at random…Random number generators from most computer programs are actually “pseudo-random”, meaning they are produced from a deterministic mathematical model or algorithm. The R code above uses a pseudo-random number generator. Pseudo-random number generators are usually good enough for their intended purpose (basically better than what any human could do). A good pseudo-random number generator will reproduce statistics that are consistent with true randomness, but they are not truly random. A truly random number can be generated based on a constantly changing physical process that can’t be modelled as an algorithm. If you’re curious about true randomness check out these websites: https://www.random.org/; https://qrng.anu.edu.au/random-colours/.


(Any comments, questions or feedback, you can reach me at: [email protected])
0 Comments

Say goodbye to fixed- and random-effects meta-analyses

27/10/2022

0 Comments

 
By Yefeng Yang

As I have been doing more surveys on meta-analytic practices in many disciplines and re-analysing more published meta-analysis (MA) papers, there is one “recommendation” that is growing stronger and stronger in my brain. That is, we should say goodbye to traditional fixed- and random-effects MAs and conduct our MAs using advanced methods like multilevel and multivariate models because meta-analytic datasets are often multilevel and multivariate in nature. Doing so can make sure you properly handle statistical issues like dependency, and heteroscedasticity, resulting in more robust parameter estimations and inferences. My main argument is that in the “worst-case” scenario, where your dataset does not have a complex structure thereof, these advanced models will automatically reduce into a normal fixed- and random-effects models, all with similar (or identical) results to those expected. More importantly, applying advanced methods can help you decompose variances (Figure 1) and separate correlations of true effects from observed effects (Figure 2), delivering new biological insights. I can see the between-study heterogeneity and correlation are overestimated in many published meta-analyses using fixed-and random-effects models.
Picture
Figure 1. Imaginary example of hierarchical data structure.
Although these advanced methods are good, there are (at least) three remarks worth noting here. First, all your models should be built strictly based on predefined questions (e.g., a priori hypotheses). Second, before applying these models, you need to correctly understand the statistical theory behind them. Otherwise, you very likely disseminate misleading information if you published results from them. Third (but not the last), do not use complex models to fit a small-sample-size dataset. This is especially true for multivariate models, which are often heavily parameterized (even overparameterized). So, always do (at least some basic) model checking (e.g., likelihood profile, convergence) to ensure stability of your model fitting.
Picture
Figure 2. Joint probability distribution (bivariate normal joint density). Photo source: Multivariate normal distribution. (2022, October 16). In Wikipedia. https://en.wikipedia.org/wiki/Multivariate_normal_distribution
​As I have been knowing more about statistics, I realised that many methods are just a special form of a more general framework. For example, (two-sample) Student t-test is a special form of ANOVA, which is a special form of linear regression, which is a special form of generalized linear model or linear mixed model, which is a special form of generalized linear mixed model, which is a special form of the generalized additive mixed model. In the same vein, fixed-effect MA is a special form of random-effects MA, which is a special form of a multilevel or multivariate model. I can imagine that one might disagree with “say goodbye to fixed- and random-effects meta-analyses”. For example, fixed-effects MA can still provide valid inferences if limiting your results to the included studies (e.g., conditional inference). I acknowledge this is true as long as you are not goanna generalize results beyond the included studies. I know asking people to resort to complex methods is difficult because people like easily-understandable tools - just think about P-value. I am always open and happy to see different ideas. Lastly, all the above claims only represent my personal intuition and opinion (I might extend it into a paper in future). They might be wrong and do not necessarily speak for my lab’s attitudes toward meta-analyses.
0 Comments
<<Previous
Forward>>

    Author

    Posts are written by our group members and guests.

    Archives

    January 2026
    December 2025
    November 2025
    October 2025
    September 2025
    August 2025
    July 2025
    June 2025
    May 2025
    April 2025
    March 2025
    January 2025
    December 2024
    November 2024
    October 2024
    September 2024
    August 2024
    July 2024
    June 2024
    May 2024
    April 2024
    March 2024
    February 2024
    January 2024
    December 2023
    November 2023
    October 2023
    September 2023
    August 2023
    July 2023
    June 2023
    May 2023
    April 2023
    March 2023
    February 2023
    January 2023
    December 2022
    November 2022
    October 2022
    September 2022
    August 2022
    July 2022
    June 2022
    May 2022
    April 2022
    March 2022
    February 2022
    January 2022
    December 2021
    November 2021
    October 2021
    September 2021
    August 2021
    July 2021
    June 2021
    April 2021
    March 2021
    February 2021
    January 2021
    December 2020
    November 2020
    August 2020
    July 2020
    June 2020
    April 2020
    December 2019
    November 2019
    October 2019
    September 2019
    June 2019
    April 2019
    March 2019
    February 2019
    January 2019
    December 2018
    November 2018
    October 2018
    September 2018
    August 2018
    July 2018
    June 2018
    May 2018
    March 2018
    January 2018
    December 2017
    October 2017
    September 2017
    July 2017
    June 2017
    May 2017
    April 2017
    March 2017
    January 2017
    October 2016
    August 2016
    July 2016
    June 2016
    May 2016
    March 2016

    Categories

    All

    RSS Feed

HOME
PEOPLE
RESEARCH
PUBLICATIONS
OPEN SCIENCE
OPPORTUNITIES
LINKS
BLOG

Created by Losia Lagisz, last modified on June 24, 2015