Hello,
I am trying to set up a system where I have a plain text file to write experiment notes as metadata of that experiment. I recently attended to a seminar where the speaker talked that such file should be human and machine readable; but I am not getting there.
My goal is to make a file that works as a template for metadata of experiments, with a fixed header, and a variable body according to the experiment.
The problems I am facing:
- not sure how much ground I have to play with a file for it to be human and machine readable
- how can I deal with the fact that more variables are being added overtime, or
- more details for the same variable
- how can I make such a file that later on, if I want to do any analysis (like statistics) I could stack all the file of the same experiment (e.g. stack the 3 files of the 3 repeats of the experiment)
An example would help
Imagine that I have to observe a protein (protein-x) in cells using a confocal microscope after an immunocytochemistry experiment -- therefore, I will have to use a primary antibody and a secondary antibody, each with their respective dilution; skipping the details here for sake of simplicity.
This is how I picture the file to be like '.csv' (with random examples to fill in)
date, "20200220"
experiment, "test experiment"
cell_line, "U2OS"
target, "protein-x"
primary_antibody, "anti-protein-x"
primary_antibody_dilution, "1:500"
secondary_antibody, "anti-anti-a594"
secondary_antibody_dilution, "1:5000"
So far my problem is: if I use two 'primary_antibody', what is the best way to do it? Create another file, or add like:
primary_antibody, {"anti-protein-x", "anti-protein-b"}
Moving on, I would like log the file names of the raw data that correspond to a given microscopy slide; I have the same problem, and one more: I don't know in advance how many photos I will acquire for a given slide, and for different experiments I'll have different number of slides. As an example:
date, "20200220"
experiment, "test experiment"
cell_line, "U2OS"
target, "protein-x"
primary_antibody, "anti-protein-x"
primary_antibody_dilution, "1:500"
secondary_antibody, "anti-anti-a594"
secondary_antibody_dilution, "1:5000"
slide_a, {"-001", "-002", "-005"}
slide_b, {"-010", "-015", "-016"}
Hope that this is not too confusing.
What would be your advice? Keeping in mind that the less time making annotation and logs would be beneficial. In other words, what could here be automated to end up with a nice metadata file while keep the log in the lab sustainable; for example, in the lab its far easier and faster to write "slide_a", ""-001", "-002", "-005"
than having three files with the same (duplicated) info in the heading.
Thank you in advance.
Cheers,
Leonardo