IWETHEY v. 0.3.0 | TODO
1,095 registered users | 2 active users | 0 LpH | Statistics
Login | Create New User
IWETHEY Banner

Welcome to IWETHEY!

New 20% of genetics papers checked have errors due to Excel reformatting fields.
https://www.washingtonpost.com/news/wonk/wp/2016/08/26/an-alarming-number-of-scientific-papers-contain-excel-errors/

A team of Australian researchers analyzed nearly 3,600 genetics papers published in a number of leading scientific journals — like Nature, Science and PLoS One. As is common practice in the field, these papers all came with supplementary files containing lists of genes used in the research.

The Australian researchers found that roughly 1 in 5 of these papers included errors in their gene lists that were due to Excel automatically converting gene names to things like calendar dates or random numbers.

You see, genes are often referred to in scientific literature by symbols — essentially shortened versions of full gene names. The gene "Septin 2" is typically shortened as SEPT2. "Membrane-Associated Ring Finger (C3HC4) 1, E3 Ubiquitin Protein Ligase" gets mercifully shortened to MARCH1.

But when you type these shortened gene names into Excel, the program automatically assumes they refer to dates — Sept. 2 and March 1, respectively. If you type SEPT2 into a default Excel cell, it magically becomes "2-Sep." It's stored by the program as the date 9/2/2016.

Even worse, there's no easy way to undo this automatic formatting once it has happened. Edit -> Undo simply deletes everything in the cell. You can try to convert the formatting from "General," the default, to "Text," which you might expect to change it back to the original characters you enter. But instead, changing the formatting to "Text" makes the cell contents appear as 42615 — Excel's internal numeric code referring to the date 9/2/2016.

Even more troubling, the researchers note that there's no way to permanently disable automatic date formatting within Excel. Researchers still have to remember to manually format columns to "Text" before you type anything in new Excel sheets — every. single. time.

But even the genetics researchers among us are only human, and they sometimes forget to do this. Hence, you end up with 20 percent of these genetics papers containing preventable errors introduced by Excel.

The Australian researchers note that this problem was first identified in a paper published more than a decade ago. "Nevertheless, we find that these errors continue to pervade supplementary files in the scientific literature," they write.


:-/

I'm sure this is a ubiquitous problem in many, many fields. Remember Reinhart and Rogoff.

(via G+)

Cheers,
Scott.
New never applies to climate change right?
always look out for number one and don't step in number two
New Nah, spreadshhets can lie, but pictures don't.
BBC: Surface water shifting around the Earth.
The team found that vast areas that were once land are now submerged beneath water, with the largest change occurring in the Tibetan Plateau, where melting glaciers are creating huge new lakes.
Alex

"There is a cult of ignorance in the United States, and there has always been. The strain of anti-intellectualism has been a constant thread winding its way through our political and cultural life, nurtured by the false notion that democracy means that "my ignorance is just as good as your knowledge."

-- Isaac Asimov
New 'SEPT2
I blame Visicalc :-)
New :-)
As others have noted, LibreOffice Calc (v5 anyway) doesn't do this "automatic date" formatting for SEPT2 and MARCH1 - contra the WP story.

Cheers,
Scott.
New LibreOffice Calc doesn't do lots of things.
For good or ill, Excel is top of the spreadsheet dunghill for a reason.
     20% of genetics papers checked have errors due to Excel reformatting fields. - (Another Scott) - (5)
         never applies to climate change right? -NT - (boxley) - (1)
             Nah, spreadshhets can lie, but pictures don't. - (a6l6e6x)
         'SEPT2 - (scoenye) - (2)
             :-) - (Another Scott) - (1)
                 LibreOffice Calc doesn't do lots of things. - (pwhysall)

The kids should've been walking to school with EMP lunchboxes.
60 ms