Hello,
I download big raw data containing that is supposed to show minute per
minute quotation of an indice (SP500). The problem is that some quotations
have been included twice in the same minute (at some seconds of distance),
for example here
2013-10-11 13:47:51;1701.0601;1701.21;1701;1701;696500
2013-10-11 13:47:57;1701.02;1701.21;1700.99;1700.99;644100
2013-10-11 13:48:51;1700.97;1700.97;1700.7;1700.7;496500
2013-10-11 13:49:51;1700.67;1700.67;1700.53;1700.53;369700
2013-10-11 13:49:57;1700.6801;1700.6801;1700.52;1700.52;374800
2013-10-11 13:50:57;1700.51;1700.51;1700.35;1700.37;441100
I tried to clean that manually but it appears to be a huge work. I'd like to
have a smarter way to ask calc to keep only one data of each minute time,
like this :
2013-10-11 13:47:51;1701.0601;1701.21;1701;1701;696500
2013-10-11 13:48:51;1700.97;1700.97;1700.7;1700.7;496500
2013-10-11 13:49:51;1700.67;1700.67;1700.53;1700.53;369700
2013-10-11 13:50:51;1700.47;1700.47;1700.35;1700.35;444900
The hard part is that not all minutes have twins (erasing one out of two
would'nt do) : here only 13:49 was twined.
How could that be solved ?