Public
- Public
- Groups
- Popular
- People

Conversation

Notices

jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:36 UTC jartigag
in reply to

don't understand the size difference by now, though
In conversation Wednesday, 17-Nov-2021 20:45:36 UTC from mastodon.social permalink
Attachments
1. Untitled attachment
  https://files.mastodon.social/media_attachments/files/106/490/028/437/798/692/original/bfb372c252e5195a.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:37 UTC jartigag
  in reply to
  
  `copy from program '*csv'` is an even better trick
  In conversation Wednesday, 17-Nov-2021 20:45:37 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/490/022/172/876/839/original/18cf3cce06635c67.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:39 UTC jartigag
  in reply to
  
  `set default` is a good trick to add columns
  In conversation Wednesday, 17-Nov-2021 20:45:39 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/489/994/324/189/173/original/b9a25bf692dde0df.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:41 UTC jartigag
  in reply to
  
  let's move it to #postgresql
  In conversation Wednesday, 17-Nov-2021 20:45:41 UTC permalink
  Attachments
  1. copy from csv to postgres table, about 1-2 seconds each 16MB file
    https://files.mastodon.social/media_attachments/files/106/489/956/260/808/330/original/e33da06f1d3671cd.png
  2. update the same 778k rows in the postgres table, 12-13 seconds in total
    https://files.mastodon.social/media_attachments/files/106/489/957/113/041/456/original/759c0ab4e6774bc1.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:42 UTC jartigag
  in reply to
  
  just want to paste here the original raw data, for the record.
  i've only processed 2020-2021 data
  In conversation Wednesday, 17-Nov-2021 20:45:42 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/364/592/647/904/077/original/409a5f27f47aa91b.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:45 UTC jartigag
  in reply to
  
  comparing my machines. i don't know very much about cpus, but to me left-side seems better than right-side, isn't it? (except for number of cores and the highlighted line, but it should be irrelevant for this processing, since it isn't multithread).
  right-side processes faster, as you can see 🤷
  In conversation Wednesday, 17-Nov-2021 20:45:45 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/336/962/982/729/949/original/6e5251b53ef10ee8.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:45 UTC jartigag
  in reply to
  
  well, this could be the reason:
  left-side, hdd disk
  right-side, ssd disk
  🤔
  In conversation Wednesday, 17-Nov-2021 20:45:45 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/337/005/074/348/912/original/aaadd5d2ac647d68.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:46 UTC jartigag
  in reply to
  
  it's gonna be a hard work night
  In conversation Wednesday, 17-Nov-2021 20:45:46 UTC permalink
  Attachments
  1. 4 cores processing chess blunders on my laptop
    https://files.mastodon.social/media_attachments/files/106/332/068/333/892/794/original/5e14cdad476a178f.png
  2. 4 cores processing chess blunders on another of my available computers
    https://files.mastodon.social/media_attachments/files/106/332/069/730/456/789/original/b07de4f0cc9716e6.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:48 UTC jartigag
  in reply to
  
  let's put some sense on those dataframes. these some of the most common errors on <1500 elo #chess players:
  In conversation Wednesday, 17-Nov-2021 20:45:48 UTC permalink
  Attachments
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:49 UTC jartigag
  in reply to
  
  now i begin to see something 🎉
  apparently good players make less mistakes than not-so-good ones.. 🤔😜
  https://github.com/jartigag/chess-blunders/blob/master/notebooks/1.0-jartigag-explore_interim_data.ipynb
  In conversation Wednesday, 17-Nov-2021 20:45:49 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/325/741/108/842/572/original/894ec1924bc239aa.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:50 UTC jartigag
  in reply to
  
  to this (4x2% each 5 mins).
  i know, obviously parallelization is key. but in this case the important decision was to split raw data and review it manually, instead of wasting more time trying to automatize everything.
  In conversation Wednesday, 17-Nov-2021 20:45:50 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/325/214/686/169/487/original/c5d7d06fd10299a1.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:51 UTC jartigag
  in reply to
  
  such a simple "tweak" (much simpler than multiprocessing.Pool and anything else i've tried these days) make a decisive improvement in performance ✌️
  In conversation Wednesday, 17-Nov-2021 20:45:51 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/325/178/574/555/882/original/9171b56da340be17.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:51 UTC jartigag
  in reply to
  
  from this (1% each ~3h):
  In conversation Wednesday, 17-Nov-2021 20:45:51 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/325/197/082/423/775/original/2ed17a6d9c332580.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:52 UTC jartigag
  in reply to
  
  6h 😶 fortunately it was done in one night
  In conversation Wednesday, 17-Nov-2021 20:45:52 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/220/830/642/931/468/original/d8d981edd8c3d2fb.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:53 UTC jartigag
  
  recently i started an analysis project, using #lichess data. these screenshots were just downloading this year so far and uncompressing a month 🥵
  now i have to preprocess such a BIG data 😅
  In conversation Wednesday, 17-Nov-2021 20:45:53 UTC permalink
  Attachments
  1. wget to each file in https://database.lichess.org
    https://files.mastodon.social/media_attachments/files/106/212/315/002/693/258/original/60ad2006367a5f3a.png
  2. time uncompressing just one month: 2h 15mins
    https://files.mastodon.social/media_attachments/files/106/212/317/941/608/035/original/16ec443bab516890.png
- jartigag (jartigag@mastodon.social)'s status on Wednesday, 17-Nov-2021 20:45:53 UTC jartigag
  in reply to
  
  found a solution. #grep to the rescue! 🦸
  https://github.com/jartigag/chess-blunders/blob/master/data/raw/pre_preprocess.sh
  In conversation Wednesday, 17-Nov-2021 20:45:53 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/106/218/861/880/330/363/original/f3d8951c040a16f7.png
- jartigag (jartigag@mastodon.social)'s status on Sunday, 21-Nov-2021 23:44:33 UTC jartigag
  
  i knew #pandas wasn't a good idea with this #csv files..
  why i keep doing this to myself? 🤦♂️
  In conversation Sunday, 21-Nov-2021 23:44:33 UTC permalink
  Attachments
  1. diff in my pgn2csv.py script: now i use the csv library instead of pandas
    https://files.mastodon.social/media_attachments/files/107/317/697/460/820/351/original/8f998184e5d99af5.png
  2. results with a ~120MB input: with csv library: 2,5s with pandas library: 82,5s
    https://files.mastodon.social/media_attachments/files/107/317/698/400/602/978/original/10321dd06be02318.png
- jartigag (jartigag@mastodon.social)'s status on Monday, 22-Nov-2021 23:30:46 UTC jartigag
  
  right now, it looks like this (hopefully tomorrow i will have all the data)
  In conversation Monday, 22-Nov-2021 23:30:46 UTC permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/107/323/314/290/696/174/original/7b67be449f960cf5.png
- jartigag (jartigag@mastodon.social)'s status on Tuesday, 23-Nov-2021 21:27:16 UTC jartigag
  
  covid19-lockdown effect you say? 😁
  In conversation Tuesday, 23-Nov-2021 21:27:16 UTC permalink
  Attachments
  1. number of evaluated chess games on lichess.org during first half of 2020 (there's a noticeable increasement in march)
    https://files.mastodon.social/media_attachments/files/107/328/483/891/239/296/original/8843e708f5db9bce.png

Feeds