More

hudvin · on Aug 1, 2023

Location: Ukraine, Kyiv (safe mostly)

Remote: yes

Willing to relocate: want, but can't (

Technologies: Python backend stack, some computer vision, PDF processing (split, merge, ocr, text extraction, fixing all weird issues, mupdf, ghostscript - worked on pdf related projects), AWS

Resume:https://vadym.bartko.me/my_cv/

email: vadym.bartko@protonmail.com

hudvin · on Aug 18, 2022

Doesn't work very well. First of all there are several separate problems: 1) detect if OCR is required 2) image optimization 3) preprocessing of broken pdf files. And all of them are not easy: 1) page could contain selectable text, but text can't be copied because embedded font doesn't contain glyph->symbol code mapping. Mapping table could contain complete garbage. Sometimes page could contain long urls (added by email services) but all text is provided as image. Sometimes text contains normal text and garbage. And many many other cases. 2) some old scanners generate pdf documents built from 2-5 pixel image stripes. Some of them try to do OCR (poorly). Some of them uses huge DPI. Sometimes you get uncompressed doc in which each page could take up to 200mb. So you need to convert pdf page to image. But you have to choose format and compression options. PNG is ok, but you have to choose correct options (for ghostscript). But output image will be huge. JPG is better, but quality could be low. Sometimes multistage optimization is required. Also tools like ghostscript, fitz or imagemagic doesn't handle all possible pdf/image. 3)weird pdfs - endless story. Poor fonts, broken fonts, very specific cases in pdf standard, issues with image extraction, table of content, viruses, embedded files, annotations, margins/paddings/rotations/translations.

Probably I have to write long post about this )

hudvin · on Nov 19, 2021

I was thinking about similar service for emails/news/forums/social networks. Main idea the same - slow updates.

discardedrefuse · on Nov 20, 2021

For news and forum threads you might want to check out Fraidycat. Its a browser extension that handles your feeds (rss and some others). You can categorize feeds by importance (real-time, frequent, occasional, etc) and it updates the main page accordingly.

https://fraidyc.at/

dmitryminkovsky · on Nov 21, 2021

Fraidycat is really cool. I used it a bit. Have a twitter account already, but would probably have kept using fraidycat if I didn't have one.

hudvin · on Dec 27, 2020

Some info: I found this story while reading Fallen Giants (Maurice Isserman) - history of mountaineering in himalayas.

hudvin · on June 1, 2020

Location: Ukraine, Kyiv

Remote: Yes

Willing to relocate: someday

Technologies: Deep Learning/Python Backend stack - keras, pytorch, Flask, Docker, Kubernetes and so on. Fields - image processing, face detection, face recognition, object detection/classification, segmentation.

Résumé/CV: https://drive.google.com/file/d/1RF-eoiC5GMVhJwSvKZsy32bVto5...

Email: hudvin@gmail.com

Last 12 month was working on AI-related startup (image search.)

Interested in remote position in Deep Learning/Computer Vision field.

hudvin · on April 5, 2020

Deutsch some ancient history NLP / Deep Learning

hudvin · on April 1, 2020

Location: Ukraine Remote: Yes!!!

Willing to relocate: someday

Technologies: Deep Learning/Computer Vision (convnets, facenet, image classification, segmentation, opencv, keras, scikit-image etc), Python Backend Stack

CV: https://drive.google.com/open?id=1RF-eoiC5GMVhJwSvKZsy32bVto....

email: hudvin@gmail.com

hudvin · on March 23, 2020

Yes, why not to sell data from dbs, backups, drives? :D

hudvin · on March 23, 2020

Location: Ukraine

Remote: Yes!!!

Willing to relocate: someday

Technologies: Deep Learning/Computer Vision (convnets, facenet, image classification, segmentation, opencv, keras, scikit-image etc), Python Backend Stack

CV: https://drive.google.com/open?id=1RF-eoiC5GMVhJwSvKZsy32bVto...

email: hudvin@gmail.com

hudvin · on March 21, 2020

I am working on search tool for images.

It detects faces, objects, tags, extracts metadata and provides search interface and API. So you can for example find image with "cat and dog near river with some person" I want to build enterprise level image search :)

Google Images, for example, uses in most cases alt text and surrounding content. Google Photos is limited to personal collections.

I have finished prototype and now trying to convert it to startup

Demo https://app.khumbu.im/search/5dff72e66483e25b40e0222e info https://khumbu.im

omarchowdhury · on March 22, 2020

I typed "man jumping on bed" and none of the results match the term, on any of the result pages. Whereas Google Images is very accurate.