The article discusses GPT-4o's process of encoding high-res images into tiles and posits speculative CNN architectures for the transformation. It questions the number 170 used by OpenAI for token counting and suggests strategies such as pyramid strategy for image encoding. Finally, it explores GPT-4o's approach towards OCR and its handling of alpha channels in images.