Gordon's Tech: AI


Thursday, October 09, 2025

Vibe coding a python script to create a plain text file with my macOS Photos (Aperture) folder hierarchy

My single greatest Photos.app frustration (I have many) is the inability to search the folder hierarchy. Mine contains hundreds of semantically important folder names, and the hierarchy itself is also meaningful. That meaning was lost in the catastrophic Aperture to Photos migration.

This morning I had an hour free so I asked an ai about available utilities and workarounds. It said there are really no good options, but the Python osxphotos module might be able to traverse the folder hierarchy.

I have dabbled in minor Python coding and I have a half-baked Visual Studio Code environment. So I asked Claude 4.5 in Perplexity (this is not a formally supported coding environment) to write me a script that would use osxphotos to build a text file representation of the hierarchy. I ran whatever it generated.

It took 4-5 tries. I never edited the code myself. The first time there were copious errors; I described the errors and requested a redo. The next two times there were fewer errors, but I only got the top level of the hierarchy. The ai added debug code. It took two more tries of running and reporting errors to get a script that generated the text file I wanted (example):

[Teams and Orgs / MN Special Hockey / MNSH 2006 pre-season] MNSpecialHockey_060317
[Teams and Orgs / MN Special Hockey / MNSH 2022-2023] MSH 2023 Printed
[Teams and Orgs / MN Special Hockey / MNSH Woodbury 2019-2020] MSH Portraits Jan 2020
[Teams and Orgs / MN Special Hockey / MNSH 2008-2009] Nov 2008 MN SH Section 108 Event
[Teams and Orgs / MN Special Hockey / MNSH 2021-2022] Portraits MNSH Woodbury 2022

This is the most personally valuable code I have ever "produced" since my days of writing the "medtrans" C program to turn 1990s MEDLINE output into tab delimited importable text.

And I wrote none of it.

I'll be cleaning it up and refining it, but below is the code I have today. It also included album names within a containing folder - I didn't want that but now I find it useful so I'll leave it.

Code

#!/usr/bin/env python3
"""
Export Photos folder and album hierarchy to a text file with tab indentation.
Requires: pip install osxphotos
"""

import osxphotos
from pathlib import Path

def export_folder_hierarchy(output_file="photos_folders.txt"):
    """
    Export Photos library folder/album structure to a text file.
    
    Args:
        output_file: Path to output text file (default: photos_folders.txt)
    """
    # Initialize connection to Photos library
    photosdb = osxphotos.PhotosDB()
    
    # Get folders and albums
    folders = photosdb.folder_info
    albums = photosdb.album_info
    
    print(f"Found {len(folders)} folders and {len(albums)} albums")
    
    # Build maps for folders
    folder_map = {f.uuid: {'obj': f, 'children': [], 'type': 'folder'} for f in folders}
    
    # Add albums to the structure
    for album in albums:
        # Albums have folder_names property which is a list of folder names in the path
        if hasattr(album, 'folder_names') and album.folder_names:
            # Try to find the parent folder by matching folder names
            # folder_names is a list like ['Top Folder', 'Sub Folder']
            # We want to match the first (top-level) folder name
            top_folder_name = album.folder_names[0] if album.folder_names else None
            
            if top_folder_name:
                # Find folder with matching title
                parent_folder = None
                for folder in folders:
                    if folder.title == top_folder_name:
                        parent_folder = folder
                        break
                
                if parent_folder and parent_folder.uuid in folder_map:
                    folder_map[parent_folder.uuid]['children'].append({
                        'obj': album,
                        'type': 'album',
                        'title': album.title,
                        'folder_path': ' / '.join(album.folder_names) if album.folder_names else ''
                    })
    
    # Recursive function to write hierarchy
    def write_item(f, item_data, level=0):
        """Write item and its children with proper indentation."""
        indent = '\t' * level
        
        if item_data['type'] == 'folder':
            folder = item_data['obj']
            f.write(f"{indent}{folder.title}/\n")
            # Sort children alphabetically
            children = sorted(item_data['children'], key=lambda x: x.get('title', x.get('obj').title).lower())
            for child in children:
                if child['type'] == 'album':
                    # Write album with its folder path if nested
                    folder_path = child.get('folder_path', '')
                    if folder_path:
                        f.write(f"{indent}\t[{folder_path}] {child['title']}\n")
                    else:
                        f.write(f"{indent}\t{child['title']}\n")
                else:
                    write_item(f, child, level + 1)
    
    # Find root folders
    root_folders = [folder_map[f.uuid] for f in folders if f.parent is None]
    root_folders = sorted(root_folders, key=lambda x: x['obj'].title.lower())
    
    # Write to output file
    with open(output_file, 'w', encoding='utf-8') as f:
        f.write("Photos Library Folder Hierarchy\n")
        f.write("=" * 50 + "\n\n")
        
        for root in root_folders:
            write_item(f, root)
    
    print(f"Folder hierarchy exported to: {Path(output_file).absolute()}")
    print(f"Total top-level folders: {len(root_folders)}")

if __name__ == "__main__":
    export_folder_hierarchy()
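
Since the whole point of the export is search, here is a minimal sketch of the two pieces that matter for that: formatting one line per album with its full folder path, and filtering those lines by a search term. It is plain Python with no osxphotos dependency. The helper names format_album_line and search_lines are hypothetical (they are not part of the script above), and the sample data just reuses names from the example output.

```python
def format_album_line(folder_names, title):
    """Return '[A / B / C] title', or just the title for an album with no folder."""
    if folder_names:
        return f"[{' / '.join(folder_names)}] {title}"
    return title


def search_lines(lines, term):
    """Return the lines that contain term, ignoring case."""
    needle = term.lower()
    return [line for line in lines if needle in line.lower()]


# Sample data standing in for (album.folder_names, album.title) pairs.
sample = [
    (["Teams and Orgs", "MN Special Hockey", "MNSH 2022-2023"], "MSH 2023 Printed"),
    (["Teams and Orgs", "MN Special Hockey", "MNSH 2008-2009"], "Nov 2008 MN SH Section 108 Event"),
    ([], "Unfiled album"),
]
lines = [format_album_line(names, title) for names, title in sample]
for hit in search_lines(lines, "2008"):
    print(hit)
```

The same search_lines call would work on the lines of the exported text file once it is read into a list, though opening the file in any text editor and using Find gets you most of the way there.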

Monday, August 04, 2025

Apple's ai opportunity is context

I use Perplexity as my $20/m answer engine -- and for generic ai tasks. I don't want Perplexity as a long-term ai provider, but I do like being able to experiment with different models. Currently all the leading not-free models are pretty good, but some are more sycophantic than others. I dislike sycophancy; ChatGPT and Sonnet have less of it than Gemini but all are too eager to please (prompts help but one has to be careful not to reveal a preference for a particular response).

For me, at this time, all of the models work significantly better with quite a bit of context. In Perplexity that context is provided in Spaces. Spaces include reference material and the model/prompt settings for the Space.

The great mass of people are not going to do that sort of context work. So vendors are trying to answer questions and apply (lower cost) models without context. Meanwhile they try to scrape together a lot of knowledge about the user from whatever source they get.

Apple's opportunity is they can assemble a lot of context. In my case GBs of information on my main drive, not to mention my calendars, contacts, notes and so on. Apple could ask questions to provide a general default context, such as preference for sycophancy, references to use, web resources, textbooks and so on.

I consider Apple to be a broken company. I don't think they will be able to get their ai act together under Cook. But if they can, they do have advantages.

Sunday, July 20, 2025

Tip: Let your ai tell you what's new and novel in an iOS or macOS release

I like to wait a month (iOS) or six months (macOS) before applying major updates. By the time I apply them all the useful tips and tricks I read along the way are ancient history.

Instead of trying to keep track of these things before the OS is installed, wait until you are ready to pull the trigger. Then ask your ($20/m) ai to summarize known issues and interesting new features, tips and tricks. You can provide context as needed (ex: I am an expert user, etc).

PS. Apple got away from providing PDF versions of manuals and user guides -- but if they still did that I'd drop the PDFs into my Perplexity macOS Space.

Tuesday, February 27, 2024

Extracting core concepts with ChatGPT 4 from OCR of scanned sample examination PDF - Feb 2024

I think this is an interesting example of what works and doesn't work on the personal AI front in early 2024.

My son was given a printed practice exam in microeconomics. I wanted ChatGPT 4 to extract and summarize the core concepts. This turned out to require two steps, one of which only worked with Google.

Step One: OCR and download text file

I scanned the document in ScanSnap and produced a scanned PDF. I tried getting ChatGPT to do the OCR but it abandoned that task. I then tried Gemini and it told me it didn't do OCR. Next I tried Microsoft Lens, but it seemed to only do OCR from a local image; I couldn't see how to use it with a OneDrive PDF. ChatGPT claimed that I could open a OneDrive PDF in Office 365 Word but that did not work with the web version (perhaps it works with full Word?). ChatGPT did not know of a way to do PDF OCR on Sonoma.

The only thing that worked was Google Drive. It allowed me to open the PDF in Google Docs and then export a .txt version.

Step Two: ChatGPT 4 analysis

I asked ChatGPT 4 to extract the key concepts from the .txt file. It provided a plausible set and then proceeded to answer some of the exam questions. Concepts captured were:

... equilibrium price, consumer's surplus, producer's surplus, total surplus, efficient output levels, negative externalities, deadweight loss (DWL), price ceilings, and the impact of taxes on market outcomes ...

I don't think it added much to the textbook chapter topic list but it did provide a plausible set of topics to emphasize in my son's studying. I was primarily interested in the workflow today. It will be interesting to look back on this in a year and see what's different.

Saturday, January 06, 2024

Rendering ChatGPT output in readable form in a Jupyter Notebook

Update: This post is still useful, but there's also a way to enable line wrap in Visual Studio Code's Jupyter extension. You can also use the Python print function instead of the display example in my original post. For example:

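from openai import OpenAI

# Assumed setup: the original cell relied on a client created earlier in the notebook.
client = OpenAI()  # reads OPENAI_API_KEY from the environment

output = client.completions.create(
    model="gpt-3.5-turbo-instruct",
    prompt="List the days of the week: ",
    max_tokens=100,
    stop="Saturday",  # put this in for fun
)
print(output.choices[0].text)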

Note these print parameters are specific to this particular object's structure. (I think JSON but I'm a newbie.)
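
For what it's worth, in the current openai Python library the call returns a nested Python object rather than raw JSON text, which is why the print line walks it with an attribute, a list index, and another attribute. A minimal sketch with stand-in classes shows the shape. FakeChoice and FakeCompletion are hypothetical; they only mimic the two fields used above and are not the library's real classes.

```python
from dataclasses import dataclass, field


@dataclass
class FakeChoice:
    # Stand-in for one entry in the response's choices list.
    text: str
    index: int = 0


@dataclass
class FakeCompletion:
    # Stand-in for the response object; only the choices field is modeled.
    choices: list = field(default_factory=list)


output = FakeCompletion(choices=[FakeChoice(text="\nSunday\nMonday\nTuesday")])

# Same access pattern as the real call: attribute, list index, attribute.
first_text = output.choices[0].text
print(first_text)
```

Because the text is an ordinary Python string by the time print sees it, the \n characters become real line breaks instead of showing up as literal \n.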

Original below

--------

This was a bit of a revelation. I don't know Python but I've been working through a ChatGPT / LLM tutorial using Visual Studio Code and a Jupyter Notebook on macOS. In a Jupyter cell the output renders below the cell and it looks like this:

Completion(id='cmpl-....', choices=[CompletionChoice(finish_reason='stop', index=0, logprobs=None, text='1...

All in one unreadable line with \n as a paragraph delimiter and no line wrap.

I asked ChatGPT 4 to help. Over a series of interactions I tried different things and got various error messages I passed to ChatGPT 4. In turn it analyzed my error message and suggested fixes.

This is what I ended up with in about 15 minutes, here added to a cell that ran a simple prompt query:

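from IPython.display import display, HTML
from openai import OpenAI

# Assumed setup: the original cell relied on a client created earlier in the notebook.
client = OpenAI()  # reads OPENAI_API_KEY from the environment

output = client.completions.create(
    model="gpt-3.5-turbo-instruct",
    prompt="write me a poem",
    max_tokens=100,
    n=3,  # three completions are requested, but only the first is shown below
)
# Take the first completion's text, or an empty string if none came back
text_content = output.choices[0].text if output.choices else ""
# Swap newlines for HTML line breaks so the notebook renders them
html_output = text_content.replace('\n', '<br>')
display(HTML(html_output))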

This is what the output looks like now (the poetry is greeting card quality and mildly painful):

A poem, a weave of words and rhyme
A tapestry of thoughts and time
A magic spell from the poet's pen
A story of love, of loss, of when

The stars above, they guide my hand
As I write of distant lands
Of fiery sunsets and ocean tides
Of moments we hold and let slip by ...

[Adolescent poetry truncated]

This screenshot shows it best ...
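
The cell above asks for three completions (n=3) but displays only the first. A minimal sketch, assuming you want to see all of them, builds one HTML string from a list of texts. It also escapes the text first so that stray < or & characters in model output are not treated as markup. choices_to_html is a hypothetical helper name, and the sample strings stand in for the choice.text values.

```python
import html


def choices_to_html(texts):
    """Escape each text, turn newlines into <br>, and separate entries with <hr>."""
    blocks = [html.escape(t.strip()).replace("\n", "<br>") for t in texts]
    return "<hr>".join(blocks)


sample_texts = ["\nRoses are red\nViolets are blue", "A <short> poem & more"]
html_output = choices_to_html(sample_texts)
print(html_output)
```

In the notebook this could be passed to the same display call used earlier, for example display(HTML(choices_to_html([c.text for c in output.choices]))).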

We are in a new world.