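It turns out that After Effects can generate layer markers automatically from a clip's XMP metadata, but you must first turn this feature on in your global preferences (the "Create Layer Markers From Footage XMP Metadata" option under Media & Disk Cache).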

This would let you import a Premiere Pro file that carries a speech-to-text transcription and have the markers generated for you automatically.
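
To get a feel for the kind of data involved, here is a rough Python sketch that lists marker entries from an XMP packet. It assumes the markers live in the XMP Dynamic Media (`xmpDM`) namespace under `xmpDM:markers`. The `SAMPLE_XMP` packet and the `list_markers` helper are made up for illustration; real files may serialize things differently, and this is not how After Effects itself reads them.

```python
import xml.etree.ElementTree as ET

# Namespaces used by XMP Dynamic Media metadata and RDF.
XMPDM = "http://ns.adobe.com/xmp/1.0/DynamicMedia/"
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"

# Hypothetical, hand-written packet; not taken from a real clip.
SAMPLE_XMP = """<x:xmpmeta xmlns:x="adobe:ns:meta/">
 <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
          xmlns:xmpDM="http://ns.adobe.com/xmp/1.0/DynamicMedia/">
  <rdf:Description rdf:about="">
   <xmpDM:Tracks>
    <rdf:Bag>
     <rdf:li rdf:parseType="Resource">
      <xmpDM:trackName>Speech</xmpDM:trackName>
      <xmpDM:markers>
       <rdf:Seq>
        <rdf:li rdf:parseType="Resource">
         <xmpDM:startTime>0</xmpDM:startTime>
         <xmpDM:name>hello</xmpDM:name>
        </rdf:li>
        <rdf:li xmpDM:startTime="12" xmpDM:name="world"/>
       </rdf:Seq>
      </xmpDM:markers>
     </rdf:li>
    </rdf:Bag>
   </xmpDM:Tracks>
  </rdf:Description>
 </rdf:RDF>
</x:xmpmeta>"""


def list_markers(xmp_text):
    """Return (startTime, name) string pairs for each marker found."""
    root = ET.fromstring(xmp_text)
    found = []
    for markers in root.iter(f"{{{XMPDM}}}markers"):
        for item in markers.iter(f"{{{RDF}}}li"):

            def field(name, item=item):
                # A field may be a child element or an attribute on the item.
                key = f"{{{XMPDM}}}{name}"
                child = item.find(key)
                if child is not None and child.text:
                    return child.text.strip()
                return item.get(key)

            found.append((field("startTime"), field("name")))
    return found


if __name__ == "__main__":
    for start, name in list_markers(SAMPLE_XMP):
        print(start, name)
```

Start times are returned as the raw strings stored in the packet; turning them into seconds depends on the time base the file uses, which this sketch does not attempt.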
