Author here. Yes, this is a big feature I've been working on, should be ready fo...

liotier · on Sept 24, 2024

Derushing in general is the most time-consuming part, so not only language pattern recognition but also image recognition would help: "From the rushes, extract all the sequences with bicycle crashes to give me a pile of clips to use in my edit"!

burningion · on Sept 24, 2024

Yes, agreed.

I film a bunch of skateboarding, and it can take tens of tries to land a trick. Similarly, there's usually a unique sound that signals a trick was finally landed.

Good multi-modal search and discovery is a huge part of cracking the editing problem.

liotier · on Sept 24, 2024

Looks like https://kino.ai addresses that derushing stage, but as a specialized tool rather than as a function inside a video editor - which makes a lot of sense to me.

trinix912 · on Sept 24, 2024

Tens? It sometimes takes my crew hundreds of tries (all on DV tapes).

How far have you been able to get with search for trick variations? It would be interesting to see a system that can reliably recognize what’s switch, nollie vs fakie, etc. Then have it generate a list of all tricks for each skater and perhaps outstanding fails. Just some thoughts.

sitkack · on Sept 24, 2024

Detect the cheer everyone makes when the trick lands. Lots of proxy indicators to key off of.
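
A rough sketch of the loudness-proxy idea, assuming the clip's audio has already been decoded into a list of mono float samples. The function name `find_loud_moments`, the half-second window, and the 4x ratio are all made up for illustration, and a real cheer detector would likely need something smarter than raw loudness:

```python
import math
from statistics import median


def rms(window):
    """Root-mean-square level of one window of samples."""
    return math.sqrt(sum(x * x for x in window) / len(window))


def find_loud_moments(samples, sample_rate, window_s=0.5, ratio=4.0):
    """Return start times (in seconds) of windows much louder than the clip's median level.

    Hypothetical helper: it flags any non-overlapping window whose RMS level
    exceeds `ratio` times the median RMS of all windows in the clip.
    """
    size = max(1, int(sample_rate * window_s))
    # Split into non-overlapping windows, dropping a trailing partial window.
    windows = [samples[i:i + size] for i in range(0, len(samples) - size + 1, size)]
    levels = [rms(w) for w in windows]
    if not levels:
        return []
    # Guard against an all-silent clip, where the median would be zero.
    baseline = max(median(levels), 1e-9)
    return [i * size / sample_rate
            for i, level in enumerate(levels)
            if level > ratio * baseline]


if __name__ == "__main__":
    # Synthetic clip: 10 s of quiet noise floor with a loud burst at 6.0 s.
    sample_rate = 100
    samples = [0.01] * (10 * sample_rate)
    for i in range(600, 650):
        samples[i] = 1.0
    print(find_loud_moments(samples, sample_rate))
```

The timestamps it returns would only be candidates to review, since a bail or a passing truck can be just as loud as a landed trick.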

nashashmi · on Sept 24, 2024

> I allude to it

And that’s why I read the comments to see if anyone mentioned it.

To be able to literally take the source files used to put the video together and edit each piece individually would be great.

I wanted to create a car driving down a road covered in arches of greenery. I got lots of great options but I wanted a particular combination of options. If I could do something like that with video, that would be terrific.