Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Author here.

Yes, this is a big feature I've been working on, should be ready for a beta by the end of the month.

I allude to it in the post, but good search (for editing) is a challenge, and necessitates a mix of embeddings/vector search and text models.



Derushing in general is the most time consuming, so not only language pattern recognition but also image recognition: "From the rushes, extract all the sequences with bicycle crashes to give me a pile of clips to use in my edit" !


Yes, agreed.

I film a bunch of skateboarding, and it can take tens of tries to land a trick. Similarly, there's usually an unique sound that signals a trick was finally landed.

Good multi-modal search and discovery is a huge part of cracking the editing problem.


Looks like https://kino.ai addresses that derushing stage, but as a specialized tool rather than as a function inside a video editor - which makes a lot of sense to me.


Tens? It sometimes takes my crew hundreds of tries (all on DV tapes).

How far have you been able to come with search for trick variations? It would be interesting to see a system that can reliably recognize what’s switch, nollie vs fakie etc. Then have it generate a list of all tricks for each skater and perhaps outstanding fails. Just some thoughts.


Detect the cheer everyone makes when the trick lands. Lots of proxy indicators to key off of.


> I allude to it

And that’s why I read the comments to see if anyone mentioned it.

To be able to literally take the source files used to put the video together and edit each piece individually would be great.

I wanted to create a car driving down a road covered in arches if greenery. I got lots of great options but I wanted a particular combination of options. If I could do something like that with video, that would be terrific




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: