stephantul - Hacker News

34

From Chesterton's fence to Chesterton's gap (stephantul.github.io)

7 hours ago stephantul github.io

1

Why scikit learn's fit transform is probably not for you (stephantul.github.io)

3 weeks ago stephantul github.io

4

Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep (github.com/minishlab)

a month ago stephantul github.com

3

Show HN: Semble – Fast code search for agents with near-transformer accuracy (github.com/minishlab)

a month ago stephantul github.com

1

Show HN: Skeletoken, a Python package for editing model tokenizers (github.com/stephantul)

4 months ago stephantul github.com

1

Show HN: PyNIFE. 400-900× speedup for embedding-based retrieval pipelines (github.com/stephantul)

7 months ago stephantul github.com

1

Show HN: Skeletoken, a Package for Editing Tokenizers (github.com/stephantul)

9 months ago stephantul github.com

2

Turning any tokenizer into a greedy one (stephantul.github.io)

10 months ago stephantul github.io

3

Decasing Transformers for Fun (stephantul.github.io)

10 months ago stephantul github.io

4

Model2Vec as a Fasttext Alternative (minish.ai)

11 months ago stephantul minish.ai

2

Using overloads to handle union return types in Python (stephantul.github.io)

a year ago stephantul github.io

2

Ask HN: Favourite resources for learning programming type theory?

a year ago stephantul ycombinator.com

1

Evaluating ML classifiers using relative error instead of absolute accuracy (stephantul.github.io)

a year ago stephantul github.io

1

Defeat stringly typing without making your users unhappy (stephantul.github.io)

a year ago stephantul github.io

5

Distilling ModernBERT into a static model doesn't work (minishlab.github.io)

a year ago stephantul github.io

4

Show HN: SemHash – Fast Semantic Text Deduplication for Cleaner Datasets (github.com/minishlab)

a year ago stephantul github.com

18

Train faster static embedding models with sentence transformers (huggingface.co)

a year ago stephantul huggingface.co

4

Semhash: Fast deduplication and dataset multitool in Python (minishlab.github.io)

a year ago stephantul github.io

5

Model2Vec: Make sentence transformers 500x faster on CPU, 15x smaller (huggingface.co)

a year ago stephantul huggingface.co

6

Show HN: Model2Vec: make sentence transformers 500x faster on CPU, 15x smaller (github.com/minishlab)

a year ago stephantul github.com

3

Show HN: Model2Vec: make sentence transformers 500x faster on CPU, 15x smaller (github.com/minishlab)

a year ago stephantul github.com