Skip to content

Instantly share code, notes, and snippets.

View MaTriXy's full-sized avatar

Yossi Elkrief MaTriXy

View GitHub Profile
@MaTriXy
MaTriXy / ContentView.swift
Created December 25, 2025 11:08 — forked from dkun7944/ContentView.swift
AirDrop iOS 17 Swift.Shader Animation
//
// ContentView.swift
// Airdrop Demo
//
// Created by Daniel Kuntz on 7/30/23.
//
import SwiftUI
struct ContentView: View {
@MaTriXy
MaTriXy / inferal-workspace-architecture.md
Created December 25, 2025 06:22 — forked from yrashk/inferal-workspace-architecture.md
Inferal Workspace Architecture

Inferal Workspace Architecture

Your org's brain that AI can use

This document describes the conceptual architecture of the Inferal Workspace - a text-based, version-controlled knowledge and operations hub designed to replace tools like Notion and Webflow while being natively accessible to AI assistants.

Why We Built This

As an engineering-driven organization, we found ourselves fighting our tools instead of using them. Notion couldn't keep up with how we actually work - context scattered across pages, no version control, and AI that could read but not act. Webflow meant our website lived in a silo, disconnected from our codebase and deployment pipelines. Every tool was another tab, another context switch, another place where knowledge went to die.

@MaTriXy
MaTriXy / SHOWCASE.md
Created October 19, 2025 11:22 — forked from jmanhype/SHOWCASE.md
Agent Learning via Early Experience + ACE Integration - Production Framework for Continuous Agent Learning

Agent Learning via Early Experience + ACE Integration

Production-Ready Framework for Continuous Agent Learning

A complete implementation of reward-free reinforcement learning through world modeling, exploration, and self-reflection, with full ACE (Adaptive Context Engineering) integration for knowledge curation and semantic deduplication.


What Is This?

@MaTriXy
MaTriXy / stress-test-prompt
Created September 6, 2025 05:32 — forked from netanelavr/stress-test-prompt
stress-test-prompt
# API Stress Test
Create a Bash-based stress testing suite that simulates real user interactions with
your API and measures system performance under load.
## Core Script (`api_stress_test.sh`)
- Simulate realistic user flows with sequential API calls (GET, POST, PUT, DELETE)
- DELETE requests should be executed last to clean up test data
- Support bearer token authentication and custom headers
- Base URL should be configurable
@MaTriXy
MaTriXy / MarchingCubesParams.h
Created August 14, 2025 21:33 — forked from Matt54/MarchingCubesParams.h
Marching Cubes Metaballs in RealityKit (Metal + LowLevelMesh)
#ifndef MarchingCubesParams_h
#define MarchingCubesParams_h
#include <simd/simd.h>
struct MarchingCubesParams {
simd_uint3 cells;
simd_float3 origin;
simd_float3 cellSize;
float isoLevel;
simd_float3 centerA;
@MaTriXy
MaTriXy / gpt-5-voice.agent.py
Created August 8, 2025 13:53 — forked from kwindla/gpt-5-voice.agent.py
GPT-5 voice agent single-file example
# export OPENAI_API_KEY=sk_proj-...
# uv run gpt-5-voice-agent.py
# /// script
# dependencies = [
# "numba==0.61.2",
# "openai==1.99.1",
# "python-dotenv",
# "fastapi[all]",
# "uvicorn",
@MaTriXy
MaTriXy / default.md
Created July 9, 2025 03:59 — forked from cablej/default.md
Cluely System prompt

<core_identity> You are an assistant called Cluely, developed and created by Cluely, whose sole purpose is to analyze and solve problems asked by the user or shown on the screen. Your responses must be specific, accurate, and actionable. </core_identity>

<general_guidelines>

  • NEVER use meta-phrases (e.g., "let me help you", "I can see that").
  • NEVER summarize unless explicitly requested.
  • NEVER provide unsolicited advice.
  • NEVER refer to "screenshot" or "image" - refer to it as "the screen" if needed.
  • ALWAYS be specific, detailed, and accurate.
Hi LinkedIn friend!
Here is how I managed to have the agent I'm building reduce token usage.
1. My agent has 62 tools (and growing quickly in terms of number of tools..)
2. Each tool has a description. All in all I was sending the entire 62 tools+description in every agent turn.
It came out to 10k tokens before even the system prompt+user prompt - ON EVERY TURN.
The solution I found was to do a preflight LLM request to select only the relevant tools for the user request.
@MaTriXy
MaTriXy / bot.py
Created March 12, 2025 11:26 — forked from kwindla/bot.py
Cartesia Sonic-2 Language Teacher
import asyncio
import os
import sys
from dataclasses import dataclass
import aiohttp
import google.ai.generativelanguage as glm
from dotenv import load_dotenv
from loguru import logger
from runner import configure
@MaTriXy
MaTriXy / grpo_demo.py
Created January 29, 2025 16:26 — forked from willccbb/grpo_demo.py
GRPO Llama-1B
# train_grpo.py
import re
import torch
from datasets import load_dataset, Dataset
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import LoraConfig
from trl import GRPOConfig, GRPOTrainer
# Load and prep dataset