---
library_name: transformers
license: apache-2.0
datasets:
- TFLai/Turkish-Alpaca
language:
- tr
---

# Model Card: SykoLLM-V2.1-Turkish-Instruct

SykoLLM-V2.1-Turkish-Instruct is a custom-architected, lightweight Large Language Model (LLM) designed specifically for Turkish conversational tasks. Unlike standard pre-built models, this version features a custom configuration optimized for speed and efficiency in low-resource environments.

## Model Description

* **Developed by:** syko818121
* **Model Name:** SykoLLM-V2.1-Turkish-Instruct
* **Model Type:** Causal Decoder-Only Custom Architecture
* **Language:** Turkish
* **Parameters:** ~95.7 Million
* **Training Data:** Turkish Wikipedia + Custom High-Quality Chat Dataset

## Fine-Tuning & Conversation Style

The model was fine-tuned on a curated, high-quality Turkish dataset to produce natural, human-like responses. The training data distribution was carefully balanced:

* **Greetings & Daily Talk (40%):** Natural openings and casual conversation.
* **Direct Question-Answering (30%):** Short, concise answers to general-knowledge queries.
* **Brief Explanations (20%):** Simplified definitions of complex concepts.
* **Slang & Short Inputs (10%):** Robustness against one-word or incomplete messages.

## Usage

You can load and test SykoLLM-V2.1-Turkish-Instruct with the following snippet:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "SykoLLM-V2.1-Turkish-Instruct"  # use the full Hub path if loading from the Hub

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

prompt = "Selam, naber?"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50, pad_token_id=tokenizer.eos_token_id)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

## Training Configuration

* **Learning Rate:** 5e-5
* **Scheduler:** Cosine

A minimal sketch of a matching fine-tuning setup is included at the end of this card.

## Limitations

* **Size:** At ~95.7M parameters, this is a "mini-LLM." It excels at short chats but may hallucinate on highly complex logical tasks.
* **Response Length:** The model is intentionally biased toward concise, direct answers rather than long-form essays.

---
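
For reference, below is a minimal sketch of a fine-tuning setup consistent with the training configuration above (learning rate 5e-5, cosine scheduler), assuming the standard Hugging Face `Trainer`. The dataset columns, prompt packing, batch size, epoch count, and output directory are illustrative assumptions, not the actual training recipe.

```python
# Fine-tuning sketch. Only the learning rate and scheduler come from this card;
# every other value (dataset schema, batch size, epochs, paths) is an assumption.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_id = "SykoLLM-V2.1-Turkish-Instruct"  # local checkpoint or Hub path
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

# Ensure a pad token exists for batching (common for GPT-style tokenizers).
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

# Assumed dataset: the TFLai/Turkish-Alpaca set listed in the card metadata,
# assumed to follow the Alpaca "instruction"/"output" schema.
dataset = load_dataset("TFLai/Turkish-Alpaca", split="train")

def tokenize(example):
    # Assumed prompt format: instruction followed by the expected output.
    text = f"{example['instruction']}\n{example['output']}"
    return tokenizer(text, truncation=True, max_length=512)

tokenized = dataset.map(tokenize, remove_columns=dataset.column_names)

args = TrainingArguments(
    output_dir="./sykollm-finetune",   # assumed
    learning_rate=5e-5,                # from the card
    lr_scheduler_type="cosine",        # from the card
    per_device_train_batch_size=8,     # assumed
    num_train_epochs=3,                # assumed
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```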