Type Alias: LlamaContextSequenceRepeatPenalty

type LlamaContextSequenceRepeatPenalty = {
  punishTokens: Token[] | () => Token[];
  maxPunishTokens?: number;
  penalty?: number;
  frequencyPenalty?: number;
  presencePenalty?: number;
};

Defined in: evaluator/LlamaContext/types.ts:198

Properties

punishTokens

punishTokens: Token[] | () => Token[];

Defined in: evaluator/LlamaContext/types.ts:200

Tokens to lower the predication probability of to be the next predicted token

maxPunishTokens?

optional maxPunishTokens: number;

Defined in: evaluator/LlamaContext/types.ts:211

The maximum number of tokens that will be provided in the punishTokens array.

This is used as a hint for a performance optimization for avoiding frequent memory deallocation and reallocation.

Don't set this value too high, as it can allocate too much memory.

Defaults to 64.

penalty?

optional penalty: number;

Defined in: evaluator/LlamaContext/types.ts:219

The relative amount to lower the probability of the tokens in punishTokens by.

Defaults to 1.1. Set to 1 to disable.

frequencyPenalty?

optional frequencyPenalty: number;

Defined in: evaluator/LlamaContext/types.ts:227

For n time a token is in the punishTokens array, lower its probability by n * frequencyPenalty.

Disabled by default (0). Set to a value between 0 and 1 to enable.

presencePenalty?

optional presencePenalty: number;

Defined in: evaluator/LlamaContext/types.ts:235

Lower the probability of all the tokens in the punishTokens array by presencePenalty.

Disabled by default (0). Set to a value between 0 and 1 to enable.

LlamaModel

LlamaModelTokens

LlamaChatSession

LlamaText

GgufInsights

GbnfJsonSchema

ChatHistoryItem

ChatModelResponse

LlamaChatResponse

GgufFileInfo

GgufMetadata

LlamaContextOptions

BatchingOptions

LlamaChatSessionOptions

LLamaChatPromptOptions

Chat Wrapper Options

JinjaTemplateChatWrapperOptions

Type Alias: LlamaContextSequenceRepeatPenalty

Properties

punishTokens

maxPunishTokens?

penalty?

frequencyPenalty?

presencePenalty?

LlamaModelTokens

ChatModelResponse

GgufMetadata

LlamaContextOptions

BatchingOptions

LlamaChatSessionOptions

LLamaChatPromptOptions

JinjaTemplateChatWrapperOptions

Type Alias: LlamaContextSequenceRepeatPenalty ​

Properties ​

punishTokens ​

maxPunishTokens? ​

penalty? ​

frequencyPenalty? ​

presencePenalty? ​

Type Alias: LlamaContextSequenceRepeatPenalty

Properties

punishTokens

maxPunishTokens?

penalty?

frequencyPenalty?

presencePenalty?