Skip to content

Instantly share code, notes, and snippets.

View johtani's full-sized avatar

Jun Ohtani johtani

View GitHub Profile
@johtani
johtani / mscoco_search.py
Created December 17, 2024 14:41
MS COCOのデータをWeaviateで検索する画面をStreamlitで構成
from weaviate_connection import WeaviateConnection
import streamlit as st
def main():
collection_name = "MultiModalKagome"
properties_kagome = ["caption_ja"]
properties_gse = ["caption_ja_gse"]
limit = 20
@johtani
johtani / mscoco_search.py
Created December 17, 2024 14:40
MS COCOのデータをWeaviateで検索する画面をStreamlitで構成
from weaviate_connection import WeaviateConnection
import streamlit as st
def main():
collection_name = "MultiModalKagome"
properties_kagome = ["caption_ja"]
properties_gse = ["caption_ja_gse"]
limit = 20
@johtani
johtani / main.go
Created October 29, 2024 01:28
GSE difference between dictionaries
package main
import (
"fmt"
"github.com/go-ego/gse"
)
func main() {
@johtani
johtani / list.orma.json.gohtml
Last active December 12, 2023 09:22
ブログ記事のOrama検索実装の紹介用Gist
{{/* Generates an array of blog contents for indexing orama */}}
{{- $.Scratch.Add "item" slice -}}
{{- $section := $.Site.GetPage "section" .Section }}
{{- range .Site.AllPages -}}
{{- if or (and (.IsDescendant $section) (and (not .Draft) (not .Params.private))) $section.IsHome -}}
{{- if (and (eq .Section "post") .File) -}}
{{- if .Params.Tags -}}
{{- $.Scratch.Add "item" (dict "id" .File.UniqueID "date" .Date.UTC.Unix "dir" .File.Dir "lang" .Lang "lastmod" .Lastmod.UTC.Unix "permalink" .Permalink "publishdate" .PublishDate "title" .Title "tags" .Params.Tags "summary" .Summary "contents" .Plain)}}
{{- else -}}
{{- $.Scratch.Add "item" (dict "id" .File.UniqueID "date" .Date.UTC.Unix "dir" .File.Dir "lang" .Lang "lastmod" .Lastmod.UTC.Unix "permalink" .Permalink "publishdate" .PublishDate "title" .Title "summary" .Summary "contents" .Plain)}}
@johtani
johtani / sample.json
Created July 20, 2020 04:17
joinで子供をハイライト
PUT join_highlight
{
"mappings": {
"properties": {
"my_id": {
"type": "keyword"
},
"my_join_field": {
"type": "join",
"relations": {
@johtani
johtani / demo.json
Created June 18, 2020 15:37
Elasticsearch勉強会のデモで利用したDevConsoleのスクリプト
## version 7.9で動作するはずです。 discard_compound_tokenの設定は7.8までは動作しないです(それ以外については動作します。)
DELETE en_synonym_test
PUT en_synonym_test
{
"settings": {
"analysis": {
"analyzer": {
"en_synonym": {
@johtani
johtani / words.txt
Last active June 12, 2020 14:56
UniDicに出てくる、Kuromojiが区切り文字と判断する文字が含まれる単語たち
start with punctuation: [606]. one char is [384]
all punctuations: [111]
hasPunctuations: [1780]
+++++++++++++++++ Start with Punctuation +++++++++++++++++++++
¡
¢
£
¤
¦
§
@johtani
johtani / gist:50aa2776a385c5c8dfa3a0d1e4e268cd
Last active June 12, 2020 14:42
ipadicに出てくる、Kuromojiが区切り文字と判断する文字が含まれる単語の一覧
start with punctuation: [101]. one char is [97]
all punctuations: [3]
hasPunctuations: [723]
+++++++++++++++++ Start with Punctuation +++++++++++++++++++++
@johtani
johtani / 1_辞書の更新方法_ファイル編.json
Last active April 27, 2020 03:37
「辞書の更新についての注意事項」ブログの補足手順など
## 0. インデックスの準備
### 再現可能なようにまず削除
DELETE file_dic_sample
### 辞書ファイルの用意
ファイル名 : custom_dic.txt
配置場所 : ES_PATH_CONF/analyzer/custom_dic.txt
内容
```
@johtani
johtani / .zshrc
Created December 17, 2019 07:09
dot_zshrc_20191217
## For rewrite prezto default settings
EDITOR=vim
VISUAL=vim
### aliases
alias ls='ls -aG'
alias vscode='open -a ~/Applications/Visual\ Studio\ Code.app '
### For ant
export ANT_OPTS=-Dfile.encoding=UTF-8